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Institutional Panethnicity: 
Boundary Formation in Asian-American Organizing 
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Abstract 

In the wake of the civil rights movement, new organizations formed which were based on 
the collective interests and identities of their constituencies. Some of these organizations 
brought together national origin groups who often differed by ethnicity, language, 
culture, religion and immigration history. In this paper, I focus on the conditions that 
facilitate the institutionalization of a socially constructed panethnic community. Using 
a new longitudinal data set of Asian American organizations, I draw upon a theory of 
panethnicity which emphasizes the structured relations between groups at different 
boundary levels to understand panethnic organizational foundings. When controlling for 
resource availability, political opportunities and organizational dynamics, the boundary 
formation variables remain important in explaining new organizational activity. 


Since the 1970s, organizational activity based on a collective identity derived 
from members' common interests and experiences has become an increasingly 
prevalent option for social change efforts. The organizational forms that emerged 
during the Civil Rights era illuminate the new ways in which groups not only 
fulfilled the social and economic needs of their respective communities, but 
demonstrate how ethnic minorities were able to institute collective identities 
and provide a framework for the articulation of group interests. Some of these 
organizations served and brought together national origin groups who often 
differed by language, culture, religion and immigration history. Based on the 
socially constructed notion of a panethnic community, organizations such as 
the National Council of La Raza, Asian Pacific American Labor Alliance and the 
National Urban Indian Development Corporation played an important role in 
securing civil rights for their respective communities by providing services and 
advocacy for different ethnic and tribal communities under a unified panethnic 
framework while contributing to the creation of new cultural and political 
identities. Recently, such organizations have been successful in campaigns 
against anti-immigrant legislation, unfair wages, poor working conditions and the 
lack of affordable housing and economic opportunities. Under what conditions 
are national-origin groups who differ by language, culture, collective history, 
religion and immigration status able to construct a panethnic identity upon which 
to form a larger collective? What conditions influence the institutionalization of a 
socially constructed panethnic community? 


I thank Eric Grodsky, Mignon Moore and Kimberlee Shauman for their advice on earlier 
drafts of this paper. This research was supported by the Visiting Scholars Program at the 
Russell Sage Foundation in 2004-05 and a National Science Foundation Dissertation 
Improvement Grant (SES-9900774). Direct correspondence to Dina G. Okamoto, 
University of California-Davis, Department of Sociology, One Shields Avenue, Davis, CA 
95616. E-mail: dgokamoto@ucdavis.edu. 
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Contemporary research on panethnicity suggests that polítical policies and 
mainstream institutions influence the construction of group boundaries by 
increasing the symbolic and material value of racial and ethnic identities. Some 
scholars argue that government agencies view racial groups such as Asian 
Americans, Native Americans and Latinos as “a single administrative unit” when 
distributing resources (Espiritu and Ong 1994). Imposing a racial structure on 
minority communities has led groups to use panethnicity as a strategy to secure 
federal and state funding as well as employment benefits. In his study of Latino 
mobilization, Padilla (1985) finds that Mexicans and Puerto Ricans organized 
collectively and created new panethnic organizations in Chicago during the 
1980s to combat overt language discrimination in the labor market, using equal 
opportunity and affirmative action policies to make their claims. Similarly, Nagel 
(1995) finds that Native Americans organized across tribal lines during the same 
period as a response to governmental policies that treated groups racially. 

Other scholars emphasize the role of external threats in the creation 
and strengthening of group boundaries. When under threat in the form of 
ethnic prejudice, discrimination or violence, ethnic group members tend to 
see themselves as having interests in common with others who have similar 
characteristics, and they build group solidarity in order to protect themselves. 
According to Portes (1984), ethnic enterprises were created in response to 
perceived hostility from the host society directed at new immigrants. Similarly, 
Espiritu (1992) finds that after the 1982 Vincent Chin incident — when two Detroit 
men mistook a Chinese American man for a Japanese national and beat him to 
death with a baseball bat — Asian Americans engaged in "reactive solidarity" by 
forming panethnic organizations to monitor, report and protest anti-Asian violence. 
In their study of the rise and fall of ethnic newspapers in the early 20th century, 
Olzak and West (1991) discovered that ethnic conflict against white immigrants 
encouraged the formation of ethnic organizations. In sum, the emergence of 
panethnic boundaries is viewed as a reaction to external threats manifested in 
the form of racial prejudice and discrimination. 

Clearly, political policies and external threats are significant factors that 
influence the formation of panethnic organizations despite differences in national 
origin, culture, language and generation. The studies reviewed above point to 
racialization as the driving force leading national origin groups to cross boundaries 
and collectively organize. In other words, as the dominant group or the larger 
society views and treats Asian Americans, Native Americans, African Americans 
and Latinos racially, not making distinctions based on nativity, generation, tribal 
affiliation or national origin, these groups respond by asserting a panethnic 
identity and constructing a common culture that is facilitated by common 
experiences vis-à-vis the dominant group (Cornell 1990, Espiritu 1992). Studies 
on panethnicity tend to focus on the importance of racialization by others in the 
construction of panethnic identities, but such work has less to tell us about the 
structural conditions that influence ethnic groups to organize collectively along 
panethnic lines. In other words, even if the state imposes a racial structure on 
groups or if ethnic groups experience discrimination and racism, panethnicity 
does not always occur. | argue that local, structural conditions play an important 
role in facilitating or discouraging panethnic activity. 


Institutional Panethnicity ° 3 


Building on past research, this article contributes to the empirical and theoretical 
literature by examining the conditions that encourage the institutionalization of a 
socially constructed notion of panethnic community. Using a new longitudinal data 
set of Asian American organizations, | apply a theory of panethnicity to organizational 
formation, recognizing that the layering of ethnic identities can complicate the extent 
to which different ethnic or national origin groups can act collectively. | use this 
theoretical framework to focus on how structural conditions based on ethnic and 
racial boundaries can promote or deter panethnic collective action. 


Organizations as Mobilizing and Identity Structures 


Organizations based on collective identities not only provide a structural 
context for the development of a common culture and identity, but they also 
can be conceptualized as “mobilizing structures” (McCarthy 1996). By bringing 
together groups of potential participants, organizations facilitate solidarity, 
communication and network ties, which in turn contribute to potential collective 
action. These mobilizing structures include formal organizations such as social 
movement groups that mobilize a constituency with the political goal of obtaining 
a collective good. Voluntary organizations are also mobilizing structures because 
even though they are not primarily aimed at movement mobilization, they may 
work on behalf of the movement or provide social locations where mobilization 
may be generated (Kriesi 1996). Still other organizations that provide services 
for group members strengthen communities by increasing self-sufficiency and 
education. Not only do these groups construct effective ideologies and social 
environments that encourage commitment, they also attract material resources 
that contribute to the success of their organizational goals. 

In this paper, | consider organizations to be panethnic when the membership 
or community served is comprised of culturally and linguistically diverse national 
origin groups that are often seen as homogeneous by outsiders (Lopez and 
Espiritu 1990). Participation in a panethnic organization can lead an individual to 
develop a panethnic consciousness - seeing himself or herself as part of a larger 
group whose members have similar interpretations of personal experiences and 
the larger society (Espiritu 1992, Kibria 2002). In addition, given that panethnic 
organizations attempt to extend the civil, political and economic rights of a 
larger collective, the formation of panethnic organizations not only measures the 
extent to which a constructed category becomes institutionalized, but it is also 
an indicator of collective action and increasing movement strength. Even though 
all ethnic group members may not participate in the planning, support and 
implementation of a panethnic organization — much like the selective participation 
of group members in protests and demonstrations — collective work across 
ethnic boundaries by a segment of the Asian American population represents a 
remarkable collective effort by several different ethnic communities. 


Panethnic Organizations in the Asian American Community 


Beginning in the late 1800s and early 1900s, Asian ethnic communities formed 
traditional associations which served the social and economic needs of 
immigrants, offering them protection and aid, and even negotiating disputes 
among members. The district and family associations of the Chinese community 
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were part of the most extensive organizational structure of al the Asian ethnic 
groups (Chan 1991). Japanese organizations during this period were based on 
prefectural lines and provided economic aid and social events for the community 
(Daniels 1988). Soon after immigrating to the United States, Filipinos, Koreans 
and Asian Indians also organized on the basis of common location within a home 
province or along religious lines. 

As early as the 1850s, Chinese community leaders formed an organization 
that was based on national identity instead of clan and kin. In response to the 
growing hostility directed towards Chinese immigrants, the Chinese Consolidated 
Benevolent Association (CCBA) was born. The CCBA directly governed the 
different district associations and functioned as a benevolent and protective 
organization, often representing the Chinese community to outsiders (Lai 1987). 
After the Gentleman's Agreement was forged in 1908, at the height of the anti- 
Japanese movement, the Japanese Association of America (JAA) was founded 
to protect the rights of Japanese immigrants. All Japanese residents, despite 
their village and prefecture differences, belonged to the organization (Daniels 
1988). The Korean National Organization was formed in 1909 in San Francisco 
by the merger of several organizations with the objectives of promoting the 
economic development of Korean immigrant communities and working for the 
independence for Korea (Takaki and Stefoff 1994). Like the JAA and CCBA, all 
Korean immigrants were required to join the organization and pay dues which 
supported the Korean newspaper and patriotic fund. For Filipinos, after the 1930 
Watsonville riots where hundreds of Filipinos were attacked by white mobs, 
most associations, including the Filipino Community Center in New York, were 
established on the basis of ethnicity rather than kin or province (Cordova 1983). 

Economic and demographic changes after World War ll contributed to the 
further erosion of the traditional forms of organization within the respective Asian 
ethnic communities (Kwong 1987). Voluntary associations and government- 
funded agencies emerged to provide aid and services to the Asian population. 
In addition, with the educational, spatial and marital assimilation of each passing 
generation, ethnic group members had different needs and interests from those 
of the past. New organizations based on a panethnic boundary were formed to 
serve the needs of the collective Asian American community. 

The first pan-Asian organization, Asian American Political Alliance (AAPA), was 
formed in 1968 in the wake of the San Francisco State strike where students from 
different racial and ethnic backgrounds protested against the lack of diversity in 
the university curriculum and demanded ethnic studies programs (Wei 1993). The 
AAPA brought Asian national origin groups together to increase the effectiveness 
and political visibility of their activism. During this period, panethnic organizations 
flourished on university campuses as Asian American students protested against 
the Vietnam War and the imperialist nature of U.S. participation. 

The pan-Asian concept was not only used on college campuses, but it began to 
be used extensively by professional and community spokespersons to lobby and 
organizeforthe welfare and rights of Asian Americans. The strategy of forming pan- 
Asian coalitions and associations was often used because government agencies 
preferred to fund "multi-ethnic" rather than single-ethnic organizations. Awarding 
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funds to pan-Asian organizations avoided any charge of favoritism toward certain 
Asian ethnic communities (Espiritu 1992, Kimura 1990). But even with funding as 
an incentive, building panethnic organizations was often problematic. Vo's (1996) 
case study of a panethnic organization in San Diego illustrates how economic 
and generational changes within the Asian American community complicated 
the development of a successful pan-Asian organization. In addition, Ong and 
Espiritu (1994) document conflicts between Asian ethnic groups surrounding 
leadership and equal representation in panethnic organizations. 

The pan-Asian organizations that emerged after the Civil Rights era were 
diverse in goals and strategies, but ultimately served to create, sustain and 
expand upon the idea of an Asian American community (Lien 2001, Wei 1993). 
These organizations lobbied against discriminatory immigration legislation, 
brought media attention to racial profiling, and pursued employment issues that 
relate to Asians and other ethnic minorities, while providing a critical context 
for the development of networks and relationships across Asian national origin 
boundaries, and for the construction and dissemination of an Asian American 
history and culture (Kibria 1997). 


Past Research on Organizational Formation 


Organizational foundings have been the focus of scholars who seek to understand 
the conditions that influence the organizing of social movements across the 
United States, whether they be associated with ethnic minorities seeking fair 
representation and access to resources or militia group members who believe 
that citizen action is necessary for individual and group survival (Minkoff 1995, 
Van Dyke and Soule 2002). Contextual factors, such as resources, have been 
important predictors for the founding of voluntary and social movements 
organizations. McCarthy et al. (1988) find that population size and growth, in 
addition to community resources in the geographic area, such as high levels of 
education and socioeconomic status, increased the founding rate of local citizen 
action groups against drunk driving throughout the United States. Examining the 
foundings and failures of newspapers, Olzak and West (1991) find that ethnic 
conflict has differing effects for immigrants and African Americans: attacks 
against immigrants spur the formation of immigrant newspapers, but attacks 
against African .Americans dampen the rate of African American newspaper 
foundings. Given that the ethnic violence directed toward African Americans was 
more frequent and intense than attacks against immigrants, the authors suggest 
that the resource base of the African American community was undermined, 
making it difficult for them to fend off attacks and create new organizations. 
Thus, studies on organizational formation have found support for the resource 
mobilization perspective, which generally suggests that the availability of and 
access to economic and organizational resources are central to successful 
collective action (McCarthy and Zald 1988, Tilly 1978). 

Political opportunity may also be an important contextual variable to consider 
in regards to the formation of organizations representing social movements. 
The concept of political opportunity continues to be expanded and refined, but 
generally is viewed as a function of the level of elite receptivity of particular groups 
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and movements (Tarrow 1991, Tilly 1978). The general political ‘climate can affect 
how minority and special interest groups are perceived and eventually funded, or 
accorded certain rights and privileges. In short, openings and shifts in the political 
structure should improve the prospects for organizational activity. Minkoff's 
(1995) research indicates that a Democratic presidential administration in the 
previous year promoted new organizational foundings among women and ethnic 
minorities. Here, supportive elites in positions of power represent openings in 
the political structure that aid the growth and strength of minority communities. 
However, other studies find that political opportunity variables do not have 
robust effects when predicting the formation of new organizations. For example, 
McCammon (2001) finds that political opportunities, such as the openness of 
state legislatures to the voting rights of women and party competition, did not 
influence state suffrage organizing. 

Organizational ecology approaches have also had success in explaining 
organizational formation rates. Several studies have found that organizational 
density has a positive effect on new foundings, indicating that density is one 
mechanism leadingto the legitimacy of particularorganizational forms (Hannan and 
Freeman 1989). However, asthe numberof active organizations grows, competition 
among organizations emerges, and founding rates of new organizations decline 
(Hannan and Carroll 1992). In her study of the evolution of women's and minority 
organizations, Minkoff (1995) discovered that their proliferation was influenced 
by increases in the number of active organizations. The density of organizations 
promoted the institutional acceptability and legitimation of organizations based 
on the interests of women and ethnic minorities, and provided a model of action 
for new organizations. 

Taken together, past research on organizational foundings suggests that 
resource availability, political opportunities and organizational dynamics shape 
organizational formation. While useful, missing from this literature is a theoretical 
body of work that addresses the structured relations between racial and ethnic 
groups. Given that this paper focuses on the institutionalization of a constructed 
panethnic community, a theory of panethnicity must be considered to more fully 
understand the conditions that lead to panethnic organizational foundings. 


A Theory of Panethnicity 


Okamoto's (2003) theory of panethnicity provides a useful framework for 
understanding the structural conditions that influence the formation of 
panethnic organizations. Drawing on theories of ethnic boundary formation, this 
framework focuses on how the structuring of groups at different boundary levels 
influences panethnic outcomes. Specifically, the theory of panethicity posits that 
segregation and competition can occur between and among racial groups, and 
these dynámics have differing effects on panethnicity. 


Segregation Processes 


The cultural division of labor theory claims that the systematic and differential 
distribution of ethnic populations into the occupational structure influences 
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group and identity formation, as well as collective action (Hechter 1999 [1975]). 
This cultural division of labor can be segmental, where ethnic groups are 
highly occupationally specialized, or hierarchical, where ethnic populations are 
concentrated at the bottom of the occupational structure. According to the 
theory, a common social identity will arise among ethnic group members who 
are segregated into certain occupations due to their similar structural position, 
shared work experiences and daily interaction. In addition, group boundaries will 
be strengthened when ethnic populations come to view their life chances as 
largely determined by their membership in an ethnic group. This heightening of 
ethnic boundaries will lead to collective action in the form of protest or voting 
behavior. Several empirical studies have found support for the cultural division of 
labor theory in contexts ranging from the Celtic Fringe in the 1880s to Israel in 
the 1990s (Diez Medrano 1994; Hechter 1999 [1975], 1978; Mettam and Williams 
1998; Peled 1998). 

The cultural division of labor theory can be applied to panethnic collective 
action (Okamoto 2003): to the extent that Asians as a group are occupationally 
segregated from other racial groups, solidarity based on panethnic boundaries 
should develop and panethnic collective action should follow. However, 
the cultural division of labor theory does not specify the extent to which the 
occupational segregation of several different ethnic groups will counteract or 
enhance panethnicity. Recognizing the expansion and contraction of ethnic 
boundaries, the theory of panethnicity posits that the occupational segregation 
of Asian ethnic subgroups from one another, not simply the segregation of Asians 
as a group, will affect collective action outcomes. In fact, Asian ethnic groups will 
have fewer opportunities for interethnic interaction and a lower level of shared 
economic interests if they are specialized in different occupations, which should 
dampen the formation of a panethnic community. In sum, the predictions can 
be applied to organizational formation and are as follows: When Asians as a 
group are concentrated in particular occupations, this leads to the heightening of 
panethnic boundaries and leads to panethnic outcomes whereas the segregation 
of Asian subgroups into different occupations heightens ethnic boundaries and 
has a dampening effect on panethnicity. 


Competition Processes 


Competition theory posits that when two or more ethnic groups are in competition, 
collective action will occur because each group is attempting to maintain or 
improve its position in the social hierarchy (Olzak 1992). Even though competition 
theorists agree that occupational segregation can lead to the salience of ethnic 
identities, they argue that an additional incentive — economic processes that 
affect changes in the distributions of racial/ethnic populations — must be present 
in order for ethnic boundaries to be activated and ethnic collective action to occur. 
These economic processes, such as an increase in immigration or a decline in 
labor market segregation, lead to contact between racial and ethnic groups 
within a context of scarce resources, resulting in competition and collective 
action. This collective action may take the form of protest or conflict against 
other groups in order to exclude them from particular resources, or to ensure that 
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resources remain held by certain groups. Competition theory also suggests that 
economic contraction will also lead to ethnic competition and collective action. 
Several empirical studies indicate that when labor markets become integrated 
and contact between different ethnic/racial groups is initiated, competition 
between ethnic populations ensues, leading to ethnic collective action in the 
form of ethnic protest or ethnic conflict (Bonacich 1973, Olzak 1992, Olzak et al. 
1996, Ragin 1979). 

These theoretical ideas can be easily extended to the case of panethnicity: 
competition between Asians and other racial groups, such as whites or Latinos, 
will activate a panethnic boundary, leading to panethnic collective action. But in 
a multi-ethnic context where ethnic boundaries can shift and contract, the theory 
makes no predictions about how competition between Asian ethnic groups will 
affect collective action outcomes. In other words, if Asian ethnic groups are 
competing for the same resources, this should have a countervailing effect on 
panethnicity. Thus, recognizing that ethnic boundaries can shift and change, this 
theory can be extended to make predictions about the effects of competition 
measured at different boundary levels on panethnic outcomes (Okamoto 2003). 
To summarize, competition between Asians and other racial groups should be 
positively related to collective action based on a panethnic boundary, whereas 
competition between ethnic group members comprising a panethnic group 
should be negatively associated with panethnic activities, such as organizational 
formation. 


Data Collection 


To test the theoretical predictions above, | constructed a new data set of Asian 
American organizations from 1970 to 1998 using the Encyclopedia of Associations, 
National Organizations (Gale Research Co.). The Encyclopedia of Associations is 
considered to be the most comprehensive directory of national organizations in 
print and contains detailed information on nonprofit membership organizations, 
such as founding date, membership composition and size, objectives, budget, 
publications and affiliations. New organizations were identified by EOA in a 
variety of ways: by annual IRS information on new groups that have filed for 
nonprofit status, web sites, various source books and directories of associations, 
trade associations journals, snowball sampling and write-ins from associations 
that want to be listed in the publication. Following Minkoff (1995), | coded each 
edition of FOA separately in order to gain complete information about old and 
new entries. 

Organizations were coded as panethnic if they are predominantly comprised 
of Asian American members or serve an Asian American population. For example, 
the Organization of Pan Asian American Women is panethnic because it includes 
all ethnic group members in the category of Asian or Asian American. The Society 
for Asian Art is also panethnic. Even though it is not solely comprised of members 
from Asian ethnic groups, such organizations facilitate the development or 
maintenance of a panethnic identity through the dissemination of Asian history 
and culture. Organizations that disbanded or were founded before the starting 
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point of the study, and organizations located in other countries for part or the 
entire duration of their lives, were not included in the sample. 

Within the Asian American community, | identified 55 panethnic organizations 
that formed between 1970 and 1998. The sample includes a diverse set of 
organizations that deal with political advocacy, service or resource provision, 
cultural production and/or distribution, professional interests, cross-cultural 
understanding, research, business, education, human rights and cultural 
heritage. The goal of these organizations is generally to promote the status of 
Asian Americans. A range of means, including public education, lobbying efforts 
and protest strategies, is used to reach this goal. All organizations in the sample 
pursue civil, political, cultural or economic rights for their constituency and can 
be conceptualized as similar to social movement organizations because they 
influence the formation of panethnic identities and function as communication 
links in local and national networks. 


Dependent Variable 


The dependent variable, a count of pan-Asian organizational foundings, is coded 
for 30 metropolitan areas with the highest Asian American populations over a 
29-year period. (See Table 1.) These metropolitan areas coincide with the largest 
metropolitan areas in the United States, and provide a sample of sizable Asian 
populations. However, simply having a high Asian population does not guarantee 
that different ethnic groups can coordinate their efforts to institutionalize an 
imagined panethnic community. National pan-Asian organizations formed in 12 
of these metropolitan areas over the given time period, but it is still important to 
include all 30 metropolitan areas in the analysis to avoid sample selection bias. 


Independent Variables 


All of the independent variables were created from 1970, 1980, 1990 and 2000 
Public Use Microdata Samples(PUMS)census data(U.S. Departmentof Commerce, 
Bureau of the Census 1971, 1985, 1995, 2003) unless stated otherwise. Because 
PUMS data are available in 10-year internals, | used linear interpolation to fill in 
the values in between census years. | also include a number of national-level 
variables to measure resource availability and political opportunities because past 
research has shown that such variables significantly affect the rate of formation 
among national organizations serving women and ethnic minorities (i.e., Minkoff 
1995). However, most of the variables included in the analysis are measured at 
the metropolitan level because local processes are also important for panethnic 
organizational formation. For example, in the case of American Citizens for 
Justice, a national organization that deals with Asian American civil rights issues, 
local variables shaped the institutionalization of a panethnic community, as the 
effort to create the organization was spearheaded by local groups and leaders 
in response to a local event (Espiritu 1992, Zia 2000). Recent research on ethnic 
and panethnic organizations serving the Asian American community provides 
further evidence of how local demographics and events shape the organizational 
activities of national Asian American organizations (Okamoto 2005). Descriptive 
statistics for all variables are presented in Table 2. 
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Table 1: 30 Metropolitan Areas with the Largest Asian American Populations 


Anaheim-Santa Ana-Garden Grove, CA 
Atlanta, GA 

Baltimore, MD* 

Bergen-Passaic, NJ* 

Boston, MA 

Chicago, IL* 

Dallas-Ft. Worth, TX 

Denver, CO 

Detroit, MI* 

Fresno, CA 

Honolulu, HI 

Houston, TX 

Los Angeles-Long Beach, CA* 
Middlesex-Somerset-Hunterdon, NJ 
Minneapolis, MN 

Nassau-Suffolk, NY New York, NY-NJ* 
Newark, NJ 

Philadelphia, PA* 

Phoenix, AZ 

Portland, OR-WA 

Riverside-San Bernardino, CA 
Sacramento, CA* 

San Diego, CA* 

San Francisco-Oakland, CA* 

San Jose, CA 

Seattle-Everett, WA* 

Stockton, CA 
Vallejo-Fairfield-Napa, CA 


Washington, D.C., MD-VA* 


*National pan-Asian organizations formed in these locations, 1970-1998. 


Competition Variables 

The competition perspective suggests that increases in racially- and ethnically- 
distinct others, contact between ethnic groups due to a decline in labor force 
segmentation, and a weakening economy will intensify competition and foster 
collective action. 


In-migration Rate 

To measure increases in racially distinct others in a metropolitan area, | constructed 
an in-migration variable. These data were gleaned from a set of questions from 
the PUMS regarding the respondent's prior residence.' If a respondent resided 
outside of the United States or in a different state or metropolitan area five years 
ago, the in-migrant variable was coded 1 and aggregated to the metropolitan 
area level. | then calculated the percent non-Asian in-migrants by dividing the 


Institutional Panethnicity + 11 


Table 2: Descriptive Statistics for Independent and Dependent Variables 


Variables Mean S.D. 
Panethnic organizational foundings .07 .32 
Attacks against Asians .02 .13 
Ethnic heterogeneity .81 .09 
Percent Asian 6.22 10.93 
Total population size 14.51 18 
Federal funds 8.04 .37 
Philanthropic funding 4.70 17 
Highly educated Asian population 62.14 12.08 
Democratic presidential administration SH 46 
Democratic advantage 83 BN 
Organizational density 2.00 446 
Non-Asian in-migrant rate 13.34 8.49 
Asian-white unemployment ratio .01 OS 
Asian occupational segregation 1 43 
Asian occupational hierarchy -.01 .07 
Poverty rate i LEON E 
Unemployment rate 5.74 2.04 
Unemployment Ratio by Ethnic Group 
Chinese 1.03 .61 
Filipino 15091 E99 
Japanese ITA 1:08 
Korean 149 2.01 
Non-ethnic Group Asian In-migration Rate 
Chinese 105 147 
Filipino 102 1.34 
Japanese 321 1.59 
Korean pel 
Occupational Hierarchy by Ethnic Group 
Chinese -.09 .10 
Filipino -.08 .09 
Japanese -.03 .08 
Korean -.04 .08 
Occupational Segregation by Ethnic Group 
Chinese 1.40 al 
Filipino 1.42 38 
Japanese 1.85 19 
Korean 1.45 19d 
Note: N = 870 


number of non-Asian in-migrants by the total metropolitan area population. 
The in-migration rate measuring the level of in-migration of ethnically-distinct 
others within the Asian category was also calculated as a percentage of the total 
metropolitan area. For example, the non-Chinese in-migration rate measures 
the percentage of Asian in-migrants in a metropolitan area that are not Chinese. 
According to competition theory, the entrance of ethnically-distinct groups into 
the metro area should dampen the possibilities for panethnic outcomes. 
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Unemployment Ratio 

This ratio variable measures the extent to which two groups are experiencing 
relative disparity in terms of percent unemployed. Specifically, the unemployment 
ratio, Au/Wu, compares percent Asian unemployed and percent white 
unemployed. If Asians have a lower rate of unemployment than whites, the ratio 
might look like this: AuWu = 5/10, or .5. If the value of the ratio increases, this 
reflects a decrease in percent white unemployed or an increase in percent Asian 
unemployed. In either situation, the relative degree of disparity, or the extent to 
which the two groups are unequal, has actually declined. Here, whites have made 
gains relative to Asians and the relative resources of the groups have become 
more equal. Under these conditions, competition and collective action on the part 
of Asians should erupt. | constructed unemployment ratios comparing Asians 
with other racial groups, such as whites, and unemployment ratios comparing 
each Asian ethnic group to all other Asians. 


State of the Economy 
| include the overall unemployment rate and poverty rate to measure economic 
downturns in metropolitan areas. 


Segregation Variables 

The cultural division of labor theory suggests that it is the separation, not the 
integration, of racial groups within occupational spaces that leads to higher 
levels of collective action. | constructed two variables to measure the extent 
to which segmental and hierarchical cultural divisions of labor exist within 
metropolitan areas. 


Occupational Segregation 
The occupational segregation index (Charles 1992) measures the degree to which 
an ethnic group experiences occupational specialization in the following form, 


(/7) x Z;[In(A;/N;) - [Ur x E n(A;/N;)]| 


where Aj equals the number of workers from a specific Asian ethnic group in 
occupation /, Nj is the number of workers from all other Asian subgroups in 
occupation j, and / is the number of occupational categories. When the value 
of the index increases, ethnic groups are more occupationally specialized. To 
measure the degree to which Asians as a group are concentrated in certain 
occupations, Aj equals the number of Asian workers and Nj equals the number 
of non-Asian workers. When the index increases, Asians experience high levels 
of occupational segregation relative to other racial groups. The advantage of this 
occupational segregation index is that is it not influenced by the ethnic and racial 
composition of the labor force or the size of different occupational categories. 


Occupational Hierarchy 
The occupational hierarchy index (Diez Medrano 1994, Okamoto 2003) measures 
the degree to which an ethnic group is concentrated in low-paying occupations 
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in the following form, 
A, / W, - (VI x >, A, / W.) 


where A/s equals the total number of low-skill workers from a specific Asian 
subgroup, W/s is the total number of low-skill workers from all other Asian 
subgroups, / is the number of occupational categories, Aj is the total number of 
Asian subgroup members in occupation /, and W/ is the total number of workers 
from all other Asian subgroups in occupation /. When the index increases, there 
is more hierarchical segregation among Asian ethnic groups. | also calculated 
a measure of the extent to which Asians as a group are concentrated into low- 
paying occupations relative to all other racial groups. Thirteen broad occupational 
census categories were used in the calculation of all occupational segregation 
indices at the metropolitan area level.? 


Additional Variables 
To properly estimate the effects of the main independent variables, | also include 
a number of additional variables that may explain organizational founding rates. 


Resource Availability 

The.availability of resources has been an important predictor of the founding of 
social movements organizations and voluntary organizations. | created a variable 
measuring the annual amount of federal grants to state and local governments 
for community and regional development. These funds are locally administered 
to benefit low-income and other vulnerable groups in the community, and often 
support organizations that address affordable housing, new job creation and 
the expansion of business opportunities in urban areas. These funding figures 
are taken from the Budget of the U.S. Government: Historical Tables Fiscal Year 
2005, presented in billions, and measured at the national level. | also constructed 
a measure for philanthropic funding, which includes annual donations from 
corporations, individuals and foundations. This information is reported at the 
national level in Giving USA: The Annual Report on Philanthropy, a publication by 
the American Association Fund Raising Council (AAFRC) Trust for Philanthropy. 
Both variables are logged for ease of interpretation of estimates. 

Another important resource for the formation of organizations is a large, highly 
educated constituency. Because of its financial and social capital which can be 
used to organize the community, this population is viewed as a major resource. To 
measure the extent of a highly educated Asian American constituency, | include a 
metropolitan-level variable measuring the percentage of Asians, age 25-44, who 
have completed some college or higher. 


Political Opportunity 

A Democratic advantage at the federal and state levels should influence the 
extent to which the political environment is open to the needs and interests of 
minority groups. | include two variables to measure the political environment and 
potential opportunities for the founding of Asian American organizations. | created 
a national-level variable measuring whether the presidential administration in the 
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previous year was Democratic or not. In addition, | include a variable measuring 
Democratic advantage at the state level. If more Democrats than Republicans 
were elected to Congress in the previous year, this variable was coded 1. Data 
from the Statistical Abstract of the United States (U.S. Bureau of the Census 1974, 
1978, 1985, 1989, 1995, 2000) were used to create both variables. 


Organizational Dynamics 

| include a measure of organizational density or the number of panethnic 
organizations currently active in a metropolitan area. Established panethnic 
organizations not only provide a working model for operations, but they promote 
the institutional acceptability and legitimation of organizing along a panethnic 
boundary. 


Control Variables 

| control for the size of metropolitan area by including /og of total population 
and the size of the Asian population by including percent Asian. | also include a 
variable measuring the extent of ethnic diversity within the Asian population in 
each metropolitan area. If a multitude of Asian ethnic groups live in the metro 
area, it may be more likely that a diverse set of local community members 
will participate in the formation of the organization, and national leaders may 
choose to start a national organization in a location where they can reach local 
ethnic communities as well as national needs. Finally, given that contemporary 
explanations of panethnicity suggest that external threats, which are part of the 
racialization process, encourage groups to participate in panethnic collective 
action, | constructed a dummy variable measuring whether attacks against Asian 
Americans occurred in a metropolitan area in a given year. The data used to 
create this variable were taken from newspaper articles published in the New 
York Times, Los Angeles Times and Chicago Tribune. (See Okamoto 2003 for a 
discussion of data collection procedures.) 


Analytic Strategy 


Count data is typically analyzed using a Poisson or negative binomial regression 
model (Long 1997). Because zero counts in the data are greater than expected 
in a Poisson or negative binomial distribution, | use a zero-inflated regression 
model (Lambert 1992, Long 1997). This model assumes a dual-state process 
involving a zero-response state with probability p; and a non-zero-response state 
with probability 1 - pi, where pj is an unknown parameter to be estimated. In 
other words, the zero-inflation model assumes that there are two latent groups 
in the sample with two different distributions: the first state refers to those 
metropolitan areas that always have zero counts, while the second state refers 
to other metropolitan areas with organizational formation frequencies that follow 
a Poisson or negative binomial distribution. 

Diagnostic tests reveal that the dependent variable follows a Poisson-like 
distribution. To confirm this, | estimated a negative binomial regression and 
the likelihood test indicated that alpha was not significantly different from zero, 
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suggesting that the negative binomial model does not fit the data better than the 
Poisson model. In addition, Vuong tests show that the zero-inflated Poisson (ZIP) 
model fits the data better than a Poisson model (Vuong 1989). Robust standard 
errors, adjusted for the clustering of organizational foundings within metropolitan 
areas that may be non-independent across years, were estimated. In addition, all 
models were run with fixed effects for each metropolitan area to control for any 
unmeasured factors associated with the different metropolitan areas. 


Results 


| documented the formation of 55 pan-Asian national organizations with Asian 
American memberships and/or constituencies from 1970 to 1998, within the 30 
metropolitan areas under study. Examining the pattern of organizational formation 
over time, | find that the number of pan-Asian organizations has increased since 
1970 and throughout the 1980s, with the high point of formations occurring in 
1980 (Figure 1). Asmaller number of national Asian American organizations formed 
in the 1990s which may be due to a rapid expansion of diverse Asian populations 
or perhaps Asian Americans are becoming assimilated into mainstream culture 
with each passing generation and look to mainstream institutions for social and 
economic support. 

Table 3 presents the results of nested zero-inflation Poisson regression models. 
| include all of the theoretically-derived covariates in the Poisson part of the 
model because | am primarily interested in the contextual factors that influence 
organizational formation among metropolitan areas which are at higher risk. The 
inflated part of the model includes simple demographic variables to predict the 
probability of the zero state.? The first model includes control variables in the 
Poisson part of the model. The second model estimates the effects of resource 
availability, organizational dynamics and political opportunity, net of controls, on 
organizational formation. The third model adds boundary formation variables 
and represents the full model. It is important to note that the comparison of 
log likelihoods from Model 2 and Model 3 indicates that including the boundary 
formation variables significantly improves the fit of the model. | discuss the 
findings from the three models below. 

The results from Model 1 indicate that one of the control variables, percent 
Asian, is significant. An increasing Asian population raises the odds of a panethnic 
organization forming in a given metropolitan area by six times." In the inflated 
part of the model, the increasing size of the metropolitan area and a growing 
Asian population leads to a higher probability of the zero state. In other words, 
in areas with large overall and Asian populations, there are fewer organizational 
foundings.? It could be that in such areas, Asians are forming organizations along 
ethnic rather than panethnic lines, and/or that increasingly large metropolitan 
areas make it difficult to organize a panethnic community. 

Model 2 shows that a number of the variables used to predict the formation 
of social movements organizations are significant. Interestingly, the effects 
of federal and philanthropic funding are both negative: federal grants to local 
communities as well as foundation, corporate and individual donations actually 
hinder new organizational activity. Past research reveals that funders often view 
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Figure 1. Pan-Asian Organizational Foundings in the United States, 1970-1998 
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Asian Americans as a population that is not in need due to the model minority 
stereotype (Kimura 1990, Espiritu 1992, Shiao 1998). It is likely that philanthropic 
and federal funds are directed toward other groups which are able to frame their 
needs and interests as more compelling, and this dampens the ability of Asians to 
garner financial support to start organizations. Another type of resource, the size 
of the highly educated sector of the Asian American population, has a positive 
effect on the formation of panethnic organizations. This finding suggests that 
such a constituency provides financial and social capital as well as leadership for 
organizational efforts. 

The results also show that one of the political opportunity variables is 
significant: when the federal administration in the previous year is Democratic, 
this increases the odds of panethnic organizational formation, suggesting that 
political opportunities at the national level influence organizing based on a 
panethnic boundary. However, there is no effect of Democratic advantage on 
panethnic organizational formation, but the density of panethnic organizations 
raises the number of foundings, just as organizational ecology perspectives 
predicted. In fact, an increase in the number of panethnic organizations in a 
metropolitan area raises the likelihood of new foundings by 21 percent.® 

Model 3 presents the full model. The results show that three boundary 
formation variables are significant. When the rate of non-Asian in-migrants to a 
metropolitan area increases, this raises the odds of the formation of panethnic 
organizations. This result supports competition theory which claims that a rising 
supply of racially-distinct workers intensifies competition and collective action. 
Another measure of competition, the unemployment ratio, is not significant. As 
the relative resources of Asians and whites become more equal, this does not spur 
the formation of panethnic organizations as competition theory would suggest.’ 

Continuing to explore the effects of the boundary formation variables, | 
find that occupational segregation has a small, significant effect on panethnic 
foundings. In addition, the estimate for occupational hierarchy is large and highly 
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Table 3: Zero-Inflated Poisson (ZIP) Regression Models Estimating 
the Effects of Independent Variables on the Formation of Pan-Asian 
Organizations, 1970-1998 


ee L n 
Independent Variable Model 1 Model 2 Model 3 


Poisson Part 
Control Variables 


Attacks against Asians -.43 ET. 66% 
(.47) (.24) (.25) 
Ethnic heterogeneity -7.35 2.24 5.21* 
(5:75) (2.51) (2.73) 
Percent Asian 1.8375 1.48 -4.19 
(.60) (2.13) (6.45) 
Total population 12 39 .69* 
(.30) (.26) (.38) 
Additional Variables 
Federal funding — -.43* -.35 
(.26) (.37) 
Philanthropic funding — -6.59*** -6.29*** 
(1.09) (2.03) 
Highly educated Asians — 5:28*^* 3.61* 
(1.58) (1.96) 
Democratic administration — KC wa Bac? 
(.22) (.22) 
Democratic advantage — -.21 $12 
(.30) (.35) 
Panethnic organization density — Eo oun 
(.02) (.03) 
Boundary Formation Variables 
Non-Asian in-migration rate — — 6.82* 
(4.23) 
Asian-white unemployment ratio — — 6.47 
(9.24) 
Asian occupational segregation — — .24* 
(.18) 
Asian occupational hierarchy — — 3.63 
(.76) 
Poverty rate E -— sl? 
(.09) 
Unemployment rate — — 10.32 
(10.7) 
Intercept 4.22 19.00** 10.30 
(6.24) (6.22) (9.25) 
Inflated Part 
Percent Asian -4.03*** -7.80*** -1.91 
(1.25) (1.97) (1.24) 
Total population AA -3.63*** -4.46* 
; (.82) (1.00) (2.30) 
Ethnic heterogeneity .96 290555 TELS 
(5.32) (2.07) (3.16) 
Intercept 9:97 (Sis 1.65 
(1.06) (.45) (.99) 
-2 log likelihood 333.30 236.65 223.37 
McFadden's R-squared .18 .42 AS 


*px.05 *p<.01 **p=<.001 (one-tailed tests) 

Note: N = 870. Robust standard errors are presented in parentheses. 
The Poisson part of the model estimates the non-zero-state probability 
as a poisson function, and the inflated part estimates the zero-state 
probability as a logistic function. Values for likelihood ratio tests are 
as follows: Model 1 vs. Model 2 = 96.65 (p < .001, d.f. = 6; Model 2 vs. 
Model 3 = 13.28 (p S .05. d.f. = 6); Model 1 vs. Model 3 = 109.93 (p < 


0015 df =12). 
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significant. This suggests that when Asians find themselves concentrated in the 
lower levels of the occupational structure, group boundaries are heightened, 
leading to panethnic mobilization. Finally, the unemployment rate and poverty 
rate were not significant, indicating that economic downturns have no effect on 
panethnic organizational foundings. 

In Model 3, the effects of the variables measuring resource availability, 
political opportunities and organizational dynamics remain highly robust, with 
the exception of federal funding for community development, which is no longer 
significant in the full model. Finally, three of the control variables in Model 2 are 
significant. An increasing Asian population, attacks against Asians and a large 
total population in the metropolitan area increase the likelihood of new panethnic 
organizational activity. 

To find out if competition and segregation between Asian ethnic groups have 
any effect on panethnic organizational formation, | turn to the models in Table 4 
that include variables decomposed by Asian ethnic group.? The results show that 
ethnic occupational hierarchy is highly significant and negative across all models: 
when Chinese, Filipinos, Japanese and Koreans are occupationally segregated 
in low-paying occupations vis-a-vis other Asians, this dampens the likelihood 
of panethnic organizational formation. This supports the cultural division of 
labor perspective which predicted that when different Asian ethnic groups are 
separated from one another in the occupational structure, ethnic boundaries 
will be heightened, the creation and maintenance of interethnic networks will 
decline, and panethnic organizational formation will be less likely to occur. Ethnic 
occupational segregation is negative and significant when the Chinese are 
segregated, but the segregation indices for the other Asian ethnic groups do not 

have an effect on panethnic foundings. 

Competition theory predicted that competition between Asian ethnic groups 
would decrease the likelihood that pan-Asian organizations would form. The results 
reveal that the unemployment ratios comparing specific ethnic groups to all other 
Asians were not significant in three of the models. In other words, a decrease 
in disparity between Chinese, Filipinos and Japanese respectively and all other 
Asians has no effect on the likelihood of panethnic organizational formation. For 
Koreans, a decrease in disparity actually facilitated new organizational activity, 
which is counter to competition theory predictions. The effects of the in-migration 
variables were highly significant for Chinese and Japanese. For example, in 
Model 2, as the percentage of non-Japanese Asian in-migrants increases, this 
raises the likelihood of panethnic organizational formation. Competition theory 
posited that when ethnically-distinct others enter the metropolitan area, there 
will be competition for scarce resources, leading to less panethnic activity since 
each ethnic group is collectively working to improve or maintain its own share 
of the resources. This claim is not supported here. Instead, an increase in Asian 
in-migrants leads to new panethnic organizational activity, which is likely due to 
the need of new Asian ethnic populations. 


Discussion and Conclusion 


New organizational forms based on collective identities emerged in the 1970s, 
including those that brought together and served distinct ethnic groups under a 
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Table 4: Zero-Inflated Poisson (ZIP) Regression Models Estimating Ethnic-Specific Boundary 
Formation Variables on the Formation of Pan-Asian Organizations, 1970-1998 


Model 1 Model 2 Model 3 Model 4 


Independent Variable Chinese Filipino Japanese Korean 


Ethnic-specific Boundary Formation Variables 


In-migration rate Eis 6.25 02 .68 
(.49) (5.86) (.37) (.54) 
Unemployment ratio .05 -.45 -.05 .08* 
(.46) (.34) (.20) (.04) 
Occupational segregation -.59* -.38 -.20 -.52 
(.34) (.34) (.22) (.33) 
Occupational hierarchy -.03*** -.04** -.02*** H 
(.01) (.02) (.01) (.06) 
Intercept RT 1.56* 1.40* 2.78 
(.82) (.82) (.82) (5.98) 
Inflated Part 
Percent Asian -4.47** -4.23* -1.11* 1.04 
(1.82) (2.41) (.57) (1.40) 
Total population 4.09 -3.30* -3.20* 5.21 
(1.22) (1.58) (1.66) (5.85) 
Ethnic heterogeneity 7.74** dE 1,19" 2.69 
(3.05) (3.60) (4.00) (2.11) 
Intercept -1.89 -1.89 -2.03 0:97 
(1.44) (1.44) (1.19) (5.60) 
-2 log-likelihood 217.54 219.84 221.11 226.17 


McFadden's R-squared .46 Ap AN A4 


*p < .05 **p <.01 ***p < .001 (one-tailed tests) 

Note: N = 870. Robust standard errors are presented in parentheses. 

Variables measuring political opportunity, resources, organizational dynamics, control variables, 
and Asian boundary formation variables are included in the models but are not shown here. The 
Poisson part of the model estimates the non-zero-state probability as a poisson function, and the 
inflated part estimates the zero-state probability as a logistic function. 

*In Model 1, this variable measures the percentage of non-Chinese Asian in-migrants who have 


entered the metropolitan area in a given year. 


larger unified panethnic framework. Past studies on panethnicity emphasize the 
importance of racialization, or the ways in which groups are viewed and treated 
as monolithic and homogeneous by outsiders, in the formation of panethnic 
organizations. However, such studies do not directly address the role of local 
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contextual factors in the consolidation of group boundaries. This paper fills 
a gap in the literature and examines the structural conditions that lead to the 
institutionalization of a socially constructed panethnic community. Using a new 
longitudinal data set of Asian American organizations, | test a theory of panethnicity 
which specifies that competition and segregation at different boundary levels will 
influence organizational formation based on a panethnic boundary. 

The results from the regression models clearly indicate that certain types 
of political opportunities and resources encourage the formation of pan-Asian 
organizations, and established panethnic organizations in metropolitan areas 
provide a legitimate organizational form for others to follow. The analyses also 
demonstrate that the ways in which racial and ethnic groups are structured 
in relation to one another, as measured by the boundary formation variables, 
influence panethnic outcomes. Net of all other variables, | find that the occupational 
segregation of Asian Americans encourages the formation of national pan-Asian 
organizations. When Asians are concentrated in the occupational structure, this 
not only heightens a panethnic boundary, leading Asian Americans to found 
their own institutions, but it also signals that in the post-Civil Rights era, Asian 
Americans were not completely integrated into mainstream society. These 
organizations, largely based on the idea that Asian Americans have specific 
needs and issues as well as shared experiences of historic and contemporary 
discrimination, were formed to effectively navigate within the larger society and 
to negotiate the Asian American community's social, economic and political 
position. In addition, as predicted, the segregation of specific Asian ethnic 
groups - Chinese, Filipinos, Japanese and Koreans — into the lower part of the 
occupational structure discourages panethnicity. 

Finally, the results show that competition variables, such as the overall poverty 
rate and unemployment ratio comparing resource levels of whites and Asians, do 
not have an effect on new organizational activity. In the analysis including ethnic- 
specific variables, | find that competitive efforts between Asian ethnic groups 
are not present. In fact, the effects of unemployment ratios comparing specific 
ethnic groups with all other Asians are positive or non-significant, and the in- 
migration of different Asian ethnic groups actually facilitates the formation of 
pan-Asian organizations. Even though not all ethnic group members identify as 
Asian American, the results suggest that Asian in-migrants — native and foreign- 
born — contribute to a population that can be served by new organizations with a 
diverse Asian American constituency. 

Why wasn't competition more central in explaining institutional panethnicity? 
The distribution of the different types of organizations in the data set indicates 
that more than one-fourth of the pan-Asian organizations formed between 1970 
and 1998 were political organizations that shared the common goals of promoting 
civil, economic and political rights for Asian Americans as well as Asians in their 
countries of origin. For example, the Asian American Voters Coalition promotes 
the equal treatment of Asian Americans in the U.S. political system and sponsors 
voter registration and education drives. The Committee against Anti-Asian 
Violence works to eradicate racism and violence against Asian Americans, and 
offers services such as legal consultation and education. Another fourth of the 
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organizations were professional organizations with Asian American memberships. 
These organizations often had the shared goals of developing networks, 
encouraging equal employment opportunities, and exchanging knowledge in the 
field. But members in these organizations had more than their professions in 
common. The stated goals of the Asian American Journalists Association are not 
simply to increase employment of Asian/Pacific American journalists, but also to 
encourage fair and accurate news coverage of Asian/Pacific American issues. 
Simllarly, one of the stated goals of the Association of Asian/Pacific American 
Artists is “to encourage equal employment opportunities in all aspects of the 
entertainment industry in order to assure realistic images and portrayals of Asian/ 
Pacific people as they exist in real life and in the mainstream of America.” (Gale 
Research Co. 1999) These data suggest that pan-Asian organizations form to fill 
a need that is not being met by the market or state. As Espiritu and Ong (1994) 
have previously noted, pan-Asian organizations address general Asian American 
issues, such as political representation and stereotypes in the media, which are 
adaptive responses to the dominant group. These findings resonate with Vo's 
(1996, 2004) study of the Asian Business Association in San Diego: members 
Joined the organization because of shared professional interests and common 
experiences of economic exclusion and employment discrimination. Overall, 
these findings suggest that pan-Asian organizations are not forming because 
Asian Americans are in competition with other racial groups for resources. 
Instead, Asian Americans find themselves without equal opportunities and fair 
treatment, and therefore form pan-Asian organizations to attain these collective 
goods and create a supportive community within which to strategize about 
collective issues. 

This research has emphasized the importance of understanding the contextual 
factors that encourage or deter the institutionalization of a socially constructed 
panethnic community. Considering competition and segregation processes that 
address the structured relations between groups at different boundary levels in 
addition to factors such as political opportunities, resources and organizational 
dynamics, provides a more complete understanding of panethnic organizational 
foundings. Future research should focus on the ways in which panethnicity is 
expressed and negotiated at the organizational level, as well as the contextual 
factors that facilitate the survival of these organizations over time. 


Notes 


1. The in-migration variable is a considerable improvement over variables using 
only immigration data because it includes native and foreign-born migrants 
to metropolitan areas. 


2. See Okamoto (2003) for details regarding segregation indices. 


3. The mean variation inflation factor (VIF) for the full model is 1.67 and the VIF 
for each individual variable is 3.0 or less, which suggests that multicollinearity 
is not a problem. 


4. To calculate the odds, the coefficient is exponentiated (e^). 
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5. It is actually not contradictory for an increasing Asian population to lead to 
fewer organizational foundings in the inflated model, and then to have a 
positive effect on organizational foundings in the Poisson part of the model. 
This effect in the Poisson part of the model simply suggests that among 
those metropolitan areas which are at higher risk, percent Asian facilitates 
new organizational activity along a panethnic boundary. 


6. Percent change is calculated as (e* - 1)*100. 


7. | also included an unemployment ratio comparing percent unemployed 
Latino and black with percent unemployed Asian. The coefficient for this 
variable was not significant. 


8. | do not include analyses for Asian Indians and Vietnamese because data on 
these groups are not available for the entire time period under study. 
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Dropping the Hyphen? 
Becoming Latino(a)-American through Racialized Assimilation 


Tanya Golash-Boza, University of Kansas 


Abstract 

Early assimilation theorists predicted the eventual loss of ethnic distinctiveness for 
immigrants in the United States. In this paper, the author not only questions the 
possibilities that Latino and Latina Americans have for losing their ethnic distinctiveness, 
but also proposes that these possibilities vary widely within the Hispanic population. The 
central question posed in this paper is: In addition to traditional predictors of assimilation, 
do experiences of discrimination also affect ethnic identification patterns for Hispanics? 
Based on the analysis of two data sets, the 1989 Latino National Political Survey and the 
2002 National Survey of Latinos, the author provides evidence that Latino/a Americans 
who have experienced discrimination are less likely to self-identify as “Americans,” and 
more likely to self-identify with pan-ethnic or hyphenated American labels. The author 
contends that this is because experiences of discrimination teach some Latinos and Latinas 
that other citizens of the United States do not view them as “unhyphenated Americans." 
The author further proposes that, through a process of racialized assimilation, these Latin 
American immigrants and their children are becoming Latino and Latina Americans. 


Introduction 


At a rally in Siler City, North Carolina, one chilly afternoon in February 2000, David 
Duke told his attentive, although sparse, crowd that foreign elements wanted to 
take over his and their America. In Duke's disparaging diatribe about the failure of 
Latin American immigrants to blend into the melting pot, he did not mention that 
Irish and Italian immigrants had also been deemed incapable of assimilating in the 
early 20th century. Duke did make himself clear that when he refers to America 
and American values, he is referring only to the direct descendants of European- 
Americans. By doing this, he elucidated what normally goes unspoken - that 
the unmarked label "American" in most cases means European-American. For 
this reason, when we refer to African-Americans or Asian-Americans, we specify 
with hyphenated labels. On the other hand, when we are referring to European- 
Americans, "American" usually suffices. For example, Feagin (2000:100) reports 
that, in a six-month study of 65 U.S. newspapers, the term "American" was 
found many times, but the expression “white American” was exclusively used in 
juxtaposition to another racial category, usually “African-American.” 

If the unhyphenated American label is reserved for white Americans as 
Feagin (2000) argues, then to what extent can Latin American immigrants and 
their descendants become Americans? Feagin contends that the unhyphenated 
American label refers to those people in the United States who have the luxury 
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of pretending they do not have a racial or ethnic status. Pamela Perry's (2002) 
ethnographic study of racial identities among high school students reveals that 
many whites see themselves as not having a racial status or identity. For Perry's 
white high school informants, those who were attuned to the prevalence of 
discrimination in U.S. society were more likely to recognize their white privilege, 
and thus their racial status as white. Among Bonilla-Silva's (2003) informants, 
one of the primary distinctions between the racial attitudes of blacks and whites 
was the recognition of the existence and prevalence of discrimination. His 
analyses demonstrate that only a small percentage of whites are aware of the 
omnipresence of discrimination in U.S. society. Perry's (2002), Feagin's (2000) and 
Bonilla-Silva's (2003) analyses demonstrate that the recognition of the existence 
of discrimination plays a key role in the awareness of one's racial status. 

| will argue that the recognition of discrimination plays a fundamental role 
in determining not only one's racial attitudes, but also one's racial or ethnic 
identification. Part of being white is being able to ignore the prevalence of racial 
discrimination in U.S. society (Mclntosh 1998). In the United States, the ethnic 
identity "American" is an unmarked ethnic identity just as white is an unmarked 
racial identity (Dyer 1999). As such, part of being American is being able to 
ignore the prevalence of discrimination based on national origin in the United 
States. While whites self-identify as Americans, non-white Americans recognize 
that they are not Americans, but African-Americans, Native Americans, Asian 
Americans or Latino/a Americans. In this sense, how one becomes American 
or how one assimilates into American society depends in large part on one's 
racial status. Nevertheless, as | will demonstrate, scholars of immigration have 
not given sufficient consideration to racial status when studying patterns of 
immigrant adaptation. 


Hispanics and Discrimination in the United States 


It should come as no surprise to most social scientists that Hispanics face 
discrimination in the United States. Many people in the United States view 
Hispanics as poor, uneducated, unclean, illegal aliens and prone to teenage 
pregnancy (Oboler 1995:14, Giménez 1999, Young 2000: 161). Popular media has 
contributed to these perceptions. In a study of the portrayal of Hispanics in the 
media, Lichter and Amundson (1997) found that, "compared to both Anglos and 
African Americans, television's Hispanics were low in number, low in social status, 
and lowdown in personal character, frequently portraying violent criminals." (71) 

survey analysis has also been used to demonstrate the prevalence of 
Stereotypes about Hispanics. Iris Young (2000) tells us that Hispanics are 
often perceived as less intelligent or culturally advanced than Anglos. Bobo 
and Johnson (2000:144) show that Latinos/as are perceived as less affluent 
than whites or Asians. Their results also indicate that whites' stereotypes of 
Latinos/as are very similar to whites' stereotypes of blacks. That is, Latinos/as 
are seen as less intelligent, more prone to be on welfare, and more likely to be 
involved in drugs or other criminal activity than whites or Asians. In addition, 
blacks and Asians have a more negative view of Latinos/as than they do of 
whites. These widespread stereotypes about the characteristics of Hispanics are 
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bound to influence interactions between non-Hispanics and people perceived 
to be Hispanics. | will argue that these interactions are likely to affect the ethnic 
identifications of Hispanic Americans. 

Given the prevalence of prejudicial beliefs in the United States about Hispanics, 
one can surmise that people who are labeled as Hispanic are likely to experience 
discrimination at some time in their lives. However, within the segment of the 
U.S. population that the Office of Management and Budget defines as Hispanic, 
only some of the individuals actually have the physical and cultural features that 
result in their being considered Hispanic in daily interactions. Because not all 
Hispanics are easily categorized as Hispanic, Hispanics experience varying levels 
of discrimination. In this article, | will make the case that those Hispanics who 
do experience discrimination are less likely to self-identify as American because 
this discrimination increases their awareness of their non-white status in the 
United States. | will also argue that Hispanics who do experience discrimination 
are more likely to self-identify as Hispanic or Latino/a, because experiences of 
discrimination teach Hispanics that they are labeled as Hispanics or Latinos/as 
by others in the United States. 

The analyses presented in this paper are based on ethnic self-identifications 
in national surveys. | acknowledge that survey analysis is not adequate to gain 
an in-depth understanding of the respondent's identity. Nevertheless, it is 
reasonable to propose that individuals who self-identify on a survey as Hispanic 
or as American have some attachment to these labels. This attachment, or lack 
thereof, provides a window into which we can glimpse a key part of their ethnic 
identity. If a person chooses not to identify as American when asked if that can 
be used to describe them, then we can safely assume that American-ness is not 
an integral part of his or her identity. This is important for the argument made 
in this paper, since the core assumption of traditional assimilation theory is that 
immigrants will eventually become un-hyphenated Americans. | will examine 
these assimilation theories in more detail in the following section. 


Assimilation 


Straight-Line Assimilation 


The concept of assimilation was developed in the beginning of the 20th century, 
and can be understood as the process by which immigrants are incorporated 
into the host society. Two underlying assumptions in assimilation theory are 
that all immigrants will eventually assimilate, and that the primary factor that 
determines their level of assimilation is their generational status. Gordon (1964) 
argued that assimilation equals Americanization, and that Americanization means 
assimilating into the Anglo-Saxon core. However, more recent scholars such as 
Alba and Nee (1997) contend that assimilation can also take other forms. They 
specify that, while some immigrants assimilate into majority culture and become 
white Americans, others assimilate into minority culture and become black 
Americans. All of these theorists agree that assimilation entails the eventual 
loss of attachment to one's country of origin and the disappearance of the 
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ethnic distinctiveness of immigrants. Gans (1979) asserted that by the seventh 
generation all traces of ethnic distinctiveness disappear. 

Self-identification has traditionally been considered to be a key component of 
the assimilation process (Gans 1979). As immigrants and their children progress 
from self-identifying as Italian, Italian-American, to American, they effectively are 
assimilating into U.S. society. In fact, dropping the hyphen could be seen as the 
ultimate act of assimilation. Early assimilation theorists such as Gordon (1964) 
argued that all immigrants will eventually assimilate into U.S. society. While 
Gordon operated under the assumption that all immigrants take the same path 
of assimilation, other scholars such as Rumbaut and Portes (2001) have pointed 
out the variation in immigrants' paths of adaptation. 


Segmented Assimilation 


Rumbaut and Portes (2001:6) offer an alternative to the straight-line assimilation 
model, known as segmented assimilation. In this model, assimilation into 
mainstream U.S. society is an outcome for some, but not all, immigrants and 
their children. They argue that processes of assimilation are affected by a number 
of factors including: the human and financial capital of the immigrant parents, 
the pace of acculturation, family structure, cultural and economic obstacles, the ` 
perceived race of immigrants, and the community and family resources available 
to the immigrants and their children in the United States. 

Segmented assimilation theorists (Zhou 1997, Rumbaut and Portes 2001) 
delineate three exclusive paths of assimilation into U.S. society. Those immigrants 
` with high levels of human, social and financial capital are likely to experience 
a favorable reception in the United States and embark on a path towards 
assimilation and upward mobility. Another path is that of downward assimilation, 
which occurs when immigrants with few resources are unable to find employment 
with suitable wages and are confined to poor inner-city areas. Consequently, 
their children identify with the experiences of historical minorities and do not 
adopt the "immigrant optimism" (Kao and Tienda 1995) of their parents. The third 
path is that of selective acculturation, where immigrant parents encourage their 
children to conform to the norms of success in the United States, while keeping 
them from assimilating into oppositional adolescent culture, by reinforcing 
traditional cultural values and ways of life. Rumbaut and Portes (2001) purport 
that selective acculturation is most beneficial for children of immigrants, as it 
allows them to hold on to the positive traits of their ethnic communities without 
risking downward assimilation into the underclass. 

Segmented assimilation theory provides a comprehensive description of the 
contemporary processes of assimilation in the United States, yet it does not fully 
address the primacy of race in processes of assimilation. For example, Portes 
and Rumbaut (2001:173) argue that pan-ethnic identity is "an indicator of growing 
awareness of externally defined ethnic categories," yet they do not discuss the 
role of racial discrimination in reinforcing these categories. They further posit that 
non-whiteness poses a barrier to occupational mobility and social acceptance 
(56). They also point out that "the children of Asian, black, mulatto and mestizo 
immigrants cannot so easily reduce their ethnicity to a level of a voluntary decision." 
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(55) They could have used these findings to conclude that brown and black 
immigrants' assimilation options are constrained by their skin color, but they do 
not come to that conclusion. As it stands, their theory of segmented assimilation 
does not fully account for the role of racialization in forging immigrants' paths of 
adaptation. This is because these scholars do not address the extent to which 
whiteness is a prerequisite for assimilation into dominant culture, nor do they 
fully interrogate the meanings of the ethnic identities they discuss. As Itzigsohn 
et al. (2005:53) point out in their recent article, "in segmented assimilation theory, 
there is no exploration of the meanings attached to identities." In the following 
section, | discuss the meanings of ethnic labels while arguing that whiteness is 
implicit in the unhyphenated American label and positing that Hispanicity implies 
foreignness. Then, | outline the implications that the meanings of these signifiers 
and the consequent discrimination have for understanding immigrants’ paths of 
adaptation. 


Becoming American? 


Proponents of assimilation theory insist that ethnic distinctiveness will eventually 
disappear for the children of immigrants in the United States. However, a 
number of other scholars have found that many immigrants associate being 
American with being white. For example, Rumbaut (1994) recounts that a 16- 
year old Cambodian girl, when asked about her American identity responded: 
“How could | be American? | black skin, black eyes, black hair ... My English is 
not good enough and my skin color black.” (750) In addition, a Hispanic-looking 
appearance entails treatment as a foreigner even for native-born Americans. For 
example, Prudence Carter’s (2005) Dominican informant, despite having been 
born in the United States, said that she was Hispanic and not American because 
American society does not accept her. Patricia Zavella (1996) recounts the story 
of a Chicano, who despite the fact that he and three generations of his ancestors 
had been born in the United States, was deported to Mexico, because he could 
not produce his birth certificate. One can clearly see here how this man's non- 
whiteness prevented him from being perceived as an American. This would never 
happen to the great-grandson of an Irish immigrant, for his American-ness would 
not be put into question. 

The fact that Hispanicity is associated with foreignness can also be seen in 
questions Latinos often get about their origins. Many U.S.-born Latinos and Latinas 
report that when they are asked where they are from, the answer California or 
Texas only begets the well-known follow-up: "But, where are you really from?" 

While Alba and Nee (1997) acknowledge that assimilation can involve becoming 
a white or a black American, it is not clear that these options are available for all 
immigrants from Latin America, for all Puerto Ricans or for all Chicanos/as, since 
many of these people see themselves as neither white nor black (Rodriguez 
2000). Many of the aforementioned immigration scholars acknowledge that race 
plays a role in the process of immigration, yet none have specifically addressed 
the relationship between Latinos/as' racial status, experiences of discrimination, 
and the varying paths of assimilations for Latinos and Latinas. 
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As mentioned earlier, Rumbaut and Portes (2001) specify three paths of 
assimilation. However, these three paths do not account for the experiences of 
all Hispanics. The first path is assimilation into the dominant culture. This can be 
interpreted as becoming white because it implies becoming an unhyphenated 
American, and the unqualified term, American, assumes whiteness (Tuan 1998). 
Many scholars argue that Latinos/as are summarily categorized as non-white in the 
United States (Itzigsohn and Dore Cabral 2001, de la Cadena 2000, Alcoff 2000). 1 
would suggest that this is an exaggeration and that some Latinos/as can and do 
become (or remain) white in the United States, but that these are not the majority of 
cases. At any rate, it is likely that those Hispanics who are categorized as non-white 
will not be able to assimilate un-noticed into mainstream American culture. 

The second path of assimilation is downward assimilation into minority 
oppositional culture. The main problem with this depiction is that it fails to 
distinguish between distinct minority groups. For example, while both Latino 
Americans and black Americans are victims of racial oppression in the United 
States (Feagin 2000), their experiences with this oppression are quite distinct. 
While it is undeniable that African-Americans are also hyphenated Americans, 
people in the United States do not assume that blacks are foreigners, as they 
do for Hispanics. The critical difference here, and the difference that is most 
relevant for the arguments presented in this paper, is that regardless of actual 
citizenship status, black Americans are assumed to be U.S. citizens, while Hispanic 
Americans are often assumed to be foreigners, and are thus frequently asked: 
Where are you really from? This treatment by other Americans is what leads 
Hispanic-Americans to reject the label, American. Other authors (Fordham and 
Ogbu 1986, Waters 1999, Zhou 1997) have argued that experiences of racism lead 
many African-descended youth to develop an oppositional identity. Conversely, 
| am arguing that national minority oppression inhibits Hispanic-Americans from 
developing an identity as Americans. This latter process is theoretically distinct 
from downward assimilation primarily because of the foreignness associated 
with Hispanics. 

The third path of assimilation is selective acculturation. Without denying 
the importance of this trend, it must be acknowledged that this path will only 
persist across several generations when there is a concentration of Latinos in 
a particular area, such as Cubans in Miami or Puerto Ricans in New York City. 
Without such a community, after a number of generations, the attachment to the 
homeland will be weakened, and the children of immigrants will lack meaningful 
cultural resources with which to shelter their own children. Mia Tuan (1998) 
documents the loss of cultural ties to the homeland for third and later generation 
Asian Americans in California. It is reasonable to expect to witness a similar trend 
for Latino Americans, although the trend may be slower in light of the relative 
proximity of the United States to Latin America, and the prevalence of ethnic 
enclaves. Judith Blau (2003) suggests that the great physical distance between 
the United States and Asia may encourage Asian-Americans to assimilate faster 
than Latino-Americans, whose homelands are geographically closer and thus 
more accessible. Nevertheless, it is worth mentioning that while there are 5,179 
miles between Seattle, Washington and Seoul, Republic of Korea, there are 
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5,538 miles between Seattle, Washington and La Paz, Bolivia, and 6,885 miles 
between Seattle and Buenos Aires. Again, geographic proximity (or lack thereof) 
contributes to the diversity of the Latino community in the United States. 

What is absent in the aforementioned theories of assimilation is a more 
comprehensive understanding of the racialization of Hispanics in the United States, 
and of how this racialization affects Hispanics' paths of assimilation. The question 
that | will address in detail is the extent to which the experiences that Hispanics 
have that teach them that they are not Americans affect the process of ethnic 
identification. This involves the recognition that not all Hispanics have the same 
experiences in the United States. One important way in which their experiences 
vary is based on their perceived racial status here in the United States. 

In contemporary U.S. society, we learn our racial place through interactions 
with others. If others classify us as white, we learn to expect preferential 
treatment. If others classify us as something other than white, we learn to expect 
marginalization. This marginalization plays out in different ways, depending on a 
wide variety of factors, including, but not limited to, skin color, manner of speaking, 
body language, hair texture and facial features. There is clearly not a "monolithic 
minority" (Vaca 2004) experience; people of color experience marginalization in 
distinct ways. There has been little research on how racism works in different 
ways for blacks and Latinos in the United States. Nevertheless, it is clear from 
current research that Latinos experience different forms of oppression, according 
to whether or not they are perceived to be white, black or Hispanic. Turnovksy 
(2004), for example, recounts that black Panamanians had to perform Latinidad 
in order to be deemed eligible for day labor position (and not to be confused with 
African-Americans), while Ecuadorians that were mistaken for Mexicans were 
presumed to be hard-working and willing to work for low pay. Aranda and Rebollo- 
Gil (2004) tell readers that their phenotypically white Puerto Rican informants 
often were met with surprise when they informed people that they were Puerto 
Rican. Some of these white Puerto Ricans have the option of passing for white, 
while others did not because of their accent or forms of expressiveness. These 
and other studies show that only phenotypically white people born in the United 
States who speak Standard Received English and don't move their hands too 
much can call themselves unhyphenated Americans without this label being 
questioned by others. 

It is important to point out that, despite the fact that external categorizations 
affect identity (Nagel 1994, Jenkins 1994), each individual has the ability to 
accept, embrace or reject these categorizations. For example, Aranda and 
Rebollo-Gil's (2004) informants rejected others assumptions that they were 
white, and insisted that they were Puerto Rican. Likewise, Turnovksy's (2004) 
Panamanian informants rejected others’ labeling of them as black by emphasizing 
their non-African-American ethnic attributes. These and other studies also show 
that people’s categorizations are contextual. Some people may be alternatively 
categorized as black, Latino or Dominican in distinct contexts. In addition, some 
individuals know how to manage these categorizations such that they have some 
control over how they are categorized. In any case, whether people choose to 
accept, embrace or manipulate their categorizations, these categorizations have 
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an effect on how individuals think of themselves and how they understand their 
social location. In this article, | argue that Latinos who are categorized as non- 
white are more likely to experience discrimination. | am further arguing that many 
of these Latinos willinterpret this discrimination as an indication that they are not 
welcome in white American spaces, and that they are thus more likely to reject 
the label “American” as a self-identifier. 


Racialized Assimilation 


The term, Hispanic, refers to persons of Latin American, Chicano or Puerto Rican 
descent who live in the United States. It is important to note that the Hispanic 
label does not explicitly imply belonging to a particular racial group. In fact, 
Hispanics in the United States are a racially diverse group of people (Rodríguez 
2000). We know that individuals in the United States are treated differently on 
the basis of their perceived race (Tuan 1998). As such, it is reasonable to expect 
that not all Hispanics receive the same treatment here in the United States. Other 
scholars have documented this. For example, Clara Rodríguez found that many 
dark-skinned Dominicans in New York recognize that fellow New Yorkers view and 
treat them as black and not as Hispanic (2000:140-1). Ginetta Candelario (2001) 
reported that the majority of Dominicans in the primarily black city of Washington, . 
D.C. racially identified themselves as black, because they are considered to be 
black by others. Steven Ropp tells us that Asian Latinos are usually treated as 
Asians, and not as Latinos/as in daily interactions (2000:24). Because not all 
Puerto Ricans, Chicanos, Latinos and other immigrants from Latin America fit 
the prevailing stereotype of what Hispanics are supposed to look or act like, 
Hispanicity can become an "ethnic option" (Waters 1999) for white, black and 
Asian Hispanics. 

Despite the fact that "Hispanic" is officially an ethnic label, it functions in 
some ways as a racial label. In other words, not unlike the labels, Asian and 
black, the label Hispanic is often assigned to people on the basis of their physical 
appearance. When people discriminate against blacks because of their race, 
they make the assumption that their skin color entails that they have certain 
cultural characteristics (i.e., that they like to eat fried chicken or that they enjoy 
rap music). The same process occurs with Latinos who fit into the somatic image 
most people in the United States have of Latinos. For these Latinos, people in 
the United States might assume that they are willing to work for low pay or 
that they are undocumented immigrants. Latinos who do not fit the mestizo/ 
mulatto image that people in the United States have of Latinos are more likely to 
experience ethnic than racial discrimination as Latinos. This could be based on 
their surname or any accent they may have when speaking English. 

Contrary to arguments made by Aranda and Rebollo-Gil (2005), | would argue 
that Latinos who do not look Latino do not experience racism for their Latinidad, 
while they may experience ethnic discrimination. This is because, while ethnic 
and racial labels are both social constructions, ethnicity invokes a language of 
place while race invokes a language of phenotype (Wade 1999). The distinction 
here is that ethnic characteristics can be expected to dissipate over the course 
of generations, while racial characteristics are more likely to persist across 
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generations, especially when there are racially endogamous unions. The ethnic 
label Hispanic refers to people whose geographic origins lay south of the Rio 
Grande, and is applied based on cultural characteristics while the racial label, 
Hispanic, is applied based on phenotype to people who look like they have Latin 
American ancestry, according to stereotypes that exist in. the North American 
imaginary. The way this plays out in daily life is that Latin Americans and their 
descendants with an Indian/mestizo phenotype, or that appear to be mixed- 
race are more likely to be labeled as Hispanic than their European-descended 
or African-descended counterparts who look white or black (Lopez and Stanton- 
Salazar 2001). Moreover, as a result of this racialization, some Latin Americans 
are beginning to recognize a commonality here in the United States and are 
themselves taking on the label, Hispanic or Latino. 

Discrimination occurs when Latinos can be ethnically or racially classified 
as Latinos, but racial discrimination is more likely to affect Latinos who fit the 
Hispanic somatic norm image, regardless of their level of assimilation. Moreover, 
Hispanics' perceived racial classification is a fundamental part of the process 
of assimilation. The United States is a country where one's perceived race 
often outweighs one's class, gender or even nativity status in determining how 
individuals are treated on a daily basis (Feagin 2000, Tuan 1998, Waters 1999). 
Because Hispanics belong to different racial groups, it is reasonable to expect 
that not all Hispanics will be treated equally. Hispanics who are viewed.as white 
in the United States are less likely to face racial discrimination and more likely to 
follow a similar path of assimilation to that of Irish, Italian or Polish Americans. In 
other words, they will become Americans, perhaps with a symbolic attachment 
to their national origins. However, Hispanics who are perceived to be black in 
the United States are likely to face discrimination as other African-Americans do, 
and to develop an oppositional identity as African-Americans. On the other hand, 
those Hispanics who have the racial and cultural features that result in their being 
perceived as Hispanics are less likely to assimilate and adopt an identity as an 
American, and more likely to develop a hyphenated identity as Latino or Latina 
Americans. This idea that racial status plays a key role in immigrant adaptation is 
central to the concept of racialized assimilation. 

As such, | expect to find in my analyses that Latino/a Americans who face 
discrimination here in the United States are less likely to adopt the ethnic label 
American than those who do not experience discrimination. | further expect 
darker-skinned Hispanics and Hispanics who have experienced discrimination in 
the United States to be more likely to self-identify as Hispanics or Latinos/as than 
as Americans because they view that label as reserved for white Americans. 


Datasets: 1989 LNPS and 2002 NLS 


Latino National Political Survey Data 


In order to ascertain the ethnic identification patterns of Hispanics, this research 
relies on the 1989 Latino National Political Survey, a survey of nearly 2,800 
Cubans, Puerto Ricans and Mexicans in the United States. In 1990, Mexicans 
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were 61.2 percent of the Hispanic population, Puerto Ricans were 12.1 percent, 
and Cubans 4.8 percent.2 They accounted for 78.1 percent of the Hispanics in 
the United States in 1990. It is conceivable that the findings discussed in this 
paper are specific to these three national-origin groups and not to all Hispanics, 
but | believe that my conclusions are applicable for Hispanics from other national 
origin groups. This is because when Hispanics face discrimination in the United 
States, it is often because they are categorized as Hispanic not as a particular 
nationality and presumed to share the characteristics associated with Hispanics. 
It is likely that people in the United States who discriminate against Hispanics are 
not able to distinguish between a Mexican and a Guatemalan, or a Puerto Rican 
and a Dominican, or even between a Bolivian and a Cuban. 

The LNPS includes interviewer-coded skin color; the respondents were 
all adults; and it is nationally representative of the Puerto Rican, Mexican and 
Cuban populations in the United States in 1989. This data set is particularly well 
suited to address the questions posed in this paper because of the broad sample 
of Hispanics from all over the country, and of different generational statuses. 
In addition to the representative sample population of the LNPS, | was also 
interested in the unique data on interviewer-coded skin color. 

The Latino National Political Survey (LNPS) was conducted in 40 Standard 
Metropolitan Statistical Areas, and is representative of 91 percent of the Mexican, . 
Puerto Rican and Cuban populations in the United States. All respondents are at 
least 18 years of age, and have at least one parent who is solely of Mexican, 
Cuban or Puerto Rican ancestry, or at least two grandparents who are. The overall 
response rate for Latinos/as was 74 percent. The LNPS includes 1,546 Mexicans, 
589 Puerto Ricans and 682 Cubans. Latinos/as who lived in states that were 
either less than 5 percent Latino or who lived in communities that were less than 
3 percent Latino in 1989 were not included in the sampling universe. Interviews 
were conducted in Spanish and English by bilingual interviewers. ? 

Of the respondents in the LNPS dataset, 64 percent were first generation 
immigrants, 21 percent were second generation, and 14 percent were third 
generation. Among the first generation respondents, the average age at 
immigration was 24 years. Sixty-three percent of the respondents were U.S. 
Citizens, and 41 percent had completed high school. Eleven percent of the 
respondents only spoke English at home, and 90 percent agreed or strongly 
agreed that citizens of the United States should learn English. The descriptive 
statistics for the variables used in the analyses are presented in Table 1. 


Pew Hispanic Center/Kaiser Family Foundation 2002 National Survey of Latinos 


The 2002 National Survey of Latinos (NSL) is a representative sample of the 
Hispanic population in 2002. The 2002 NSL was conducted by telephone during 
the time period, April 4, 2002 to June 11, 2002 among a nationally representative 
sample of 4,213 adults 18 years and older, including 2,929 Latinos and 1,284 non- 
Latinos. Interviews were conducted with 1,047 Mexicans, 317 Puerto Ricans, 
343 Cubans, 204 Salvadorans, 235 Dominicans and 214 Colombians. In total, 
341 Central Americans and 394 South Americans were interviewed. All of the 
respondents indicated that they were of Hispanic or Latin origin or descent such as 
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Table 1: Descriptive Statistics for Variables Used in Analyses, 1989 Latino National Political Survey 
A AA A A A 


Variable Name Percentage Observations 
Dependent Variable: Ethnic Self Identification 
American 16.56 462 
Hispanic, Latino, Hispano or Spanish 38.42 1,072 
Chicano, Nuyorican, Mexican-American, 8.49 237 
Spanish-American, Cuban-American, Raza 
Mexican(o), Cuban or Puerto Rican 36.52 1,019 
Dichotomous Independent Variables 
Female 58.28 1636 
Male 41.72 1171 
-R owns home 32.61 914 
R rents or lives with others 67.39 1889 
R has experienced discrimination 27.65 776 
R has not experienced discrimination 712.35 2,031 


Multi-category Independent Variables 
National Origin (Ref: Cuban) 


Cuban 24.12 677 
Puerto Rican 20.95 588 
Mexican 54.90 1541 
Generational Status (Ref: Came to the U.S. after age 18) 
After age 18 43.50 1221 
Before age 10 9.05 254 
Between 11 and 17 11.86 333 
Second Generation 21.34 599 
Third or Fourth Generation 14.25 400 
Educational Attainment (Ref: Less than High School) 
Less than High School 56.08 1458 
High School Graduate 22.31 580 
Some College 14.96 389 
College Graduate 6.65 173 
Language Use (Ref: Spanish Dominant) 
Spanish Dominant 36.41 1021 
English Dominant 11.38 319 
Bilingual 52.21 1464 
Spouse's Ethnicity (Ref: Latino/a Spouse) 
Latino/a Spouse 46.75 1309 
No Spouse 43.25 1211 
Anglo Spouse 5.82 163 
Other Spouse 4.18 117 
Interviewer-coded Skin Color (Ref: Light or Very Light) 
Light or Very Light 47.31 1328 
Medium 35.34 992 
Dark 14.57 409 
Very Dark 2.10 59 
Annual Household Income (Ref: Less than $20,000) 
Less than $20,000 51.35 1,491 
Between $20,000 and $34,999 24.54 638 


More than $35,000 18.12 471 
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Mexican, Puerto Rican, Cuban, Dominican, Central or South American, Caribbean 
or some other Latin American background. The sample for the data was obtained 
through a stratified random digit dialing system. All of the telephone interviewers 
were bilingual and the respondents were given the opportunity to complete the 
survey in Spanish or English. 

In the sample used in this paper, 56 percent were female, 49 percent had a 
household income greater than $30,000; 39 percent owned their own homes; 
52 percent were immigrants to the United States who came after the age of 18; 
32 percent had less than a high school education; and 45 percent are Spanish- 
language dominant. The descriptive statistics for the variables used in the 
analyses are presented in Table 2. 


Cross-Tabulations: Self-identification in the 2002 National Survey of Latinos 


In the 2002 Latino National Survey, respondents were asked “People choose 
different terms to describe themselves. ... Please tell me whether you have 
ever described yourself as ... An American.” They were also asked two other 
questions which were identical, but instead of American, they were asked if they 
identified as Latino, and then by their national origin. | ran cross tabulations on 
these three variables with variables on generational status in order to see the 
extent to which generational status correlates with ethnic identification. As you ` 
can see in Table 3, ethnic identification is highly correlated with generational 
status, as Gans (1979) would have predicted. 

In Table 3a, we see that respondents who have been in the United States 
longer are increasingly likely to self-identify as American. In Table 3c, we see that 
people who have been in the United States longer are increasingly less likely 
to self-identify with their national origin. The one exception to this is the fact 
that third and fourth generation immigrants are more likely to self-identify with 
their national origin than second generation immigrants. While only 19 percent 
of second-generation immigrants identify with their parents' country of origin, 35 
percent of third and fourth generation immigrants identify with their grandparents’ 
country of origin. This is likely because nearly all (88 percent) of the third and 
fourth generation immigrants report their national origin as Mexico (67 percent 
of total), Spain (13 percent of total), or Puerto Rico (7 percent of total). Given the 
proximity of Mexico, the prevalence of Mexican and Puerto Rican enclaves, as 
well as the circular migration patterns between Mexico and the United States, 
and Puerto Rico and the United States, self-identification with a Mexican or Puerto 
Rican national origin in the third or fourth generation is less surprising. Table 3b 
is also interesting in that it indicates that people who self-identify as Latino or 
Latina are those who have been in the United States for the least amount of time. 
Nevertheless, even Gordon (1964) would agree that there are factors other than 
generational status that affect the assimilation process. 

The question that will be addressed in this paper is: What are the other factors 
that affect ethnic identification? More specifically, in addition to the traditional 
assimilation variables, do experiences of discrimination also affect ethnic 
identification? Logistic regressions are a useful tool to use to respond to these 
questions, as they will permit me to analyze the likelihood that respondents 
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Table 2: Descriptive Statistics for Variables Used from 2002 National Survey of Latinos Data 


Variable Name Percentage Observations 
Dependent Variables 
Self-identifies as American 54.66 1589 
Does not self. identify as American 45.34 1318 
Self-identifies as Latino or Latina 80.75 2358 
Does not self. identify as Latino or Latina 19:25 562 
Self-identifies as National Origin 88.24 2537 
Does not self-identity with National Origin 11.76 338 
American identity is most important 20.27 578 
American identity is not most important 79.73 2273 
Dichotomous Independent Variables 
Female 56.13 1644 
Male 43.87 1285 
R owns home 38.51 1128 
R rents or lives with others 61.49 1801 
R has experienced discrimination 61.69 1807 
R has not experienced discrimination 38.31 1122 
R is married or co-habiting 59.97 1750 
R ís not married or co-habiting 40.03 1168 


Multicategory Independent Variables 
National Origin (Ref: Other Countries) 


Other Countries 38.10 1116 
Puerto Rican 10.82 317 
Mexican 35.75 1,047 
Colombian 1:21 214 
Dominican Republic 8.02 235 
Generational Status (Ref: Came to the US after age 18) 
After age 18 48.12 1,292 
Before age 10 TAS 192 
Between 11 and 17 11.66 313 
Second Generation 19.59 526 
Third or Fourth Generation 13.48 362 
Educational Attainment (Ref: Less than High School) 
Less than High School 31.63 920 
High School Graduate 31.59 919 
Some College 17.88 520 
College Graduate 18.91 550 
Language Use (Ref: Spanish Dominant) 
Spanish Dominant 44.68 1309 
English Dominant 23.45 687 
Bilingual 31.86 933 
Annual Household Income (Ref: Household Income less than $30,000) 
Less than $30,000 50.99 1439 
Between $30,000 and $49,999 26.29 649 


More than $50,000 22.72 561 
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Table 3: Cross Tabs from 2002 National Survey of Latinos Data 


 —— ns V 
Table 3a: Row Percentages of Self-identity as American by Generational Status 


Self-identifies as American 


Yes No Total N 
Immigrated age 18 or older 32.00 68.00 1278 
Immigrated age 11 to 17 36.54 63.46 312 
Immigrated before age 10 61.90 38.10 189 
Born in the United States 85.55 14.45 526 
R and parents born in the United States 96.95 3.05 361 
Total Responses 59.60 49.44 2666 


Table 3b: Row Percentages of Self-identity as Latino by Generational Status 


Self-identifies as Latino 


Yes No Total N 
Immigrated age 18 or over 84.07 15.93 1287 
Immigrated age 11 to 17 85.90 14.10 312 
Immigrated before age 10 TT.60 22.40 192 
Born in the United States 77.14 22.86 525 
R and parents born in the United States 67.78 32.22 360 
Total Responses 80.27 19.73 2676 


Table 3c: Row Percentages of Self-identity as National Origin by Generational 
Status 


Self-identifies as National Origin 


Yes No Total N 
Immigrated age 18 or over 95.42 4.58 1287 
Immigrated age 11 to 17 93.87 6.13 310 
Immigrated before age 10 92.71 7.29 192 
Born in the US 19.41 80.59 510 
R and parents born in the US 35.10 64.90 339 
Total Responses 72.59 27.41 2638 


will self-identify as American or Latino/a, while simultaneously taking a number 
of individual-level characteristics into account. Specifically, | can consider the 
likelihood that respondents who have experienced discrimination will self-identify 
as Latino/a, American or with their national origin while holding other variables 
related to assimilation constant. 


Operationalizing Ethnic Identification: 1989 Latino National Political Survey 


In order to gain insight into the extent to which Hispanics are identifying as 
Americans, | developed a system of categorizing the possible alternatives to the 
American label that respondents to the 1989 Latino National Political Survey could 
have chosen, for the purposes of performing multinomial logistic regressions. 
The four types of identifiers, used to ascertain levels and types of assimilation, 
are displayed in Table 4. In Table 4, anyone who self-identified as American was 
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coded as American regardless of additional labels they may have chosen. By 
contrast, 21.72 percent who self-identified as Hispanic or Latino were coded as 
American, and 78.28 percent as Hispanic. 


Table 4: Row Percentages of Translation of Self-labels to Labels Used in Analyses 





Labels Used in Analyses 


Self-label American Hispanic/Latino/a Hyph. Amer. Natl. Origin 
American 100.00 .00 .00 .00 
Hispanic/ 

Latino/a 21.72 78.28 .00 .00 
Hyphenated 

American 26.27 35.87 37.86 .00 
National Origin 13.13 32.58 6.09 48.19 
Definition of Labels 

American Identified as American 


Hispanic/ Latin Identified as Hispanic, Latino, Hispano, Spanish or Hispanic/Latino- 
American and NOT as American 

Hyphenated Identified as Chicano, Nurican, Mexican-American, Spanish-American, 

American Cuban- American, Tex-Mex or Raza and NOT as American, Hispanic, 
Latino or Hispano 

National Origin Identified as Mexican, Mexicano, Cuban or Puerto Rican, and NOT as 
Chicano, Nurican, Mexican-American, Spanish-American, Raza, 
American, 


The dependent variable in the multinomial logistic regressions presented in 
this paper is the respondent's self-identification. This question was the second 
question in the interview that the respondents were asked, just after the question 
on their date of birth. The respondents were asked: "Here is a list of names that 
are used to describe persons of Spanish heritage. Please tell me all of those, if 
any, you call yourself." The choices were: Mexican, Mexicano, Cuban, Puerto 
Rican, Spanish, Chicano, Nuyorican, Mexican-American, Spanish-American, Raza, 
American, Hispanic, Latino or Hispano. Respondents also wrote in: Black, White, 
Cuban-American, MexTex/Tex-Mex, Hispanic/Latino-American, Mexican Indian 
and American Indian. 

The first identifier is American. This category encompasses all respondents 
who chose to check off the American box when given a list of possible identifiers. 
This category includes respondents who chose other identifiers in addition to 
American. Only 2 percent of respondents chose only the American identifier; 
yet 14 percent chose the American identifier in addition to another identifier. 
American is the reference category in the multinomial logistic regressions 
because respondents in the other categories did not self-identify as American. 

The second category is the adoption of a pan-ethnic label. | have included 
adoption of Hispanic, Latino, Hispano or Spanish^ in the same category, because 
these are all pan-ethnic labels. The decision as to which of these labels an individual 
chooses is often determined by regional location within the United States, with 
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Hispano being most popular in New Mexico, Hispanic in rural areas, and Latino 
in large cities (Melville 1988). Without ignoring the relevant literature on the 
distinction between these pan-ethnic labels (Oboler 1995; Melville 1988), | have 
chosen to concentrate on the characteristic they share - that of a recognition of 
a shared experience with other persons of Latin American descent. The adoption 
of a pan-ethnic label is a decision based on racialized experiences here in the 
United States. Adoption of one of these pan-ethnic labels is a process that most 
likely occurs in the United States because these labels are not commonly used 
in Latin America. 

The next category is that of hyphenated American categories. These labels are 
distinct from pan-ethnic labels, as they indicate an association with a particular 
national origin, and not necessarily with other Hispanics. This category includes 
labels such Chicano, Nuyorican and Cuban-American. Like pan-ethnic labels, 
these are identifications specific to the United States, and thus indicate a step in 
the assimilation process. While they signify the recognition of a commonality of 
experiences with others that share one's national origin, they do not indicate an 
allegiance to Latin Americans from other countries. For example, self-identifying 
as Chicano indicates solidarity with people of Mexican descent who live in the 
United States, but does not necessarily signify solidarity with immigrants from 
Bolivia or the Dominican Republic. The label, Latino, on the other hand would 
encompass immigrants from Argentina to Venezuela to Mexico. 

The final category is national origin. This includes those respondents who 
did not choose an American or an American-made ethnic identification. They 
identified as Mexican(o), Cuban, Puerto Rican or Spanish. 


Independent Variables: 1989 Latino National Political Survey 


The analyses presented include the following demographic control variables: 
gender, national origin and age. They also include the following social assimilation 
variables: generational status, education, home ownership and income. They 
further include the following variables accounting for cultural assimilation: 
language use in the home and spouse's ethnicity. The two additional variables 
of interest are whether or not the respondent has been discriminated against, 
and the respondent's skin color. Gender is a dichotomous variable coded as 1 if 
the respondent is female and as 0 if the respondent is male. For national origin, 
Cuban is the reference category, and Mexican and Puerto Rican are the included 
categories. Age is a continuous variable. 

Also included are social and cultural assimilation variables in order to determine 
if measures of skin color and experiences of discrimination affect processes of 
ethnic identification, after accounting for traditional measures of assimilation. 
Education is a categorical variable, with two dummy variables. The included 
categories are completed high school and at least some college. The reference 
category is less than a high school education. Yearly income is coded into three 
categories, with the reference category including those households with an 
annual household income of less than $20,000. Home ownership is coded as 1 
if the respondent reports that they own a home, and as O if they are renting or in 
other arrangements. Generational status is a categorical variable. The reference 
category includes those respondents who were not born in the United States. The 
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two dummy variables are Second Generation and Third Generation immigrants. 
Second generation means the respondent's parents were born abroad, but the 
respondent was born in the United States. Third generation means the respondent 
and his or her parents were born in the United States. 

Language use is based on the question: "What language do you usually speak 
at home?" This is coded into three categories. The reference category is if the 
respondent speaks mostly or exclusively Spanish at home. The three included 
categories are speaking only or mostly English at home, and speaking Spanish 
and English equally at home. Spouse's ethnicity is based on the question: "Is 
your spouse/the person you live with Mexican, Puerto Rican, Cuban, other Latino 
origin, Anglo or some other origin?" In the analyses, a Latino/a, Mexican, Puerto 
Rican or Cuban spouse is the reference category. | have included three other 
dummy variables: No spouse, Anglo spouse or other spouse. 

The variable for discrimination is coded as 1 if the respondent reports that he 
or she has experienced discrimination, and 0 otherwise. This variable is based 
on the question: “Because you are a Cuban [Mexican, or Puerto Rican] have you 
ever been turned down as a renter or a buyer of a home, or been treated rudely 
in a restaurant, or been denied a job, or experienced other important kinds of 
discrimination?” 

The variable for skin color is based on the interviewer's coding of the 
respondent's skin color. The interviewer reported whether the respondent was: 
Very. Light, Light, Medium, Dark or Very Dark. This is a categorical variable. 
The reference category included Light and Very Light. | combined these two 
categories under the presumption that they are more likely to be able to pass for 
white than the other respondents. The two included categories are Medium and 
Dark or Very Dark. Dark and Very Dark are combined because of the small n of 
Very Dark (n = 59). 


Dependent Variables: 2002 National Survey of Latinos 


There are four dependent variables used in these analyses. The first is the 
respondents’ response to the question: “People choose different terms to 
describe themselves. ... Please tell me whether you have ever described yourself 
as ... An American.” The second is the respondents’ response to the question: 
“People choose different terms to describe themselves. ... Please tell me whether 
you have ever described yourself as ... A Latino or Latina.” The third is this same 
question with their national origin (such as Mexican or Cuban) inserted at the 
end. For these three binary logistic regressions, | coded as 0 those respondents 
who answered no to this question, and 1 those who responded yes. The fourth 
logistic regression is based on the respondents’ answer to the question as to 
which of the terms they would use to describe themselves first. Respondents 
who answered “American” to this question are coded as 1 and those who gave 
another identifier are coded as 0. 


Independent Variables: 2002 National Survey of Latinos 


For this analysis, variables in the 2002 NSL were matched as closely as possible 
with those from the 1989 Latino National Political Survey and are displayed in 
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Table 2. Demographic variables included: Female (coded as 1 if female, 0 if 
male); National Origin (included: Mexican, Puerto Rican, Dominican Republic, 
Colombian); Age (continuous); Education (coded as 0 if less than high school, 
with three dummy variables: high school or GED completed, some college, and 
college graduate); and Married or co-habiting (reference: single). The social and 
cultural assimilation variables include: annual household income (the excluded 
category includes those respondents who report this to be less than $30,000; the 
two included categories are between $30,000 and $49,999, and over $50,000); 
home ownership status (coded as 0 if R rents or lives with parents and 1 if R owns 
own home); Generational status (coded as 0 if arrived in the United States after 
the age of 18, with four dummy variables — came to the United States before age 
10, came to the United States between the ages 11 and 17; born in the United 
States, but parents born abroad, and the final category — both respondents and 
parents born in the United States, and language ability (coded as 0 if Spanish- 
language dominant, with two dummy variables for bilingual and English dominant). 
Language use is a three-category variable based on the respondents’ reported 
use of English and Spanish. 

Finally, | included a variable for whether or not the respondents had experienced 
discrimination (coded as 1 if he or she has, and as O if he or she reports no 
cases of discrimination). This variable is based on a series of questions where - 
the respondent is asked whether or not he or she has experienced discrimination 
based on their racial or ethnic background in their daily life or in looking for 
jobs. Respondents were coded as having experienced discrimination if they 
responded positively to any of the following questions about their experiences 
of discrimination. First, they were asked if, during the past five years, they had 
experienced discrimination because of their racial or ethnic background. Then, 
they were asked if they ever had experienced four particular forms of discrimination 
based on their racial or ethnic background in three specific situations. (1) If they 
are ever treated with less respect. (2) If they ever receive poorer service than 
others in restaurants or stores. (3) If they are every called names or insulted. 
(4) If they have ever been denied a job. Sixty-two percent of the respondents to 
the 2002 NSL reported that they had experienced discrimination in response to 
at least one of these questions. In contrast, only 27 percent of the respondents 
to the 1989 LNPS reported having experienced discrimination. This may be 
because respondents to the 1989 LNPS were only asked one question about 
discrimination, as opposed to five. The question 1989 LNPS respondents were 
asked was: Because you are a [Mexican/Puerto Rican/Cuban], have you ever been 
turned down as renter or buyer of a home, or been treated rudely in a restaurant, 
or been denied a job, or experienced other important types of discrimination? 

In contrast to the LNPS, the NSL did not have a variable for skin color. As 
a telephone survey, that would have been tricky to manage. Nevertheless, 81 
percent of the respondents who reported discrimination indicated that it had 
been based on either their physical appearance, their background, the language 
they speak, or a combination of these factors. This indicates that in most cases, 


respondents perceive the discrimination they experience as being based on their 
Hispanicity. 
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In the entire 1989 LNPS survey, only 52 of the 2,800 respondents self- 
identified as black. However, 156 of the 2,893 respondents in the 2002 NLS 
survey self-identified as black. Overall, self-identified blacks were less likely than 
self-identified whites to ethnically identify as American or as Hispanic or Latino, 
and more likely to identify with their national origin. 


Results 


1989 Latino National Political Survey 


The results from the logistic regression for the LNPS are presented in Table 5, 
which indicates that respondents who are more assimilated are more likely to 
self-identify as American. For example, respondents with a high school education 
are 45 percent more likely to self-identify as American than those who did not 
complete high school. Those respondents with at least some college are 47 
percent more likely to self-identify as American, and respondents with a college 
degree are 64 percent more likely to self-identify as American. Second-generation 
immigrants are five times more likely to self-identify as Americans than foreign- 
born immigrants, and third generation immigrants are 10 times as likely. In any 
case, respondents who are more socially and culturally assimilated are more 
likely to self-identify as American than their less assimilated counterparts. It is 
worth noting that female respondents were 27 percent less likely to self-identify 
as American. This may be because women are less likely to work outside the 
home, and that this is impeding the assimilation process. It may also be because 
some Latinas experience gendered discrimination that precludes them irom 
feeling welcome in this country. 

This brings us to the coefficients that have not traditionally been used as predictors 
of assimilation — skin color and experiences of discrimination. The coefficients 
in Table 5 indicate that respondents who have experienced discrimination are 24 
percent less likely to self-identify as American than their counterparts who have not 
experienced discrimination. The coefficients for skin color are not significant at the 
(p « .05) level, although they are significant with a one-tailed test, or at the (p « .10) 
level. In this case, the one-tailed test is appropriate, since the coefficients are in 
the direction of the stated hypotheses. These findings support the hypothesis that 
respondents who have experienced discrimination or who have darker skin are less 
likely to self-identify as American. 

The results from the multinomial regression are presented in Table 6 where 
the factors that affect self-identification as American as well as self-identification 
as Hispanic/Latino/a, as a hyphenated American, or with one's National Origin are 
examined more closely. 

These findings put the results from the experience of discrimination and skin 
color variables into perspective. We see that respondents who experienced 
discrimination were 29 percent (p « .05 - one-tailed) more likely to adopt a 
Hispanic or Latino label than an American label, and 45 percent (p « .05 - two 
tailed) more likely to adopt a hyphenated American label. And, respondents with 
darker skin were more likely to adopt a national-origin identifier than an American 
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Table 5: Beta Coefficients from 1989 LNPS logistic Regression on Self-ID as American 





Coefficient Standard Error 

Female (Ref: Male) -.319*** (.12) 
National Origin (Ref: Cuban) 

Mexican -1.294*** (.19) 

Puerto Rican -.197 (.19) 
Annual Household Income (Ref: less than $20,000) 

Between $20,000 and $34,999 -.154 (.15) 

More than $35,000 .297* (.17) 

R owns home .238* (.13) 
Educational Attainment (Ref: Less than High School) 

High School Graduate Qr (.14) 

Some College .388** (.17) 

College Graduate .495** (.23) 
Spouse's Ethnicity (Ref: Latino/a Spouse) 

No Spouse -.098 (.14) 

Anglo Spouse .409* (.22) 

Other Spouse -.271 (.28) 
Generational Status (Ref: Came to the U.S. after age 18) 

Before age 10 .247 (.25) 

Between 11 and 17 -.288 (.29) 

Second Generation ONES (.23) 

Third or Fourth Generation 2.333" (.25) 
Language Use (Ref: Spanish Dominant) 

English Dominant sor" (.25) 

Bilingual .356* (.19) 
R has experienced discrimination -.281** (.13) 
Interviewer-coded skin color (Ref: Light or Very Light) 

Medium -.231* (.13) 

Dark -.322* (.18) 

Constant 1.950% (.19) 
Observations 2585 


Standard errors in parentheses. 


*** significant at a = .01 (two-tailed); **significant at a = .05 (two-tailed) * significant at a = .05 
(one-tailed) 
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identifier. In fact, respondents with dark or very dark skin were 83 percent more 
likely to self-identify with their national origin label. The combination of these 
findings implies that experiences of discrimination compel some Hispanics to 
take on a politicized pan-ethnic label, while darker-skinned Hispanics are less 
likely to leave behind their national origin labels and adopt any labels in the United 
States. They also indicate that experiences of discrimination and darker skin color 
affect the assimilation paths for Hispanics. 

Looking closely at Table 6, it becomes evident that women and Mexicans are 
the least likely to adopt any new labels in the United States and are the most likely 
to continue to use their national origin as an identifier. In contrast, the respondents 
who have been in the United States for longer, who primarily speak English, who 
own their own homes, and who have more education are more likely to take on 
another identifier here in the United States. The finding that immigrants who are 
more socially and culturally assimilated are more likely to take on Americanized 
identifiers provides support for my argument that self-labels are indicators of 
assimilation. By the same token, the finding that darker-skinned respondents are 
the most likely to not take on an identifier in the United States indicates that dark 
skin may be an impediment to assimilation for Hispanics. 


Findings from 2002 National Survey of Latinos 


The question on self-identification was posed on the 2002 National Survey of 
Latinos in a different manner than on the 1989 Latino National Political Survey. 
Respondents were asked, "People choose differentterms to describe themselves. 
l'm going to read you a few different descriptions. Please tell me whether you 
have ever described yourself as any of the following (Respondent's/Parent's 
country of origin, Latino/Hispanic, an American. As such, | was able to run three 
separate logistic regressions that consider the factors that affect the likelihood 
that respondents will self-identify with each of these three identifiers independent 
of one another, and an additional regression on whether or not the respondents 
would describe themselves first as American. These regressions are presented 
in Table 7. It is worth noting that 85 percent of those respondents who self- 
identified as American also self-identified with another identifier. This indicates 
that most of the respondents who are included in the American category in this 
analysis could be labeled as hyphenated Americans because they identify as 
American in addition to an ethnic identifier. However, the number of respondents 
who exclusively identify as American was so small that the regression analyses 
with such a dependent variable would not be very revealing. For this reason, it 
was more useful to run a regression with respondents who foremost identified 
as American as the omitted category. 

In Regression 1 of Table 7, respondents who have experienced discrimination 
are 19 percent less likely to self-identify as American. In Regression 2 of Table 
7, respondents who have experienced discrimination are 32 percent more likely 
to self-identify as Latino/a than as American. In Regression 3, the coefficient for 
this variable is not significant for self-identification with one's national origin. The 
combination of these findings provides support for the idea that self-identification 
as Latino/a is a politicized choice made in response to conditions of oppression 
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Table 6: Beta Coefficients from 1989 LNPS Multinomial Logistic Regression on Ethnic Self-ID 
EUR E AT S E A A ee ee ee 
Hispanic Hyph. Amer. Natl. Origin 


Female (Ref: Male) So” .209 
(.13) (.18) 
National Origin (Ref: Cuban) 
Mexican (266: 1.807*** 
(.20) (.37) 
Puerto Rican SEH -.726 
(.20) (.50) 
Annual Household Income (Ref: Household Income less than $20,000) 
Between $20,000 and 34,999 .108 .186 
(.16) (.22) 
More than $35,000 .303 317 
(.18) (.25) 
R owns home -.368*** .199 
(.14) (.19) 
Educational Attainment (Ref: Less than High School) 
High School Graduate -.115 -377 
(.16) (.22) 
Some College -.218 -.255 
(.18) (.25) 
College Graduate -.313 -.808** 
(.25) (.40) 
Spouse's Ethnicity (Ref: Latino/a Spouse) 
No Spouse .090 -.071 
(.15) (.21) 
Anglo Spouse -.439 -.582* 
(.26) (.33) 
Other Spouse 127 357 
(.31) (.41) 
Generational Status (Ref: Came to the U.S. after age 18) 
Before age 10 .017 -.219 
(.26) (.70) 
Between 11 and 17 .419 .808 
(.30) (.55) 
Second Generation -1.465*** 1.096*** 
(.21) (.39) 
Third or Fourth Generation -1,903*** .364 
(.25) (.42) 
Language Use (Ref: Spanish Dominant) 
English Dominant -.943*** -.530 
(.27) (.41) 
Bilingual -.176 -.270 
(.20) (.35) 
R has experienced discrimination 200° .374* 
(.14) (.19) 
Interviewer-coded skin color (Ref: Light or Very Light) 
Medium .167 212 
(.14) (.20) 
Dark 1233 .113 
(.20) (.26) 
Constant .869** -2.622*** 
(.21) (.43) 
Observations 2570 2570 


Standard errors in parentheses. 


*** significant at w = .01 (two-tailed); **significant at a =.05 (two-tailed) * significant at a =.05 


(one-tailed) 


.310** 
(.14) 


1.028*** 
(21) 
-243 
(21) 


197 
(17) 
282 
(21) 
-416** 
(.16) 


-812 
(.18) 
STE 
(21) 
-513* 
(.28) 


130 
(.16) 
-221 
(.32) 

346 
(.35) 


-.532* 
(27) 
186 
(.30) 
-2.754 
(.24) 
-4.339 
(.34) 


181% 
(.32) 
- 515% 
(.20) 
241 
(.16) 


EE 
(.16) 
Biren 
(21) 
1T 
(22) 
2570 
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here in the United States, particularly since experiences of discrimination do not 
have a significant effect on self-identification with one's national origin, while 
they do with Latino/a self-identification. Additionally, the findings in Regressions 
1 and 2 strengthen the results from the LNPS dataset in that we see very similar 
results in a more recent dataset, with a population that is more representative of 
the entire Latino population. 

It is worth noting that none of the coefficients for the social assimilation 
variables are significant with respect to the Latino label. The only variables that are 
significant are those related to language ability and experiences of discrimination. 
Respondents who are fluent in English are significantly less likely to self-identify 
as Latino/a than respondents who lack English abilities. These findings indicate 
that there are most likely other underlying factors that lead people to self-identify 
with the Latino label, and further call into question the practice of restricting 
analyses to traditional measures of assimilation. This is an area for future 
research with datasets more adequate to address this question. At any rate, it 
is remarkable that, while only 39 percent of the respondents in the 1989 LNPS 
dataset self-identified as Latino or Hispanic, 81 percent of the respondents in 
the 2002 National Survey of Latinos took on this label. My speculation is that 
this is most likely due to the fact that the latter data was collected 13 years after 
the first, and that the Latino and Hispanic labels have grown in popularity and 
acceptance in the United States. 


Conclusion 


This article demonstrates the false nature of claims made by white nationalist 
groups that Latinos/as refuse to assimilate or to become Americans. As it turns out 
some Latinos/as are becoming Americans. However, those Latinos/as who face 
discrimination and othering on a daily basis are learning that they are not viewed 
as equals in this country. Despite.their official citizenship in the United States, 
many Latinos/as feel unwelcome here. The rejection of the label American and the 
embracing of the Latino label is a response to this discrimination and exclusion. 

This research demonstrates that experiences of discrimination discourage 
immigrants from Latin America and their children from self-identifying as 
American despite the fact that many of these immigrants and their children are 
U.S. citizens. The finding that respondents who have experienced discrimination 
are less likely to self-identify as American indicates that the decision of people 
born in the United States not to describe themselves as American is a decision 
made in light of experiences of exclusion here in the United States. | have argued 
that this exclusion is part of racism in U.S. society, and is due to the implicit 
whiteness in the label American. 

This paper also elucidates the diversity of experiences and assimilation 
patterns for Latinos/as in the United States. Some Latinos/as are holding onto 
their national origin identifiers and refusing to hyphenate themselves, even in 
the third generation. Others are taking on a hyphenated American identity, and 
still others are assuming the pan-ethnic label. Which path these individuals 
take depends on their experiences in the United States. Those Latinos/as who 
appear ‘white’ and do not face discrimination are more likely to assimilate into 
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U.S. society and become unhyphenated Americans. They, like immigrants from 
Europe, can disappear unnoticed into the melting pot, if they so choose. 

However, those Latinos/as who face discrimination and who are not perceived 
to be white are less likely to be viewed by others, and consequently by themselves, 
as Americans. Yet, even if Latinos/as born in the United States are not Americans, 
they are also not Mexicans or Cubans, since they are also viewed as foreigners in 
Mexico or Cuba. Latin Americans and their descendants in the United States have 
responded to this denial of full citizenship by fostering a new ethnic identity that 
recognizes their shared experiences of discrimination and exclusion in the United 
States. This new identity is that of Latino and Latina Americans. This identity is, of 
course, enriched and fueled by Latino political activism and cultural production. 
Both political and cultural leaders encourage Hispanics to see themselves as part 
of a larger political and cultural community, composed of immigrants from all 
countries in Latin America, as well as Puerto Ricans and Chicanos. 

Nevertheless, it is worth asking how long the idea that to be American entails 
being of exclusively European descent can last. Scholars such as Yancey (2003) 
contend that the boundaries of whiteness will expand to include Hispanics and 
Asians. Others such as Bonilla-Silva (2002) argue that many Hispanics will join 
the "collective black," yet a few will become honorary whites. From where we 
stand today, the future racial status of Latin Americans and their descendants is: 
unclear. What is clear is that many Hispanics are getting the message that they 
are not full citizens of this nation, and are responding by clinging onto ethnic 
labels. As such, the "twilight of ethnicity" (Alba 1985) is unlikely to precede the 
twilight of racism. 


Notes 


1. Reports of this were found on several websites written by Latinas or 
Latinos. Some of these were: "| am a proud morena Chicana but the 
questions posed to me most often are, ‘Where are you from?’” from: 
http://Awww.hispanicmagazine.com/2001 /julaug/Letters/ (accessed 5/24/2005 
5:40 p.m.). "I was at the nail salon the other day and the manicurist asked me 
where | was from, | replied, “From Brownsville. Then she asks, “No but where 
are you really from?” Then | said, ‘Well my parents are Mexican.” From: 
http://dailytexican.blogspot.com/2004/08/soy-chicana-y-no-estes-chingando.html 
(accessed 5/24/2005 5:41 p.m.) "People ask, "Where are you from?" And when 
you say, "New York" or "Idaho," they insist, "No, but where are you really from?" 
— meaning, why are your eyes so dark or your skin so (pick one) brown, black, cin- 
namon, white, tan?" from: http:/fuantornoe.blogs.com/ hispanictrending/books/ 
(accessed: 5/24/2005 5:42 p.m.). 


2. From Census website: http://www.census.gov/apsd/wepeople/we-2r.pdf 


3. From codebook at website: 
http://www.princeton.edu/—data/datalib/codebooks/ codebooks.html 


4 | chose to include Spanish in this category as a pan-ethnic label because 
Spanish is often used to refer to all people of Latin American descent in the 
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United States. To ensure that Spanish was not being used as a national-origin 
identifier, | looked at the self-reported ancestry of respondents who self-iden- 
tified as Spanish. | found that only 41 of the 327 respondents who self-iden- 
tified as Spanish actually reported having a mother, father, grandmother or 
grandfather from Spain. 
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Double-Checking the Race Box: 
Examining Inconsistency between Survey Measures 
of Observed and Self-Reported Race 


Aliya Saperstein, University of California-Berkeley 


Abstract 

Social constructivist theories of race suggest no two measures of race will capture the same 
information, but the degree of “error” this creates for quantitative research on inequality 
is unclear. Using unique data from the General Social Survey, I find observed and self- 
reported measures of race yield substantively different results when used to explain income 
inequality in the United States. This occurs because inconsistent racial classification is 
correlated with other respondent characteristics such as immigrant generation, educational 
attainment and age. 


Over the past decade, it has become commonplace in scientific literature and 
public debate to note that classifying Americans by race is much more difficult 
than it used to be. This observation has been used to advocate for everything 
from increasing the number of categories offered and changing the way the 
questions are asked to calling for an end to counting by race in government 
statistics. Strictly speaking, though, it is not true that the racial classification 
of Americans has suddenly become more complex. The proportion of the U.S. 
population that gives fluid, complicated or seemingly contradictory responses to 
survey questions about race has increased since the 1960s, along with related 
increases in interracial marriages and immigration from Asia and Latin America. 
However, the difficulty of developing a valid and reliable measure of the concept 
called "race" did not cross the border following the McCarran-Walter Act, nor 
was it born in the wake of the Supreme Court's decision in Loving vs. Virginia. 
It existed long before: the U.S. Census Bureau has not used the same number, 
names or definitions of racial categories from one decade to the next, dating from 
the very first enumeration in 1790 (Nobles 2000), and racial identities were no 
less context specific or politically charged in the 1800s (Lopez 1996, Williamson 
1995) than they are now. 

The challenge of describing Americans (or anyone else) by race stems not from 
changing demographics, but from what race is - a socially defined characteristic 
of individuals or groups that is related to the power distribution in society and 
can, therefore, vary over time, place and social context — and what race is not - an 
innate, biologically determined characteristic that explains group ability rankings 


An earlier version of this paper was presented at the 2004 Population Association of Amer- 
ica Annual Meeting. The research was supported by a training grant from the National 
Institute of Child Health and Human Development (T32-HD007275). Many thanks go to 
Mike Hout, Claude Fischer, Leo Goodman, Edward Telles, Samuel Lucas, Rachel Sullivan, 
Ben Moodie, Laurel Westbrook, Damon Mayrl and two anonymous reviewers for their 
helpful comments and suggestions. Direct correspondence to Aliya Saperstein, University 
of California-Berkeley, Graduate Group in Sociology and Demography, 2232 Piedmont 
Avenue, Berkeley, CA 94720. E-mail: asaper@berkeley.edu. 
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and group differences in behavior. Some have argued that its social genesis 
means race is not “real” and therefore a flawed tool for scientific research or 
government data collection. The sociological consensus is that even if race is not 
a scientifically justifiable way of differentiating people, such distinctions still affect 
where Americans live, whom they marry and where they work (ASA 2003). 

Unfortunately, using race, the social construct, in scientific research is 
considerably more challenging than using race, the stable biological trait. If race 
is context specific, it is possible that the same individual could be described in 
different ways by different people. Theoretically, researchers should askthemselves 
whose description is the "right" one for their particular project. Practically, given 
the widespread use of secondary data from large national surveys, researchers 
should at least acknowledge that nonrandom variability in racial classification 
could bias their results. Numerous studies have shown that different methods of 
measuring race can yield disparate population estimates (e.g., Hahn, Mulinare and 
Teutsch 1992; Harris and Sim 2002). What has yet to be clearly demonstrated is the 
effect inconsistent racial classifications have on quantitative research conclusions 
regarding race and inequality in the United States. 

| examine differences between two possible measures of race - one, a self- 
identification reported by the respondent, the other, an observed classification 
recorded by the survey interviewer — using unique national cross-sectional data- 
from the General Social Survey. My results show that the two measures of 
race are not equivalent descriptions of the U.S. adult population, and that using 
these different measures in an otherwise similar analysis of income inequality 
generates not only discrepant results but also substantively different conclusions 
about the socioeconomic rankings of racial populations. At the individual level, 
| find that immigrant generation, educational attainment and age help predict 
inconsistencies between the respondents' observed and self-reported race. 
Further, while much of the individual-level complexity is hidden in aggregate data, 
this is not true for immigrant generation: using self-reported race results in a 
significantly lower estimate of whites with foreign-born parents (as well as fewer 
nonwhites with native-born parents) than does observed race. 

These findings suggest caution when interpreting results based on survey 
research that relies on a single measure of race. If different measures of race 
are associated with different individual characteristics then, as Telles (2002) 
noted, they likely are better suited to different types of research questions. This 
complexity highlights the need for further study of the implications of various 
racial classification methods for substantive research conclusions. Advancing 
scientific understanding of these dynamics will improve all quantitative research 
that uses race as an independent variable — the mainstay of studies of inequality 
in the United States in disciplines from economics to epidemiology — as well as 


the public policies designed to address the racial inequalities revealed by such 
research. 


Different Methods, Different Results 


In the years leading up to and immediately following Census 2000 — with its 
new racial classification system of "mark all that apply” — a flurry of studies 
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were conducted examining how people answer questions about race, and how 
fluid and complex those answers can be. Responses to questions about an 
individual's race have been shown to vary by region or neighborhood minority 
concentration (Xie and Goyette 1997), as well as by the context of the survey 
itself — whether it was administered at school, at home or with someone else in 
the room (Harris and Sim 2002). Researchers have also identified characteristics 
of individuals, such as age, educational attainment or immigrant status, that 
help explain unreliable, unusual or seemingly discrepant responses about race 
or ethnicity (Harris and Sim 2002, Lieberson and Waters 1993, Montalvo and 
Codina 2001, Xie and Goyette 1997). Others have demonstrated that inconsistent 
racial classification methods across data sources affect the calculations of vital 
rates (Hahn, Mulinare and Teutsch 1992; Sugarman et al. 1993). However, most 
of these studies highlight discrepancies in classification for relatively small 
subpopulations, such as American Indians, Latinas or multiracial children. It is 
not clear to what extent these inconsistencies are lost, or become statistically 
irrelevant, when researchers use broad racial categories and analyze large surveys 
designed to be representative of the entire U.S. population. Further, no one has 
explored whether or not using different measures of race in an otherwise similar 
regression analysis alters the results of national-level research on inequality and 
racial difference in the United States. 

In the United States, racial classifications for everything from opinion surveys 
and school records to birth and death certificates are collected, not only at different 
times and places, but from different sources (e.g., interviewers, respondents, 
teachers, doctors, parents, spouses). This means the same individual could be 
described across data sources by a hodge-podge of classifications based to an 
unknown extent on appearance, ancestry, social status or demeanor. Studies 
have shown that a person's multiracial ancestry is rarely accurately determined 
through visual inspection (Harris 2002, Williamson 1995), and that descriptions 
of a respondent's skin tone can vary by whether the race of the interviewer and 
the respondent match (Hill 2002). Goldstein and Morning (2000) also note that 
there are numerous possible definitions of a multiracial population, including: the 
population with mixed racial ancestry, the population of people aware of having 
mixed racial ancestry, and the population of people who report that they are 
multiracial on surveys. Thus, an individual's physical appearance may not always 
match her reported racial identity, which in turn may not necessarily match her 
ancestry. Omi and Winant (1994) go one step further in their theory of racial 
formation, arguing that official racial classifications and minority group identities 
are a subject of constant conflict. Yet, across disciplines, race continues to be 
used as a largely unexamined independent variable, with observed, self-reported 
or ancestrally inferred measures of race often treated as interchangeable. 

It is true that for the vast majority of the U.S. population racial classifications 
collected through different methods will yield equivalent data (Hirschman 2004). 
However, given the types of information various measures of race capture, there 
is no reason to believe that will always be the case. An individual's self report 
of race can draw not only from her ancestry, but also from social cues about 
which identity would be appropriate to claim (perhaps based on her physical 
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appearance) and which identity would be valued in a given context. Numerous 
studies show that Americans are strategic in their use of racial or ethnic identities, 
whether in pursuit of political clout, citizenship rights and economic advantages 
(Almaguer 1994, Davis 2001, Lopez 1996) or for more symbolic, social reasons 
(Waters 1990). Individuals may report only one ancestry despite their knowledge 
of other options or try to "pass" entirely from a disadvantaged group into a more 
advantaged one. Meanwhile, an interviewer might reach a conclusion about a 
respondent's race based on the individual's physical appearance, the appearance 
of others in the home or neighborhood, or the respondent's English proficiency, 
demeanor or visible signs of social status. 

Such is certainly the case in Brazil, where race is defined more by physical 
appearance and social status than by ancestry. For example, Telles (2002) finds 
that Brazilian interviewers "whiten" highly educated, darker-skinned respondents 
(i.e., classify them as white instead of brown, or brown instead of black) even more 
often than the respondents lighten themselves. Using these same observed and 
self-reported measures of race from a Brazilian national survey, Telles and Lim 
(1998) demonstrate that when otherwise similar analyses of income inequality 
between "whites" and "browns" are compared, there is a 10 percent difference 
between the estimates based on the interviewers' classification of race and the 
estimates based on the respondents' self reports. Racial classification is generally. 
considered less ambiguous in the United States because of long-standing legal 
definitions that rely on blood quantum, such as the "one-drop rule," which 
describes anyone with any black ancestry as black. Nevertheless, the potential 
inconsistencies between different measures of race discussed above suggest 
that discrepant results could be found using similar U.S. data. 


Data and Methods 


The General Social Survey (GSS) is a full-probability sample of American 
adults living in households. All data, including wide-ranging social background 
information and attitudinal measures, is derived from an in-person interview. Prior 
to 2002, the standard procedure for measuring race was to direct interviewers 
to designate a respondent's race under three categories: “White,” "Black" and 
"Other (SPECIFY)." This classification by observation was stipulated "only if 
there is no doubt in your mind," otherwise the interviewer was to ask, "What 
race do you consider yourself?" By 1994, as much as 14 percent of the racial 
classifications was based on the respondent's self report. 

In part because the number of self reports was on the rise, the GSS conducted 
a "race experiment" in 1996 and 2000 with the stated goal of developing "valid 
and reliable measures of race that also identity socially meaningful groups and 
are consistent or calibrated with current procedures for measuring race." (Smith 
1997) For the experiment, racial classification was split into two parts: first, the 
respondent's race was recorded as determined by the interviewer, and then 
the respondent was asked to identify her race. Along with their observed racial 
classification, interviewers were prompted to code how confident they were 


in their assessment, using a four-point scale that ranged from "no doubt" to 
"completely unsure." 
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Overall, the GSS race experiment generated 2,869 cases — 1,450 from 1996 
and 1,419 from 2000. In each year, the experiment included 50 percent of the 
total GSS sample for that year. Ninety-five percent of the experiment sample was 
consistently classified.’ This is a much higher percentage than the 79 percent 
consistency found by Telles (2002) among “whites,” “browns” and “blacks” in 
Brazil, which reflects the comparative rigidity of the white-black racial divide in 
the United States. 

However, GSS respondents in the residual “Other” category were consistently 
classified in just 50 percent of cases, and comparing the two measures of race 
results in different estimates of the racial populations in the GSS sample (Table 
1). Using observed race, the sample would be described as 80 percent “White,” 
15 percent “Black” and 5 percent “Other.” Based on self-reported race, the 
percentage of “Others” nearly doubles to 9 percent with all but one-half of one 
percent of the change affecting the estimate of "Whites." ? There also was a 55 
percent increase in inconsistent cases between 1996 and 2000 (64 cases and 99 
cases, respectively)? 


Table 1: Cross-tabulation of Observed and Self-reported Race in the GSS 
Self-reported Race 


White Black Other Row Total 

White 2182 11 109 2302 

> 99% 3% 43% 80% 
S Black 11 399 17 427 

€ 196 9796 7% 15% 
Š Other 13 2 125 140 

3 1% 0% 50% 5% 
Column 2206 412 251 2869 

Total 77% 14% 9% 100% 





Note: Consistently classified cases on the diagonal. Sample distribution by self-report along 
bottom row, distribution by observed race in last column. Column percentages may not sum 
to 100 due to rounding. 


Of course, the purpose of this study is not simply to answer the question 
of whether or not inconsistencies exist between observed and self-reported 
measures of race. We can expect from previous work, described above, that 
they will — at least for certain subpopulations. As a researcher, what is more 
important to know is: Are these inconsistencies significant enough to affect 
research conclusions? In order to answer this question, | use each measure of 
race in otherwise similar analyses of income inequality. For the parallel inquiries, 
| chose family income over respondent's income because it has fewer missing 
cases in the GSS data, though results using respondent's income (not shown) 
point to the same conclusion. | calculated family income from the midpoints of 
the GSS income categories in each year and coded it in thousands of dollars. 
Respondents in the open-ended top category for 1996 and 2000 were assigned 
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values of $85,000 and $135,000, respectively. All values were then adjusted to 
constant 2000 dollars and logged. The ordinary least squares regressions are 
limited to respondents 24 years old and above. Race is coded using the GSS 
categories of "White," Black" and "Other." Gender, educational attainment* and 
region are also included as common human capital and labor market controls. 
The design is similar to the analysis of income gaps in Brazil conducted by Telles 
and Lim (1998). 

After establishing that inconsistency between the two measures of race is 
worthy of concern, | verify that the same individual-level characteristics identified 
in previous studies of specific subpopulations (e.g., Harris and Sim 2002; Montalvo 
and Codina 2001; Xie and Goyette 1997) help explain inconsistencies between 
the two measures of race in the GSS. For this part of the analysis, | estimate two 
logistic regression models predicting classification consistency for white and 
nonwhite respondents, setting the interviewers’ classification as the benchmark. 
| also estimate two models predicting consistency for whites and nonwhites as 
defined by the respondents’ classification. | do this to highlight that neither the 
interviewers’ or the respondents’ classifications should be considered the “right” 
one for all purposes: in each of the four models, classification consistency is 
related to a different combination of respondent characteristics. 

In the four logistic regressions, the dependent variables are coded 1 when the- 
two racial classifications match and 0 when they do not. For cases in the self- 
reported “White” category, a 0 means the interviewer “darkened” the respondent 
to either "Black" or “Other.” A O for self-reported nonwhites corresponds to the 
interviewer “whitening” the respondent. The latter was the most common type 
of inconsistency, accounting for 83 percent of the cases of disagreement. This 
contradicts many of the assumptions of whiteness studies and critical race 
theory. While in the past people have been prevented from claiming whiteness 
by both custom and law (Jacobson 1998, Lopez 1996), these GSS respondents 
prefer to say they are not “White,” even though “White” is how the interviewers 
chose to describe them. 

There are some limitations to the GSS data. Foremost among them is that 
the survey design does not support making causal claims about why either the 
interviewers or the respondents chose the racial categories they did. My analysis 
reveals that classification inconsistency is a significant concern because different 
measures of race can lead to different research conclusions. It also highlights 
the characteristics of respondents that are associated with classification 
inconsistency. However, my analysis cannot adjudicate between competing 
explanations for why the inconsistencies exist. A longitudinal survey that includes 
multiple measures of race in each wave would be ideal for that purpose, but no 
such data is available for a national sample of adults at this time. 

Other design limitations that make the data less than ideal include the GSS 
method of matching interviewers and respondents by neighborhood, and the 
placement of the race measures after questions about ancestry and a number 
of socioeconomic characteristics. These aspects of the survey design increase 
the likelihood that the two measures of race will match.® Prior to 2002, the GSS 
also used the broad racial categories of “White,” “Black” and “Other,” which 
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lump together diverse identities and classification histories, particularly among 
the “Others” (Hispanics, Asians, American Indians). At the expense of further 
obscuring diversity and losing cases of inconsistency between the “Black” and 
“Other” categories, | grouped respondents into whites and nonwhites for the 
analysis of individual-level predictors of classification consistency. While this 
aggregation was necessary to increase the statistical power of my models, it also 
highlights characteristics that predict consistent classification as “White,” which 
historically has proven difficult to claim (Davis 2001, Lopez 1996). Taken together, 
the above limitations suggest that the survey provides a conservative estimate of 
inconsistent racial classification in the United States, and that my demonstration 
of the impact of inconsistent classification on research conclusions is similarly 
understated. 


The Significance of Inconsistent Racial Classification 


Results from my ordinary least squares regression analysis of family income 
inequality reveal that even the low levels of inconsistency between measures 
of race in the GSS can affect research conclusions about socioeconomic status 
rankings inthe United States (Table 2). As illustrated in Figure 1, using observed race, 
GSS respondents in the “Other” category are estimated to have family incomes 3 
percent greater than those of interviewer-classified “Whites.” Using self-reported 
race, respondents in the “Other” category are predicted to have family incomes 
19 percent less than those of self-reported “Whites.” The estimates of family 
income differences for “Blacks” vary little by type of classification, as might be 
.expected given their comparatively low rates of classification inconsistency in 
this sample. In terms of dollars, these estimates imply that the median family 
income among self-reported white Americans in the late 1990s was $37,878 per 
year, while median self-reported "Other" families earned $29,724 and the median 
family income among self-reported blacks was $22,281. On the other hand, 
interviewer-classified "Other" families had a median family income of $37,679 per 
year, while the median family incomes of interviewer-classified whites and blacks 
were estimated to be $36,987 and $22,436, respectively. The discrepancies that 
exist between the two measures of race — above and beyond common human 
capital and labor market controls — result not only in a 22 percent (or nearly $8,000) 
difference between estimates of the racial gap in income for respondents in the 
"Other" category, but in a substantive change in the conclusions drawn from the 
analysis. From one perspective, "White" and "Other" families appear to be doing 
equally well. From the other, "White" families out-earn "Others," whose family 
incomes are almost as far away from "Whites" as they are from "Blacks."9 
Some of the discrepancy between these estimates of family income inequality 
can be explained by knowing which respondents are less likely to be consistently 
classified by race. Individual-level characteristics that might be related to 
inconsistent racial classification, such as immigrant status, educational attainment, 
age, gender and region of residence, are also related to stratification outcomes 
such as earning power. The analyses that follow represent an important step 
toward examining these relationships and identifying how different measures of 
race provide different perspectives on inequality in the United States. 
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Table 2: Ordinary Least Squares Results Estimating Family Income by Racial Classification Type 


Self-reported Race Observed Race 





Black -41 -,40*** 
(.05) (.05) 
Other = 29514 -.04 
(.06) (.08) 
Constant 29021" 230m 
(.06) (.06) 
N 2328 2328 


Note: ***p < .01. Numbers in parentheses are standard errors. Models also include controls 
for region, age (and age squared), gender and educational attainment. Dependent variable is 
logged family income adjusted to constant 2000 dollars. 


Results from the four logistic regressions also confirm that respondent 
characteristics identified in previous studies are associated with classification 
consistency in the nationally representative GSS sample (Table 3). The four 
models indicate that immigrant generation, age and educational attainment are 
most often associated with classification consistency, though which specific 
characteristics are related to inconsistent racial classification differs both across 
the white and nonwhite groups and depending on whether the interviewer's 
classification or the respondent's identification is treated as the "correct" or 
baseline response. 

In addition, when examining the results in Table 3, it is important to keep 
in mind that if observed and self-reported race could be considered equivalent 
measures then any classification inconsistencies that resulted from comparing 
the two would be randomly distributed across cases. Statistically significant 
coefficients on any of the independent variables, which include parental nativity,’ 
age, educational attainment, region and gender, indicate that classification 
consistency is associated with the given characteristic. Significant negative 
coefficients indicate that respondents with those characteristics are less likely to 
be consistently classified. 

The coefficients from the first model, for self-reported nonwhites, represent . 
the log odds predicting that the interviewer's classification will match the 
respondent's identification as nonwhite (Table 3, column 1). Recall that this 
scenario — respondents who say they are nonwhite, but the interviewers disagree 
- accounts for the majority of the inconsistent cases in the GSS sample. For self- 
reported nonwhites, the negative relationships between classification consistency 
and having foreign-born parents are worth highlighting. These respondents are 
more likely to be classified as white by interviewers, compared to respondents 
who self-identify as nonwhite but have two U.S.-born parents. This translates to 
a predicted probability of consistent classification of 79 percent for an otherwise 
average respondent with two foreign-born parents who identifies as nonwhite, 
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Figure 1. Discrepant Income Inequality Results by Racial Classification Type 


Estimated Differences in Family Income, Compared to Whites 
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Note: All other variables not shown were held to the sample mean. In the observed race 
model (Table 2), the family income estimate for “Others” is not statistically significantly 
different from that of “Whites” (p < .66). 


and 64 percent for an otherwise average respondent with one foreign-born 
parent. In addition, as the interaction effects between educational attainment 
and age indicate; the older and more highly educated the self-reported nonwhite 
respondents are, the less likely they are to be consistently classified. The 
implication of this interaction effect for the earlier analysis of income inequality 
is that if older and more highly educated people are also more likely to have 
higher incomes, then the presence of older and highly educated self-reported 
nonwhites in the interviewer-classified white group might raise the income 
estimates for interviewer-classified whites, who are more heavily selected for 
age and educational attainment. 

The second model predicting classification consistency for interviewer- 
classified nonwhites tells a different story (Table 3, column 2). Recall that if 
observed race and self-reported race were equivalent measures, the results from 
this model and the previous one would be nearly identical and any inconsistencies 
would be randomly distributed. However, these results indicate that among 
interviewer-classified nonwhites there is very little difference in classification 
consistency between the parental nativity groups. Instead, there is significant 
inconsistency for the most highly educated respondents, such that an otherwise 
average interviewer-classified nonwhite person with at least a bachelor's degree 
would be consistently classified just 87 percent of time. Unlike with self-reported 
nonwhites, the negative relationship between higher education and classification 
consistency does not change with age. With respect to the income inequality 
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Table 3: Logit Models Identifying Individual-level Correlates of Racial Classification Consistency 
ee dere EE 


Nonwhites Whites 
Self-report Observed Self-report Observed 


Parental Nativity ` 
One born in U.S. 22: A0 -.01 SE 
(.46) (1.09) (1.05) (.38) 
Neither born in U.S. -.62** 1.50 -.34 21.00 2 
(.30) (.78) (.77) (.28) 
Age -.01 .04** 0395 
(.01) (.02) (.01) 
Education 
Less than H.S. -.88* -.23 -1.34** -1.90*** 
(.50) (.62) (.65) (.50) 
B.A. or more .81 A -.87* 155 
(.60) (.49) (.53) (.50) 
Region 
Northeast 40 22 
(.43) (.55) 
South DI -.01 
(.35) (.48) 
Mountain = 1.585255 -1.91*** 
(.48) (.51) 
Pacific -.20 15 
(41) (.59) 
Female .23 .T4* 23 ab 
(.25) (.45) (.47) (.54) 
Interviewer Doubt -93** -2.40** -2.28"* 
(.32) (.60) (.31) 
Interaction Effects 
Education*Age 
Less than H.S.*Age .04** .05** 
(.02) (.02) 
B.A. or more*Age -.05* -.01 
(.03) (.02) 
Region*Female 
Northeast*Female 18 
(.80) 
South*Female -.12 
(.68) 
Mountain*Female 4745 
(.91) 
Pacific*Female -1.32* 
(78) 
Constant ges 2.90*** 4.45" Se 
(41) (41) (.54) (.42) 
N 562 511 1968 2044 


Note: *p < .10 **p < .05 ***p < .01. Numbers in parentheses are standard errors. Indepen- 
dent variables, except age, are coded 1 when respondents have the named characteristic 
and zero when they do not. The analysis is limited to respondents who are 24 years of age 
or older. Constants refer to a 24-year-old male who has a high school degree, native-born 
parents and lives in the Midwest (when applicable). 
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analysis, these findings suggest that income estimates for interviewer-classified 
nonwhites might be higher than income estimates of self-reported nonwhites 
because some highly educated interviewer-classified nonwhites, who are also 
more likely to have higher incomes, report themselves as being "White." 

Yet another combination of correlated characteristics is revealed in the 
third model predicting consistent classification for self-reported whites (Table 
3, column 3). Here, classifications are less likely to match for respondents at 
either extreme of educational attainment, though the negative effect is strongest 
for self-reported whites with less than a high school degree. Consistent 
classification does increase for older respondents who self-identify as white. 
This relationship between age and racial classification is the complement to that 
found for respondents who self-identify as nonwhite. (The first model suggests 
that inconsistency increases with age, particularly among respondents with a 
college degree.) 

Finally, the model predicting classification consistency for interviewer- 
classified whites (Table 3, column 4) indicates that respondents with less than 
a high school degree are less likely to be consistently classified, though that 
discrepancy decreases for older respondents with the same level of education. 
Having foreign-born parents also lowers the probability of consistent classification. 
These findings suggest that some second generation Americans and people with 
low levels of education, who are less likely to have high incomes, are self-selecting 
into the nonwhite group (even though casual observers perceive them to be 
white). This might lower income estimates for self-reported nonwhites compared 
to interviewer-reported nonwhites. In addition, as is the case with the two models 
for nonwhites described above, a different set of individual-level characteristics 
accounts for consistent classification among interviewer-classified whites and 
self-reported whites. Clearly, the two measures are capturing different aspects 
of race in America. 


How Inconsistent Racial Classification Comes to Matter, and When it Matters Most 


This analysis of individual-level characteristics that predict inconsistent racial 
classification begins to untangle how different measures of race provide different 
perspectives on inequality, but as noted above, the GSS data is not designed 
to reveal why different types of racial classification capture different types of 
people. For example, the relationships between education, age and classification 
consistency from the first model for self-reported nonwhites could support the 
claim that education increases racial or ethnic consciousness (Xie and Goyette 
1997). They also may indicate that GSS interviewers are more likely to assume well 
educated (i.e., well spoken) people are white — or perhaps that highly educated, 
darker skinned people would prefer to be "whitened," as is often assumed by 
interviewers in Brazil (Nobles 2000). The higher rates of inconsistent classification 
for self-reported nonwhites with at least one foreign-born parent also could be 
interpreted in several ways. It is well known that different countries define race 
differently (Davis 2001), so first-generation immigrants may report different racial 
identities than otherwise similar native-born Americans. In addition, with the 
increase in immigration from non-European countries over the past four decades, 
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many couples with one foreign-born parent and one native-born parent might be 
interracial couples. Some of their adult children also may choose not to identify 
with conventional American race categories or they could be difficult to classify 
by appearance. Determining which specific processes are operating is beyond 
the scope of this study. 

What the analysis of individual-level characteristics does indicate is that the 
two measures of race are the most consistent for average respondents: middle- 
aged Americans with high school degrees and U.S.-born parents. Respondents 
who depart from the mean or the mode in any of these characteristics also are 
more likely to have atypical (i.e., inconsistent) racial classifications. One example 
is a 45-year-old native-born woman of Mexican ancestry who did not complete 
high school and identifies as white, whom the interviewer is sure is not. Or a 25- 
year-old man with a junior college degree and two foreign-born parents, whom 
the interviewer has no doubt is white, but who reports his race as “Asian.”? A 
combination of atypical characteristics makes racial classification inconsistency 
even more likely. For self-reported nonwhites, the 20 percent gap in the probability 
of consistent classification between respondents with two U.S.-born parents and 
respondents with one foreign-born parent increases to 30 percent among the 
most highly educated in each parental nativity group, as illustrated by Figure 2.? 

It is also important to note that much of this individual-level complexity would - 
be hidden if one examined only the net shifts between racial populations. Some 
of the effects identified above operate in opposite directions and, while the 
inconsistencies may be substantively interesting at the individual level, in the 
aggregate they largely cancel each other out. For example, the second model for 
interviewer-classified nonwhites suggests that some highly educated respondents 
"whiten" themselves, while the third model for self-reported whites suggests 
that some highly educated respondents "darken" themselves. In the aggregate, 
this translates to negligible differences between self-reported and observed 
race estimates of the educational distribution, ranging from .02 to .06. The most 
notable differences between population-level frequencies for the independent 
variables used in the above analysis are on the order of 2 percent: using self- 
reported race, 74 percent of nonwhite Americans have native-born parents; 
using observed race, 76 percent do. Nevertheless, even with this more narrow 
(or pragmatic) definition of measurement inconsistency, the disparity between 
the distributions of immigrant generation by the type of racial classification is 
statistically significant. 

To isolate the characteristic of immigrant generation and illustrate this claim, 
| reverse the previous analysis and use parental nativity as the dependent 
variable in a multinomial logit regression. The model includes, as independent 
variables, the respondent's self-reported race and a categorical variable taking 
into account the three possibilities of classification consistency (it is coded "0" 
for a consistent classification as white, "1" for an inconsistent classification and 
"2" for a consistent classification as nonwhite). In setting up the analysis this 
way, | do not mean to imply that an individual's racial classification "predicts" 
her foreign-born parentage; | am using the reversed formulation to determine 
whether knowing both racial classifications fits the data on immigrant generation 
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Figure 2. Decreasing Racial Classification Consistency for Atypical Americans 
Self-reported Nonwhites, by Parental Nativity and Education 
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Note: All variables not shown were held to the sample mean. Among self-reported 
nonwhites in the GSS, 60 percent have a high school or junior college degree, 23 percent 

do not have a high school degree and 17 percent have at least a B.A. In terms of parental 
nativity, 73 percent have two native-born parents, 22 percent have two foreign-born parents 
and 5 percent have one of each. | 


better than knowing just one. 

The statistically significant coefficient for self-reported race in Table 4 indicates 
that there is a difference between self-reported and observed race as a source 
of information about the respondents' immigrant generation. As noted above, 
second-generation Americans are significantly more likely to identify as nonwhite 
than the interviewers are to classify them as such. Thus, using self-reported race to 
generate descriptive statistics may yield higher estimates of nonwhite Americans 
with foreign-born parents or, conversely, using observed race may result in higher 
estimates of nonwhites with native-born parents. An analysis using whether or not 
the respondent is foreign born (not shown) provides similar results. 

It is likely that the distribution of other characteristics not examined here 
also might be affected by the type of racial classification. The small sample 
size for nonwhites made it necessary to limit my study to a few key variables, 
which were already identified in previous research as linked to inconsistent 
racial classification. However, cursory analyses indicate that the distributions of 
political views, marital status and the number of children the respondent reports 
as being ideal differ significantly by type of racial classification. At the individual 
level, as | find above, | expect respondents with atypical responses for these 
characteristics (e.g., identifying as extremely liberal or extremely conservative, 
being divorced or widowed, preferring very few or very many children) would be 
more likely than a "typical" American to have inconsistent racial classifications. 
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Table 4: Multinomial Logit Results Indicating Variation in the Distribution of Parental 
Nativity by Racial Classification Type 


One U.S.-born Parent No U.S.-born Parent 
Classification Consistency Scale - alo" - 57 
(.36) (.22) 
Self-report as Nonwhite 2.42** 2.257 
(.63) (.42) 
Constant -2 i Ski 
.10 .08 


Note: *p < .10 **p < .05 ***p < .01. N = 2855. Numbers in parentheses are standard er- 
rors. Comparison group is respondents with two native-born parents. Constants refer to 
a respondent consistently classified as white who has one, or two foreign-born parents, 
respectively. 


The Challenge of Using Race in Research 


When it comes to race, few social scientists practice in their research what many: 
have been preaching in their theories and classrooms for the past 20 years: 
that race is not a static, biological characteristic, but a socially constructed one, 
which can vary over time, place and social context (Martin and Yeung 2003). Such 
theories suggest that any racial classification scheme requires at least as much 
explanation — regarding how it came about and what it signifies — as it provides 
of how another variable of interest is distributed across the population. It also 
follows that there is not a single "true" measure of race, but numerous possible 
perspectives and interpretations. 

The findings in this study demonstrate such variation, in part, by supporting 
previous group-specific results using a nationally representative survey. Older 
Americans have different conceptions of race than younger ones; immigrants and 
their children define race differently than native-born Americans; racial identities 
are associated with educational attainment; and observed racial classifications 
may be responsive to status cues, as in Brazil and much of Latin America. 
Though specifying the causal processes behind these inconsistencies in racial 
classification is beyond the scope of this study, my results clearly illustrate the 
more general point that observed and self-reported measures of race are not 
equivalent. Further, inconsistencies between the two measures are significant 
enough to affect research conclusions, even when drawn from national cross- 
sectional data, and this suggests caution when interpreting research results 
based on one measure of race. 

The finding that different measures of race provide different perspectives on 
inequality has implications for improved methodology across the social sciences. 
Different research questions require different types of data on race. For example, 
observed race might be best for studies of discrimination, or access to goods and 
services, while self-reported race might be more useful for studies of attitudes 


Inconsistency between Survey Measures of Race ° 71 


and motivations. Teasing out the sources of race effects in this way would make 
research findings both more accurate and more useful. It would improve our 
ability to identify the mechanisms behind persistent racial inequality and help 
policy makers target resources more specifically to address those differentials. 
More than 200 years after the first systematic collection of racial data in the 
United States, the challenge of interpreting racial differences in life outcomes 
remains. It is not enough to change scientific rhetoric about race without also 
improving research practices to reflect its complexity and instability. Only then will 
we begin to understand the dynamics of race and inequality in the United States. 


Notes 


1. Following the experiment, the GSS switched to the federal government ap- 
proved, self-reported, "mark all that apply" method of measuring race. Citing 
the overall consistency between the various measures of race and ethnicity 
available in its 2000 survey, a GSS report suggests that the change from ob- 
served to self-reported race does not affect time trend data (Smith 2001). 


2. The degree of inconsistency for individuals in the "Other" category cannot be 
explained solely by respondents who are reporting Hispanic origin as a race. 
Based on the respondents' answers to the GSS ancestry question, about 
half of those reporting "Other" as their race also reported Hispanic ancestry. 
Most of the rest reported western European or American ancestry. A similar 
comparison of responses for interviewer-classified "Others" indicates that 
the majority reported Asian ancestry. 


3. Though the high rate of classification consistency for "Blacks" indicates that 
the black-white racial boundary in the United States is still fixed by the "one- 
drop rule," the increase in inconsistency between the two survey years was 
most notable in each of the four off-diagonal cells in Table 1 that correspond 
to inconsistency regarding the respondent's classification as "Black." 


4. | aggregated the GSS “highest degree earned” variable into three groups: 
“Low” for respondents with less than a high school degree, “Medium” for 
respondents with either a high school degree or a junior college degree, and 
“High” for those with bachelors and graduate degrees. 


5. |n the Brazilian survey analyzed by Telles (2002), the observed racial clas- 
sification was filled out before anything else on the questionnaire, so the 
results would not be influenced by other reported information, including the 
respondent's self-identification. In the GSS experiment, the interviewer does 
code race before the respondent, but only after hearing a fair amount of 
background information. While this may result in an underestimation of in- 
consistency, it also may serve to highlight the characteristics that the inter- 
viewer is relying on to make her determination. 


6. Itis intriguing that the two conclusions suggested by the results in Figure 1 
fall on either side of the debate about the current nature of the racial hierar- 
chy in the United States: Do Hispanics and Asians occupy status positions 
in between whites and blacks, or is there a new black-nonblack racial divide 
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(Gans 1999)? It appears that it feels like the former to respondents, but looks 
like the latter to interviewers. 


7. To capture immigrant status, | chose to use parental nativity with categories 
for whether both parents were born in the United States, one was, or neither 
were. Though the respondent's own immigrant status is a significant predic- 
tor of classification inconsistency (analyses not presented here), the nativity 
of the respondent's parents accounts for more of the variation. This may be 
because high rates of immigration from non-European countries, dating to 
the mid-1960s, make it more likely that those born outside the United States 
will be phenotypically different from "White" Americans. Thus, particularly in 
the case of respondents who had one foreign-born parent, parental nativity 
could be a proxy for mixed ancestry and/or a racially ambiguous appearance, 
which would make consistent classification less likely. 


8. The public-use GSS data for 2000 includes 134 verbatim responses from 
interviewers who classified their respondents as "Other" and then specified 
a group such as "Asian" or "Hispanic." | obtained similar verbatim responses 
for respondents from Tom W. Smith at the GSS, and the "Asian" example 
comes from this previously unreleased data. 


9. The higher probability of consistent classification for self-reported nonwhites” 
with the lowest levels of education illustrated in Figure 2 is consistent with 
this argument. Recall that in the first model self-reported nonwhites with 
less education were more likely to be consistently classified as they got old- 
er. This likely reflects the fact that in older cohorts — age 60 and older — lower 
levels of education were in fact the norm, especially among nonwhites. 
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Structural Assimilation Revisited: 
Mexican-Origin Nativity and Cross-Ethnic Primary Ties 


Susan K. Brown, University of California, Irvine 


Abstract 

Classical assimilation theory postulates that over time, members of immigrant groups will 
develop more primary ties with native members of the host society. However, lack of data has 
led most research to rely on the study of either spatial mobility or other secondary variables 
as proxies of primary ties. Using data from the Multi-City Study of Urban Inequality, this 
research examines primary relations, comparing the number of cross-ethnic strong ties 
among foreign- and native-born generations of persons of Mexican origin in Los Angeles 
County. The findings indicate that the native-born are substantially more likely to report 
cross-ethnic ties than immigrants. Spatial variables only partially explain the effect of 
primary structural assimilation, implying that both primary group and spatial dynamics 
play important roles in structural incorporation. 


The degree to which members of immigrant groups forge primary relations with 
native-born members of other ethnic groups constitutes the linchpin idea in 
traditional assimilation theory. Milton M. Gordon (1964: 70) called the development 
of such primary relations “structural assimilation,” defining it as entering “fully 
into the societal network of groups and institutions” of the host society. Newer 
refinements of assimilation or incorporation theory (e.g., Alba and Nee 2003; Bean, 
Stevens and Wierzbicki 2003) have emphasized institutional and other contextual 
factors that condition educational and earnings assimilation (the major kinds of 
structural assimilation examined in contemporary research). Nevertheless, the 
crucial ability to enter social networks beyond one's own group remains vital to 
an assessment of immigrant incorporation and economic mobility (Portes 1995). 
However, relatively few studies (Crispino 1980; Laumann 1973; South, Crowder 
and Chavez 2005; Tilly 1990; Wierzbicki 2004) have examined assimilation in 
terms of such primary relations, mostly because of lack of appropriate data. This 
paper assesses the structural incorporation of persons of Mexican origin in the 
United States, focusing on strong primary social ties as the major indicator of 
structural assimilation and on the degree to which cross-ethnic ties are greater 
among the native-born as opposed to foreign-born members of the group, even 
when neighborhood indicators of spatial assimilation are taken into account. 
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The Connection Between Assimilation Theory and Homogeneity in Social Ties 


The concepts of assimilation and incorporation have been re-examined recently 
after a generation of criticism (Alba and Nee 2003, 1997; Bean et al. 2003; 
Brubaker 2001; Kazal 1995). Although classic assimilation theory never predicted 
unadulterated Anglo-conformity, its implications of “straight-line” change among 
immigrant groups have been challenged by the findings of segmented-assimilation 
research (Portes and Rumbaut 2001, Zhou and Bankston 1998), which shows 
how ethnic networks may stratify the incorporation of the children of immigrants. 
Newer works (Alba and Nee 2003, Bean et al. 2003) recast assimilation less 
sequentially, as the result of individual decisions and collective action in densely 
tied groups. Crucial change comes through the blurring of boundaries between 
groups, and this depends on close and continuous contact between the 
members of different groups, reinforced by institutions that uphold civil rights. 
Brubaker (2001) recommends analyses that focus on multiple generations, and 
that such groups be thought of as heterogeneous across multiple dimensions. 
Assimilation would then involve changes in the generational distribution of an 
immigrant group across dimensions so that the distribution in later generations 
resembles that of the reference group. 

Such new conceptions of assimilation extend Gordon's canonical (1964). 
framework and demonstrate its continuing basic relevance, particularly in 
regards to the importance of primary social ties. Gordon distinguishes seven 
dimensions of assimilation in ways that encourage empirical analysis of a 
previously diffuse concept. His first dimension is acculturation or change in 
cultural patterns. The second, which Gordon (1964:81) calls the “keystone to the 
arch of assimilation," is structural assimilation, or widespread primary relations 
among groups and full entrance into the social networks and institutions 
of others. In his view, structural assimilation automatically spurs the other 
dimensions of assimilation: intermarriage, unity of identification, absence of 
prejudice, absence of discrimination and absence of power conflicts. Empirical 
analysis of assimilation has long focused on Gordon's types of assimilation, 
since several of the dimensions (e.g., intermarriage) readily lend themselves to 
operationalization. However, the keystone component of structural assimilation 
has either been neglected for lack of data or operationalized in terms of spatial 
assimilation or socioeconomic assimilation. (For exceptions, see Crispino 1980 
and South et al. 2005.) Because primary structural assimilation may or may not 
be related to what are often assumed to be adequate proxies of it, the use of 
such proxies without assessing their intersection with primary social ties leaves 
open not only the question of their construct validity, but also the question of 
the degree to which primary structural assimilation may occur independently of 
spatial or socioeconomic assimilation. 

The correspondence of spatial assimilation with primary structural assimilation 
is particularly important in this regard. Researchers relying on spatial patterns as 
indicators of primary structural assimilation (generally through suburbanization) 
argue that spatial mobility is necessary as "an intermediate step" for the social 
and economic assimilation of immigrants (Alba and Nee 2003:29, Massey 1985, 
Massey and Denton 1987). However, the use of spatial assimilation as an indicator 
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of structural assimilation assumes that physical distance is tantamount to social 
distance, though the nature and extent of this connection remains a long-running 
theoretical and empirical question. Studies of suburbs have shown that ethnic 
minorities who have spatially assimilated often remain socially isolated in their 
new neighborhoods, depending instead on longstanding but distant ties (Fischer 
1984, Gans 1967). Network perspectives hold that spatial proximity matters most 
for establishing contacts, but that homogeneity is more important for sustaining 
social ties over time (McPherson, Smith-Lovin and Cook 2001; Verbrugge 1977; 
Wellman 1996). To the extent this is true, social ties could range far beyond 
any given neighborhood or, in the case of immigrants, beyond even national 
boundaries (Massey et al. 1987, 1998; Palloni et al. 2001; Waldinger 1993; 
Wellman and Wortley 1990; Zelinsky and Lee 1998; Zhou 1998). 

Paradoxically, the literature on neighborhoods has often argued that 
homogeneity in social ties characterizes immigrant communities. Early studies 
portrayed second-generation enclaves as dense and solid (e.g., Gans 1962; 
Suttles 1968), with multiplex social relationships. More recent works have shown 
that communities with closed networks can provide social support and social 
capital (Portes and Bach 1985, Zhou and Bankston 1998). Ethnic enclaves tend 
to be conceptualized as buffer zones that re-create familiar institutions and 
allow the use of mother-tongue languages. Ethnographic research in multiethnic 
neighborhoods has found strong in-group rather than cross-group ties. Where 
Anglos and Mexican Americans live together, their neighborhood improvement 
organizations tend to be dominated by one group or the other and each is 
mistrustful of the other (Gonzales 1993). While immigrants of different national 
origins may be friendly, they tend to socialize with friends or family of the same 
ethnic group in the fashion of extended kin (Conquergood 1992; Hagan and 
Rodriguez 1992), but their networks are often highly gendered (Hondagneu- 
Sotelo 1994, Menjívar 2000). While early enclaves were largely working class, 
some new enclaves that are more socio-economically diverse also show class 
divisions within some recent immigrant groups (Zhou and Kim 2003). 

Over time, however, the social distance between Anglos and Mexican- 
Americans appears to wane, according to historical marriage records that show 
growth in intermarriage (Cazares, Murguia and Frisbie 1985). Dense ethnic ties 
are not unchanging for immigrant groups (Ochoa 2000). Moreover, other factors 
such as group size, internal variation and the salience of different ascriptive or 
achieved parameters of structural dimensions, also affect intergroup contacts 
and group solidarity (Blau 1977): When groups become more heterogeneous 
along salient dimensions, more cross-cutting contacts evolve and more evidence 
of the sort of structural assimilation Gordon was talking about is likely to emerge. 
Yancey, Ericksen and Juliani (1976) identify conditions that reinforce kinship and 
friendship networks and ethnic solidarity: common jobs, residential stability and 
concentration, and dependence on shared institutions and services. Undermining 
ethnic solidarity are such factors as class and gender inequality and religion 
(Abelmann and Lie 1995, Espiritu and Ong 1994, Pessar 1999). Even though 
structural parameters introduce aggregate variability into the types of social ties 
forged by immigrants at destination, an examination of the ethnic composition of 
primary social ties at the micro-level nevertheless offers the prospect of clarifying 
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the role of primary-group ties in structural assimilation, the outcome that Gordon 
viewed as the most important component of immigrant incorporation. 


Hypotheses 


In the immigrant generation, ethnic, language and cultural barriers often preclude 
the development of numerous strong cross-ethnic ties. The native born, however, 
presumably develop more cross-ethnic ties because they have attended U.S. 
schools and had contact with more members of the native born (Emerson, Kimbro 
and Yancey 2002). Thus, the main hypothesis of this research, here termed the 
primary structural assimilation hypothesis, is that native-born persons of Mexican 
origin will show more cross-ethnic strong ties than the foreign born. Beyond 
this, however, the question remains about the degree to which any observed 
effect of nativity on cross-ethnic ties might simply overlap with and thus be 
explained by spatial assimilation. Despite well-documented evidence of housing 
discrimination against blacks and Hispanics (Ross and Turner 2005, Yinger 1995), 
an emerging body of work shows that many relatively well-off or highly skilled 
immigrants, especially Asians, freely choose their place of residence (Logan et 
al. 2002, Tseng 1995, Zelinsky and Lee 1998). In general, spatial assimilation has 
long been associated with socioeconomic mobility (Alba and Nee 2003, Massey. 
and Denton 1987, White and Sassler 2000). Because those who can afford richer 
areas also have financial and human capital, they are also likely to have a large and 
varied network of acquaintances as well, since money and skills often indicate 
social capital (Portes 1998, Zhou and Kim 2003). 

Both social and economic characteristics of the neighborhood thus are likely to 
be related to spatial assimilation. Because such characteristics may co-vary with 
spatial assimilation, they may account for any positive effect of nativity on strong 
cross-ethnic ties. Such a finding would provide support for research approaches 
that use measures of spatial mobility as proxies for primary structural assimilation. 
To assess this question, this research examines the influence on primary ties of 
important social and economic indicators of the neighborhood. If the tack taken 
by prior spatial-assimilation research is justified in its assumptions, any positive 
nativity effect on cross-ethnic ties should disappear or be significantly reduced 
when neighborhood variables are controlled in statistical analyses examining their 
effects (a result that would provide support for a spatial assimilation hypothesis). 
If, on the other hand, primary ties continue to constitute an independent aspect 
of structural assimilation, any positive nativity effect should not disappear when 
neighborhood characteristics are controlled (a result that would support the 
primary structural assimilation hypothesis). 

Further evidence buttressing the idea that primary structural assimilation 
might operate independently of spatial assimilation would also be obtained if 
neighborhood characteristics were found to interact in certain ways with nativity 
to affect the number of cross-ethnic ties. Residence in a poor neighborhood 
populated mainly by members of other ethnic groups should not be associated 
with more heterogeneity in social ties, either for the native born or foreign born, 
because the resident presumably has little choice of neighborhood and would 
not necessarily possess much social capital. In the absence of such constraints, 
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however, for native-born persons living outside co-ethnic areas or in wealthier 
neighborhoods, one would expect more primary cross-ethnic ties than among 
foreign-born persons living in such areas. Under such circumstances, nativity 
differences should be enhanced in the better or non-co-ethnic neighborhoods, 
further suggesting that both characteristics of neighborhoods, as implied by 
the spatial assimilation approach, and primary group ties, as implied by the 
primary structural assimilation approach, need to be considered to develop a 
more complete portrait of structural incorporation. There is some question of 
causal direction and feedback because the development of social ties may 
precede settlement in any given neighborhood. South et al. (2005:597) examine 
the possibility that prior co-ethnic social ties influence the likelihood of moving 
and find little evidence that this occurs. But among those who move, having 
more cross-ethnic ties is positively related to the proportion Anglo in the new 
neighborhoods. 


Data and Methods 


To assess these possibilities, data are drawn from the Los Angeles County 
segment of the Multi-City Study of Urban Inequality (MCSUI), a multi-stage 
probability sample that oversampled poor and minority adult populations in four 
cities (Detroit, Boston, Atlanta and Los Angeles) between 1992 and 1994 (Bobo et 
al. 1998). In Los Angeles, the major port of immigrant entry into the United States 
along with New York (U.S. Immigration and Naturalization Service 2002), the 
oversampling had the effect of capturing many immigrants, particularly Mexicans 
and Central Americans. In an overall sample of 4,025 adults, 994 were Latino, and 
697 of these were of Mexican origin. This sample makes the MCSUI the largest 
source of survey data combining both the ethnic composition of primary-group 
ties for adult Mexicans, the country's major immigrant group, and neighborhood 
characteristics. Other surveys with large immigrant samples and information on 
social ties, such as the Latino National Political Survey or the National Longitudinal 
Study of Adolescent Health, cover a variety of metropolitan contexts. The Los 
Angeles sample is also large enough to allow distinguishing foreign-born from 
native-born persons, a key feature of the analyses in this research. The great 
majority of those who identify themselves as Mexican-Americans in the MCSUI 
sample are native born, and among those identifying as simply Mexican, 96 
percent are foreign born. These two groups are combined here into a single 
Mexican-origin category. 

The variable of interest on primary group ties consists of a question eliciting 
the respondent's discussion partners. Follow-up questions were asked about the 
first three persons named. The actual question reads: 


From time to time, most people discuss important matters 
with other people. Looking back over the last six months 
— who are the people, other than people in your own 
household, with whom you discussed matters important to 
you? Please tell me the first name or initials of the people 
with whom you discussed matters important to you. IF LESS 
THAN 3, PROBE: Anyone else? 
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Following previous work based on similar questions (Louch 2000, Marsden 
1987, Straits 1991), the “important-discussant” question is used as a proxy for 
strong ties. Although the wording of the “important-discussant” question has 
been criticized as vague, the very diffuseness of the question is what attracted 
researchers to it. The question “identifies comparatively intense portions of the 
interpersonal environment for all respondents, and it thus has some general 
utility.” (Marsden 1987:123) Further studies also have confirmed the validity and 
usefulness of the question, even in cross-cultural contexts (Bailey and Marsden 
1999; Blau, Ruan and Ardelt 1991; Straits 2000). Moreover, a strong tie that is also 
cross-ethnic probably indicates that the respondent is linked to diverse social 
networks in ways that go much further than mere intergroup contact. Social ties 
do not develop in a vacuum but from pre-existing social ties or around specific 
foci (Feld 1981), so that new contacts tend to be similar to existing contacts. The 
MCSUI classifies these ties as relatives, friends, co-workers or “others,” with 
friend being the most common choice. 

The data on social ties in MCSUI are also the only large-scale survey data on 
Latinos from a single urban area of the country, something necessary for analyses 
of primary social ties. They thus afford a unique opportunity to assess primary 
structural assimilation in the United States among persons of Mexican origin. 
Despite this enormous advantage, the data are limited in certain respects. For one: 
thing, the survey collected information on only three of each respondent's alters. 
The limitation of the sample to three alters means that findings may not capture 
an entire social network but instead may only represent a subset of the network. 
The more additional alters a respondent might have named, the more likely that 
at least one of those ties would represent a bridge to another racial/ethnic group. 
The exact degree of loss in the data as a result of this is unclear, although there 
are reasons to think it is not large. In a General Social Survey that included data 
on up to five alters, the modal response was three alters, although 40 percent of 
the sample gave four or five. However, the GSS sample allowed people to name 
conversation partners from their own households, and the first alter cited was 
slightly more likely to be a spouse or kin (Marsden 1987). The MCSUI, however, 
specifically asked for ties outside the household, thus eliminating the spouse as 
a potential strong tie. For purposes of looking at variations in strong ties to out- 
groups, data from only a three-alter question that excludes spouses is likely to 
capture almost all of the important variation in the number of out-group ties. 

Another limitation of the data is that the race or ethnicity of alters is presented 
in pan-ethnic terms, such as Asian or Latino. However, because the large majority 
of Latinos in Los Angeles are of Mexican origin, the alters are also likely to be 
Mexican. Also, itis impossible to tell whether alters were born in the United States 
or whether they knew one another. Finally, nearly one-fourth of the Los Angeles 
sample reported no strong ties outside the household; these respondents were 
disproportionately non-white, foreign born, and poorly educated. 

Because eliciting the names of individuals in a network often requires probing 
by interviewers, the measure may be particularly prone to interviewer effects 
(Marsden 2003). For this reason, it is instructive to compare marginal results 
in Los Angeles to those in Boston because the two cities had different training 
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of interviewers and different placements of the network module in the survey 
questionnaire. Results for non-Hispanic blacks and whites were quite similar 
across the two cities. The variation among Hispanic groups was wider but 
involved different national-origin groups. In the Boston segment of the MCSUI, 
32.6 percent of Puerto Ricans and 55.8 percent of Dominicans reported having 
no discussion partners, compared to 38.3 percent of those of Mexican origin 
in Los Angeles. However, the Asian sub-sample in Los Angeles, which has no 
counterpart in another city, appeared to have substantial interviewer effects. This 
is one reason it is omitted from this analysis. Overall, despite such limitations, 
the MCSUI module on social ties constitutes an unusually valuable source of 
data, as indicated by the fact it has generated useful and important results in a 
number of previous, significant studies (e.g., Elliott 1999; Freeman 2001: Green, 
Hammer and Tigges 2000; Johnson et al. 2000). 

The availability of neighborhood-level data is extensive, since each respondent 
is geolinked to selected variables from the STFS file of the 1990 U.S. Census 
of Population and Housing at the block-group level, although data identifying 
the particular block groups have been removed. A constructed variable for 
neighborhood status consists of a factor score deriving from a principal 
components analysis of three 1990 census characteristics: the proportion of 
resident adults in the block group with college education, the proportion of owner- 
occupied housing, and the block group's median income. These three variables 
were all highly correlated with the first principal factor, which explained 78 percent 
of the variance among the items. White and Sassler (2000) constructed and used 
in their research an analogous variable based on a principle components analysis 
of tract-level data for five characteristics. 

Because no Classification exists for identifying the proportion of any ethnic 
group that must live in a cluster before the area becomes known as an enclave, the 
demarcation of an ethnic concentration is not obvious. Alba, Logan and Crowder 
(1997) use a standard for overrepresentation in a census tract of two times the 
proportion of the regional population. However, with a group that now constitutes 
a plurality, such as Latinos in Los Angeles, this could become a prohibitive criterion 
because the ethnic concentration is a function of size of the overall population as 
well as its level of segregation. However, the multi-stage design of MCSUI offers 
an ad hoc solution. Because investigators divided each city into racial and ethnic 
strata based on census-tract concentrations, the stratum itself can indicate the 
ethnic concentration. The threshold population level for determining an ethnic 
stratum was 50 percent for Hispanics, non-Hispanic whites and blacks, but the 
actual strata used to draw the Hispanic sample ranged from 63.4 percent to 86.3 
percent Hispanic. While any threshold for the definition of an ethnic concentration 
is necessarily arbitrary, Logan et al. (2002) have found that regardless of different 
criteria used to define enclaves, research results tend be stable. In this analysis, 
regressions run using percentage Hispanic as an independent variable instead of 
ethnic stratum produced nearly identical results. 

Methodologically, measurement of cross-ethnic ties involves tradeoffs. The 
above-outlined hypotheses are predicated on dyads, or the relationships between 
respondents and alters, as the unit of analysis. Because each respondent generally 
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reports more than one alter, conventional regression involving dyads as the unit 
of analysis would violate assumptions of independence and result in biased 
estimates (Louch 2000; van Duijn, van Busschbach and Snijders 1999). Because 
the data specifically involve counts of cross-ethnic ties, the analysis does not 
distinguish isolates from those who are embedded within their own group. To 
overcome that drawback, the research also examines the mean proportion of 
all ties, weighted by respondents. The use of Stata's survey commands ensures 
accurate point estimates, standard errors and tests of significance (StataCorp 
2001: 321-26). The numbers presented in Table 3 are from Poisson regressions, 
given that the dependent variable is a count of the respondents' cross-ethnic 
ties that is heavily skewed toward zero, with relatively few respondents reporting 
more than one cross-ethnic tie. The coefficients in the table are interpretable as 
incidence rate ratios and are obtained by transforming the coefficients from the 
regression coefficients themselves (Anderton and Cheney 2004). 


Findings 


One in five respondents of Mexican origin in Los Angeles County has ties to a non- 
Hispanic, as Table 1 shows. Unsurprisingly, the proportion of respondents with 
cross-ethnic ties rises with the overall number of alters reported, so that among. 
those with three alters, more than 38 percent report that at least one of those alters 
is cross-ethnic. Moreover, nearly half of those with cross-ethnic ties have more 
than one such tie. The order in which the alters are listed does not appear to affect 
whether the alter is cross-ethnic; that is, the first alter named is about as likely as the 
second or third to be cross-ethnic. In all, these 697 respondents list 1,170 alters by 
race or ethnicity. Of these total alters, 21.4 percent are cross-ethnic. 


Table 1: Proportion of Mexican-Origin Respondents with at Least One Cross-Ethnic Tie, by 
Overall Number of Ties Mentioned 


Overall Number of Ties 


Total 0 1 2 3 
Proportion of .202 0 .103 .290 .381 
respondents with 
cross-ethnic ties 
N 697 253 49 64 331 





Those of Mexican origin are not unusual in their level of cross-ethnic ties. 
Similar levels are also found among blacks, non-Hispanic whites and Asians in 
Los Angeles County. These levels of cross-ethnic ties are far fewer than what 
one would expect if ties were stochastic, though the difference depends heavily 
on the size of the group. For example, in the 1990 census, 37.3 percent of the 
population of Los Angeles County was Hispanic (thus 62.7 percent was non- 
Hispanic), so the ratio of observed cross-ethnic ties to expected ones (21.4/62.7) 
is .341, or roughly a third of what might be expected if ties in Los Angeles County 
had been randomly distributed. Non-Hispanic whites are overwhelmingly the 
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most popular out-group; this is probably a function of both group size and status. 
Mathematically, when using a measure based on a small number of generated 
names (e.g., three confidants), the expected number of cross-group contacts 
would be greatest when all groups are equal in size (Feld and Carter 1998). 
Accordingly, one would expect reasonable heterogeneity of strong ties in Los 
Angeles where Hispanics and non-Hispanic whites are roughly equal in numbers. 
Yet the low proportion of cross-ethnic ties suggests that racial and ethnic 
constraints on networks matter more than sheer group size. The high degree of 
racial and ethnic clustering of ties sustains previous findings that the dimension 
of race is highly salient (Shrum, Cheek and Hunter 1988). 

Table 2 shows that Mexican immigrants have significantly fewer cross-ethnic 
ties than native-born Americans of Mexican origin. As the first set of findings 
shows, 57.2 percent of the foreign-born report having any strong ties outside the 
household, as opposed to 70.9 percent of the native-born. Partly as a result, the 
native born have more cross-ethnic ties to non-Latinos, as shown in the second 
set of findings, with a third of the native born reporting cross-ethnic ties but less 
than 14 percent of the foreign born. Even if one considers only those who have 
strong ties, those born in the United States average nearly twice as many cross- 
ethnic ties as the foreign born, as Column 1 of the third set of findings shows. 
Overall, these zero-order results support the hypothesis of primary structural 
assimilation, with the native born showing stronger tendencies to form cross- 
ethnic ties than the immigrants. 

Table 2 further shows that this difference in cross-ethnic ties is accentuated 
by residence in an ethnic concentration or in a wealthier area. Columns 3 and 6 
display the values for residence inside and outside a co-ethnic concentration. The 
difference between the native born and foreign in overall levels of strong ties and 
in bridge ties to non-Hispanics is most pronounced among those living outside 
Hispanic concentrations. The native-born who live outside Hispanic concentrations 
are significantly more likely than the foreign born in their neighborhoods to have 
cross-ethnic ties (45.4 percent to 20.2 percent). Similarly, the native born living 
outside Hispanic concentrations are more likely than the native-born in Hispanic 
concentrations to have cross-ethnic ties. Cross-ethnic ties represent a particularly 
low percentage of all ties for those living inside an Hispanic concentration (roughly 
15 to 19 percent), as the last set of findings shows. But among the native born 
outside Hispanic concentrations, 37.7 percent of all strong ties are cross-ethnic. 
The tendency of the native-born to have more cross-ethnic ties also occurs in the 
wealthier geographic areas. In the less well-off areas, the proportion of bridge 
ties to non-Hispanics remains uniformly low — no more than 18 percent of the 
respondents have any, as columns 4 and 7 show. But among the native born in 
wealthier strata outside areas of ethnic concentration, nearly half (48.3 percent) 
have cross-ethnic ties. The overall level of cross-ethnic ties is also highest among 
the native born in the wealthier strata. Among the same group inside Hispanic 
concentrations, 29.1 percent of their strong ties cross ethnic boundaries. The 
percentage of cross-ethnic ties is in almost all cases much lower for immigrants 
and those living in poorer areas. 
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Table 2: Proportion of Cross-Ethnic Strong Ties among those of Mexican Origin, by Place of 
Residence, Wealth of Neighborhood and Nativity 





Residence" 
Outside Co-ethnic Concentration Inside Co-ethnic Concentration 
Place Overall N All High- Low-Poverty All High- Low- 
of Birth Mean Strata Medium Poverty” Strata Strata Medium Poverty Poverty Strata 
(1) (2) (3) (4) (5) (8) (7) (8) 
Proportion of Mexican-origin respondents reporting any strong ties 
US. 709 180 826 460 DÉI .560 .602 .521 
(.048) (.051) (.178) (.048) (.073) (.119) (.088) 
Foreign 572 517 613 571 628 545 652 .384 
(.040) (.050) (.060) (.065) (.057) (.070) (.092) 
Proportion of respondents with bridge ties to non-Hispanics 
U.S. .337 180 454 .153 .483 .188 135 .238 
(.051) (.067) (.076) (.072) (.071) (.047) (.125) 
Foreign .136 517 .202 .178 211 094 .090 101 
(.022) (.036) (.052) (.045) (.026) (.031) (.046) 
Mean proportion of all ties that are bridges to non-Hispanics" 
315 377 .182 .387 .193 .105 291 
Foreign .161 853 .184 .166 .190 .145 .119 211 


Standard errors in parentheses. 

* Mexican-origin respondents living in census tracts that were more than 50 percent Hispanic 
in 1990 were considered to be in a co-ethnic concentration. 

> Medium-and high-poverty strata are defined as those in which more than 20 percent of the 
residents fall below the poverty line. 

* N refers here to the total numbers of ties, not the total number of respondents. Each tie is 
assigned the weight of the respondent, but all ties are then averaged. Conventional estimation 
of standard errors would be biased. Race/ethnicity data are missing on two of the 1,170 total 
number of ties 


The greater number of cross-ethnic ties among the native born compared to 
the foreign born thus appears consistent with Gordon's hypothesis that greater 
primary structural assimilation occurs the longer the members of an immigrant 
group have been in the United States. The next research task is to ascertain the 
extent to which this tendency is a function of spatial assimilation, as researchers 
examining geographic mobility as a main component of structural assimilation 
implicitly assume. The results in Table 3 show the extent to which this nativity 
effect holds up when other variables are controlled, including variables reflecting 
neighborhood characteristics alone and thus spatial assimilation. 

Native-born Mexican-origin persons are twice as likely to report cross-ethnic 
ties as Mexican immigrants (see the incidence rate ratio of 2.0 for Model 1), 
a tendency that changes little regardless of whether respondents live outside 
areas of ethnic concentration (see Model 2 where the ratio remains at nearly 1.9 
when this factor is controlled). Taking into account the economic well-being of 
the neighborhood, however, reduces the zero-order ratio to 1.6, indicating that 
natives are still 60 percent more likely than immigrants to report cross-ethnic ties, 
even after considering that natives tend to live in better-off areas. This result holds 
up even when controlling for the tendency to live in non-co-ethnic areas (Model 
4). In other words, the force of spatial assimilation per se, represented here by 
the degree to which the neighborhood of residence is well off, intersects with the 
tendency to form primary out-group ties. As a consequence, the results support 
the assumption of residential-assimilation researchers that spatial variables often 
function as reasonably good proxies of structural assimilation. 

However, Model 4 also shows a strong and statistically significant effect 


Mexican Nativity and Cross-Ethnic Primary Ties ° 85 


of primary structural assimilation remaining even after taking the influence of 
spatial indicators into account. This shows that processes of primary structural 
assimilation cannot be totally subsumed under categories of spatial assimilation, 
but rather that primary assimilation inserts its own dynamics into the incorporation 
experience. Further support, albeit weak, for this idea emerges in models 5 and 6. 
Model 5 shows that the nativity effect (the primary structural assimilation effect) 
is accentuated in non-co-ethnic neighborhoods, as would be expected by the 
classical structural assimilation hypothesis, though the effect is not statistically 
significant. Model 6 shows that neighborhood quality is more important for the 
foreign born than native born in predicting the number of cross-ethnic ties. Finally, 
results in Models 7 and 8 show that when the effects of a number of individual- 
level variables that might also explain the primary structural assimilation effect are 
examined, they do not appreciably reduce the nativity coefficient. This indicates 
that processes of primary structural assimilation continue to exert their influence 
even when the native born differ from immigrants in individual characteristics. 


Discussion and Conclusions 


In finding that native-born Mexican-origin persons are more involved in cross- 
ethnic ties than Mexican immigrants, this study reaffirms the importance of 
primary group relations in the structural incorporation of immigrant groups. 
Strong ties represent primary relations — not just contacts — and thus reflect the 
kind of social support that Gordon (1964) emphasized when discussing structural 
assimilation. Greater structural assimilation stems from successive immigrant 
generations reporting more cross-ethnic ties to other racial and ethnic groups, 
particularly to native-born whites. This measurement of similarity in ties entails 
the major advantage of being more direct than measures of spatial assimilation, 
which consider only where immigrants reside and thereby must rely on the 
assumption that relationships develop among racially and ethnically dissimilar 
people who happen to live near one another. In fact, this assumption is partially 
but not fully born out here. 

The traditional argument for spatial assimilation assumes that the first 
generation may be unlikely to leave poor ethnic enclaves for many years, if at all. 
For that reason, spatial assimilation theory never clearly distinguishes whether 
moving to the suburbs is a sign of socioeconomic mobility or social integration. 
However, the recent growth of immigrant concentrations in suburbs, often 
immediately upon arrival in the United States, (Logan et al. 2002, Tseng 1995) 
suggests a need to fine-tune the traditional concepts of spatial assimilation. 
The findings presented here provide empirical evidence to suggest that spatial 
assimilation measures capture important aspects of the dynamic of structural 
incorporation, especially those parts linked to economic assimilation. Such 
measures fail to reflect fully the development of primary social ties and thus the 
extent to which immigrant groups are becoming structurally assimilated. The 
findings reaffirm the importance of spatial assimilation measures for research on 
immigrant group incorporation while showing the limitations of assuming that 
spatial assimilation alone is sufficient for structural assimilation. Further work 
along these lines would benefit from more complete data on immigrants” social 
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Brutal Borders? Examining the Treatment of Deportees 
During Arrest and Detention 


Scott Phillips, University of Denver 
Jacqueline Maria Hagan, University of North Carolina, Chapel Hill 
Nestor Rodriguez, University of Houston 


Abstract 

Recent legislation has produced a dramatic rise in the detention and removal of 
immigrants from the United States. Drawing on interviews with a random sample of 
Salvadoran deportees, we examine treatment during arrest and detention. Our findings 
indicate: (1) deportees are often subject to verbal harassment, procedural failings and 
use of force; (2) force tends to be excessive; (3) force is more common against deportees 
than citizens; (4) situational contingencies and organizational actors influence force, 
but ecological settings do not. 


Departing from immigration policies of the previous three decades that granted 
immigrants increasing rights and privileges, the 1996 Illegal Immigration Reform 
and Immigrant Responsibility Act (IIRIRA) strengthened the enforcement arm of 
the Immigration and Naturalization Service (INS) giving it broad authority to arrest, 
detain and remove immigrants.' The 1996 Anti-Terrorism and Effective Death 
Penalty Act (AEDPA) and the 2001 USA PATRIOT Act further elevated the power of 
federal agencies to detain and remove immigrants, but limited judicial review of 
the process. The result has been a staggering increase in detentions and removals 
despite a constant rate of immigration. As Figure 1 demonstrates, the average 
daily detention population rose from 9,011 in FY 1996 to 21,133 in FY 2003; annual 
removals accelerated from 69,680 in FY 1996 to 202,842 in FY 2004. 

Despite journalistic accounts and human rights investigations (Amnesty 
International 2003, Catholic Legal Immigration Network 2000), no research 
project has systematically examined how deportees are treated during arrest and 
detention (but see Phillips, Rodriguez and Hagan 2002). This gap in the literature 
exists despite a long tradition of scholarship on police officers' use of force against 
citizens during arrest (Sherman 1980, Riksheim and Chermak 1993), and a more 
limited scholarship on corrections officers' use of force against citizens during 
detention (Marquart 1986). Because immigrants do not have the rights of citizens 
and are often apprehended and detained in remote locations, the potential for 
mistreatment is substantial. Our research begins to fill the gap, focusing on five 
questions: How often are deportees subject to verbal harassment and procedural 
failings during arrest and detention? How often are deportees subject to the use 
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of force during arrest and detention? If force is used, does it tend to be justified 
or excessive? Is the use of force during arrest more common against deportees 
than citizens? What are the predictors of force during arrest and detention? 

To answer these questions, we collaborated with a program in El Salvador called 
“Bienvenido a Casa” (BAC), meaning “Welcome Home.” BAC is a consortium of 
local organizations coordinated through Catholic Relief Services that attempts to 
facilitate the resettlement of Salvadoran deportees. During the planning stages of 
our research, BAC was preparing to host job placement meetings for deportees. 
Upon learning about our proposed research, the director of BAC offered to have 
BAC representatives interview a random sample of the deportees who attended 
job placement meetings. (All deportees were eligible to be interviewed regardless 
of legal status at the time of arrest.) Our collaboration resulted in 300 interviews 
with deportees and represents the closest approximation of a random sample of 
deportees that has ever been collected. 


Research on the Use of Force Against U.S. Citizens 


Our research focuses on the use of force against deportees because the parallel 

. literature on citizens provides a point of departure and comparison, but we also 
consider verbal harassment and procedural failings despite the absence of 
comparable literature. This broadened approach allows an exploration of the total 
deportation experience. 

Though scholars have long investigated the use of force against U.S. citizens 
(Sherman 1980, Riksheim and Chermak 1993), and are beginning to explore the 
topic of crime and immigration (Hagan and Palloni 1999, Martinez 2002), the 
intersection of both streams — the use of force against deportees — remains 
uncharted. The following review of research on force against citizens provides a 
comparative context for examining force against deportees. 

Research on the use of force against citizens has proceeded along divergent 
paths. The literature on police officers’ use of force during arrest is voluminous, 
and tends to use quantitative methods to estimate rates and causal predictors. 
(For reviews see Sherman 1980, Riksheim and Chermak 1993.) The literature 
on corrections officers’ use of force during detention is sparse and relies on 
qualitative methods to describe the guard subculture (Marquart 1986). Because 
we are interested in rates and causal predictors, we draw on insights regarding 
the use of force against citizens during arrest and extend those insights to the 
use of force against deportees during both arrest and detention. 

Research on the use of force against citizens during arrest can be divided into 
three categories: situational, organizational and ecological.’ Situational research 
considers police-citizen interactions. This line of inquiry consistently reports that 
the suspect's demeanor is critical: being drunk and resisting arrest increase 
the chance of force (cf. Friedrich 1980; Worden 1995, 1996; Engel et al. 2000). 
However, the role of other situational factors is not settled. The police seem to 
be more apt to use force against young suspects, but the relationship disappears 
after controlling for confounders (Friedrich 1980, Riksheim and Chermak 1993). 
Findings regarding the impact of gender on force are equally mixed (compare 
Friedrich 1980, Sherman 1980, Griswold and Massey 1985). Disagreement also 
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reigns regarding the influence of location. Most argue that arrests made out of 
public view increase the specter of force (Reiss 1968, Sherman 1980), but others 
maintain that location does not matter (Friedrich 1980). Still others suggest that a 
small number of officers commit brutal acts of force out of public view, but that 
routine force tends to be spontaneous and public (Fyfe 1996). To incorporate 
the situational approach, we consider the traditional elements of the interaction 
mentioned here and elements that are specific to deportation. Are deportees 
who are detained for a long period of time more likely to experience force? 
Are deportees who have access to legal counsel during detention less likely to 
experience force? 

Organizational approaches examine a police department's culture, policies 
and structure. For our purposes, the size of police departments is relevant. 
Departments that are large and bureaucratic produce more force, but not 
more excessive force (Worden 1995, 1996). It appears that supervisors in large 
departments cannot adequately mentor front-line officers, while supervisors in 
small departments can do so and, in the process, pass on tips for avoiding force 
(Worden 1995, 1996). To incorporate the organizational approach, we focus on 
the logic of the training argument. Are federal authorities who are trained to 
work with deportees, specifically U.S. Border Patrol officers and federal detention 
personnel, less likely to use force during arrest and detention than local authorities, 
specifically police officers and jail personnel, who have little or no such training? 

Ecological approaches consider the distribution of force across geographical 
space. For our purposes, the threat hypothesis is relevant. Derived from the 
conflict perspective, the threat hypothesis suggests that authorities use the 
criminal justice apparatus to control concentrations of minorities who are seen 
as a threat to the social order (Turk 1969). Indeed, research suggests that as the 
percentage of minorities in an area increases so too does the size of the police 
force (Jacobs 1979), police budget expenditures (Jackson and Carroll 1981), the 
arrest rate (Liska et al. 1985), police use of deadly force (Jacobs and O'Brien 1998), 
and police use of excessive force (Holmes 2000). To incorporate the ecological 
approach, we examine whether deportees who are arrested and detained in 
areas with high concentrations of immigrants are more likely to experience force. 
Specifically, are deportees who are arrested and detained in the border states of 
Arizona, California, New Mexico and Texas more likely to experience force than 
those arrested elsewhere? 


Methods 


Data Collection 


To understand the data collection process, it is important to understand the 
deportation process.^ Salvadoran deportees are flown from the United States to 
San Salvador, the capital of El Salvador. After the plane arrives, a Salvadoran official 
boards and offers the deportees a chance to participate in BAC. Most deportees 
agree to participate after the official explains that BAC is a Catholic Relief Services 
program that supports the resettlement of deportees through the coordination of 
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local social service organizations. During the airport meeting, BAC representatives 
provide: (1) counseling to help deportees manage the trauma of being returned; 
(2) information to help deportees get home (phone numbers, bus routes); and (3) 
information to help the deportee after he or she arrives home (local social service 
organizations). BAC representatives also conduct an interview with deportees to 
collect demographic and contact information. From the inception of the program 
in February 1999 through March 2002, BAC representatives interviewed 10,876 
deportees (84 percent of the deportees returned to El Salvador during the time 
period). 

Finding a job is often the most urgent need for deportees. But high rates 
of joblessness, coupled with the stigma of being deported, present substantial 
obstacles to resettlement (Rodriguez and Hagan 2004). In response, BAC created 
a program to help deportees find jobs or transition to better jobs. To launch the 
program, BAC sent information postcards to all former BAC participants. The 
postcard described the program and invited deportees to attend meetings 
scheduled across the nation. 

Serendipitously, BAC representatives were poised to begin the job placement 
meetings inthe spring of 2002 when we approached them about our research. The 
director of BAC, Luis Perdomo, offered to collaborate. Following a discussion of 
our need for rigorous data collection and BAC's need for expeditious job placement 
meetings, the director offered to have BAC representatives conduct face-to-face 
interviews with every third person who attended a job placement meeting. In 
summer 2002, the authors traveled to El Salvador to train the interviewers who 
then conducted 300 interviews across the 11 Salvadoran departments (states) 
during a six-month period, from June 2002 to December 2002. 


Profile of Respondents 


Table 1 profiles the respondents. The deportees were overwhelmingly young 
men: 95 percent are male with a median age of 29. Most of the deportees 
resided in the United States without authorization (51 percent were unauthorized 
migrants deported for an immigration violation; 16 percent were unauthorized 
migrants deported for a crime), but some of the deportees were legal residents 
(8 percent were legal residents deported for an immigration violation; 26 percent 
were legal residents deported for a crime). Before being deported, 58 percent 
of the respondents spent four months or less in detention, but 30 percent were 
detained for a year or more. 

Table 1 also demonstrates that the deportees were often established residents, 
evident in settlement, work and familial histories. The median amount of time 
spent in the United States prior to deportation was eight years. The deportees 
developed strong work and familial ties during this period: 78 percent were in the 
U.S labor force at the time of deportation, and 79 percent left family members 
in the United States. Moreover, 52 percent left a spouse or child in the United 
States. Given such strong ties, it should not be surprising that 38 percent of the 
respondents plan to return to the United States. 
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Table 1: Profile of Respondents (n = 300) ` 
Characteristic Minimum Maximum Median S.D. % 
Age 17 62 29 8 
Sex 
Male 95 
Female D 
Legal Status by Reason for Deportation' 
Unauthorized, Immigration Violation 51 
Unauthorized, Crime 16 
Authorized, Immigration Violation 8 
Authorized, Crime 26 
Years in U.S. Prior to Deportation 0 31 8 8 
Employed in U.S. Prior to Deportation ' 
No 22 
Yes 78 
Family Remaining in U.S. Household 
No 21 
Yes 79 
Months in Detention 
Less than 1 4 
422 34 
3-4 20 
5-6 7 
7-11 6 
12-23 7 
24-35 8 
36 or more 19 
Prior Deportation from U.S. 
No 75 
Yes 25 
Plan to Return to U.S. 
No 62 
Yes 38 


' Authorized includes: citizen, legal permanent resident, work authorization, residence 
authorization, political asylum, visa and temporary protection status. 


Dependent Variables: Force, Verbal Harassment and Procedural Failings 


To measure the spectrum of force that can occur during arrest and detention, 
the instrument included indicators for the following forms: hit/punch, slap, kick, 
throw to the ground, shove, spray (pepper spray) and shoot (stun gun). The 
indicators were combined to examine whether the respondent reported some 
form of force. 

It is also important to distinguish between justified and excessive force. 
Though the concept of excessive force is subjective, courts have applied a 
“reasonable officer” standard (Adams 1996, Klockars 1996). In the current 
research, the question of whether a reasonable officer would have used force 
turns on whether the deportee resisted arrest or resisted orders in detention. 
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For arrest, the instrument included two questions: “Did you try to run away 
during the arrest?" "Did you try to physically resist arrest?" Answering "yes" to 
either denotes resistance. For detention, the instrument included one question: 
"During detention, did you cooperate with officers or resist?" For both arrest and 
detention, resistance justifies force. But if the respondent did not resist then 
force is considered excessive. This distinction is a conservative application of 
the "reasonable officer" standard because all the forms of force outlined above 
would be excessive if the deportee did not resist, and some might be excessive 
if the deportee did resist. 

To measure verbal harassment during arrest and detention, the instrument 
included indicators of racial slurs: "Did arresting officer(s) direct racial slurs at 
you?" "Did detention officers direct racial slurs at you?" To measure procedural 
failings during detention, the instrument included indicators of access to food, 
water and a phone: "Did the detention officer(s) fail to provide adequate food and 
water?" "Did the detention officer(s) fail to provide access to a phone?" 


Independent Variables 


Our multivariate models consider situational, organizational and ecological 
predictors of force during arrest and detention. Table 2 describes measurement 
strategies and descriptive statistics for the independent variables. 

To explore the situational components of force, the arrest and detention models 
incorporate a similar, though not identical, set of dichotomous independent 
variables. The arrest model examines: age (the bottom quartile from 12 to 23 
is distinguished to highlight the unique experiences of young deportees), sex, 
whether the respondent had been drinking at the time of arrest, whether the 
respondent resisted arrest, and whether the arrest occurred out of public view. 
The detention model examines: age, sex, whether the respondent had access to 
legal counsel during detention, and whether the respondent was detained for a 
long period of time (more than the median of three months). 

To explore the organizational and ecological components of force, the 
arrest and detention models incorporate the same dichotomous independent 
variables. From an organizational perspective, both models examine whether 
the respondent was arrested/detained by local authorities or federal authorities. 
From an ecological perspective, both models examine whether the respondent 
was arrested/detained in a border state (Arizona, California, New Mexico and 
Texas) or non-border state. 


Potential Limitations 


The research has two potential limitations: validity and selection bias. Validity 
is a perennial concern in self-report data. Did the respondents exaggerate 
mistreatment to retaliate against the United States? Did the respondents conceal 
mistreatment for fear of retaliation from the United States? To reduce the chance 
of exaggeration, the instrument included closed-ended objective questions (e.g., 
"Did the officer kick you?") rather than open-ended subjective question (e.g., 
"How did the officer treat you?"). To reduce the chance of concealment, several 
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Table 2: Measurement and Descriptive Statistics for Independent Variables ~ 


Response Values Mean 

Panel A: Arrest 
Situational 

Bottom Quartile of Age’ 1 = yes (ages 12 to 23) .26 

Male 1 = yes .95 

Drinking 1 7 yes .18 

1 = yes (run from officer or physically 

Resist Arrest resist) .09 

Arrested Out of Public View — 17 yes i 
Organizational 

Arrested by Police" 1 = yes 52 
Ecological 

Arrested in Border State 1 7 yes 14 
Panel B: Detention 
Situational 

Bottom Quartile of Age see above 

Male see above 

Legal Counsel in Detention 1 = yes 52 

Long Detention 1 = yes (> median of 3 months) .48 
Organizational 

Detained in INS Facility 1= yes .48 
Ecological 

Detained in Border State 1 = yes E 


The bottom quartile of age (ages 12 to 23) is distinguished in order to illuminate the unique 
experiences of young immigrants. The middle 50 percent encompasses ages 24 to 33 and the 
top quartile encompasses ages 34 to 61. The mean age is 29.57. 

* Arresting authorities can be divided into three categories: police (46 percent), INS (48 per- 
cent), and a combination of police and INS (6 percent). Our measure indicates police involve- 
ment, and thus combines police and police-INS (52 percent). However, a “pure” police model 
produces the same substantive results. 


procedures were used: respondents were guaranteed confidentiality; interviews 
were conducted after the job placement meeting in the privacy of the deportee’s 
home; the interviews were documented through notes to assuage the deportee’s 
potential fear of being tape-recorded; and the interviewers were former Salvadoran 
deportees who now work for BAC, providing the entrée needed to establish trust. 
Despite such precautions, exaggeration and concealment remain possible. 
Selection bias is a perennial concern in multistage sampling procedures.® 
Fortunately, the BAC airport census of more than 10,000 deportees enables a 
comparison of the deportee sample to the Salvadoran deportee population. (All 
possible comparisons are reported.) The similarities between the sample and 
the population are strong: both the sample and population are 95 percent male; 
52 percent of the sample speaks English well, compared to 56 percent of the 
population; 43 percent of the sample was deported for a crime, compared to 36 
percent of the population; 63 percent of the sample is 30 or younger, compared 
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to 61 percent of the population. Finally, 38 percent of the sample had been 
in the United States for more than 10 years, compared to 30 percent of the 
population. These comparable profiles increase our confidence that the sample 
is a reasonable representation of the population. 


Results 


Table 3 describes verbal harassment and procedural failings during arrest and 
detention. The results paint a consistent picture of verbal harassment: 25 
percent of the deportees reported racial slurs during arrest; 26 percent reported 
racial slurs during detention. Though it is impossible to determine if respondents 
exaggerated mistreatment, it is possible to calculate the number of deportees 
who claim to have been subject to racial slurs during both arrest and detention. 
The fact that most deportees (192 of 298) did not report racial slurs in either 
setting, coupled with the fact that arrest and detention are distinct settings with 
unrelated actors, suggests that if an unreasonable number of deportees report 
racial slurs in both settings then exaggeration is a concern. The 153 reported 
racial slurs (arrest: 76 of 299; detention: 77 of 298) are distributed as follows: 
29 deportees reported a slur during arrest but not detention; 30 reported a 
slur during detention but not arrest; and 47 reported a slur during both. The 
proposition that 16 percent (47 of 298) of the deportees experienced racial 
slurs in both settings does not seem unreasonable. But because exaggeration 
remains possible, we offer minimum and maximum estimates of the percent 
of deportees who appear to have experienced a racial slur at some point in the 
deportation process, ranging from 20 percent (59 of 298) to 36 percent (106 
of 298). Turning to procedural failings, Table 3 also indicates that a substantial 
number of deportees did not have access to basic resources in detention: 31 
percent reported not being provided with adequate food and water; 45 percent 
reported not having access to a phone. 


Table 3: Verbal Harassment and Procedural Failings During Arrest and Detention 
Percent Yes 
Panel A: Arrest 
Verbal Harassment 
Did arresting officer(s) direct racial slurs at you? 25 
Panel B: Detention 
Verbal Harassment 


Did detention officer(s) direct racial slurs at you? 26 
Procedural Failings' 

Did detention officer(s) fail to provide adequate food and water? 31 

Did detention officer(s) fail to provide access to a phone? 45 


Percent experiencing one or more procedural failings 61 


! The original procedural questions were framed in a positive direction (e.g., "Did detention 
officer(s) provide adequate food and water?" Here, the questions were rephrased (and reverse 
coded) so that all responses could be described in a consistent manner using “percent yes.” 
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Table 4 details the use of force during arrest and detention. For arrest, the most 
common forms of force are shoving the suspect and throwing the suspect to the 
ground. Incidents of officers hitting, punching, slapping and kicking suspects are 
less common, but do occur. In rare cases, officers use pepper gas or a stun gun 
to subdue suspects. Combining all forms, 20 percent of the deportees reported 
some form of force during arrest. Of the 58 force cases, eight were justified 
(resistance leads to force) and 49 were excessive (force despite no resistance). 
Of the 239 non-force cases, 219 involved routine arrests (no resistance or force) 
and 19 involved restraint on the part of the officer (no force despite resistance). 
Notably, excessive force is reported almost 2.5 times as often as restraint during 
arrest. Turning to the findings on detention, the most common form of force is 
shoving a deportee, but other forms - hit, punch, slap and throw to the ground 
— occur at almost the same rate. The use of pepper gas and stun guns remains 
rare. Combining all forms, 11 percent of the deportees reported some form of 
force during detention. Of the 34 force cases, two were justified and 32 were 
excessive. Of the 262 non-force cases, 252 involved routine detention and three 
involved restraint on the part of the officer.? Here, too, excessive force dominates 
restraint. 

In addition to specific patterns, Table 4 reveals a broad substantive theme: 
most force is excessive. Including both arrest and detention, 81 of the 92 force 
incidents occurred despite the deportees' claims not to have resisted arrest or 
detention officers' orders. Using the same approach to examine the question 
of potential exaggeration, the 81 excessive force incidents are distributed as 
follows: 30 deportees reported excessive force during arrest but not detention, 
12 reported excessive force during detention but not arrest, and 19 reported 
. excessive force during both.'? Thus, a minimum of 14 percent (42 of 298) of the 
deportees appear to have experienced excessive force at some point during the 
deportation process, compared to a maximum of 20 percent (61 of 298). 

Our description of the treatment of deportees during arrest and detention 
leads to a logical question: How are deportees treated compared to U.S. citizens? 
Comparison data are available for arrest, but not detention. The Bureau of Justice 
Statistics’ 2001 Police-Public Contact Survey (PPCS) provides the best available 
citizen self-report data on the use of force during arrest.'' Conducted in 1999 
as a Supplement to the National Crime Victimization Survey (NCVS), the PPCS 
includes a random sample of more than 80,000 respondents. The PPCS defines 
police use of force during arrest as the threat of, or actual engagement in, one 
or more of the following behaviors: pushing or grabbing a citizen in a manner 
that does not inflict pain, pushing or grabbing a citizen in a manner that does 
inflict pain, kicking or hitting a citizen, spraying chemicals on a citizen, pointing 
a gun at a citizen, or ordering a police dog to subdue a citizen (BUS 2001:2). The 
PPCS unit of analysis can be disaggregated to “citizens involved in a crime,” an 
appropriate comparison to deportees. The PPCS data indicate that 18 percent 
of citizens who had a contact with the police because of involvement in a crime 
reported force or the threat of force. To refine the comparison, 78 percent of the 
citizens’ incidents escalated to actual force — rather than threats — so 14 percent 
of citizens reported actual force. Thus, deportees are about 1.5 times more likely 
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Table 4: Use of Force During Arrest and Detention 
Panel A: Arrest 





Forms of Force Percent Yes 
Did arresting officer(s) hit or punch you? 7 
Did arresting officer(s) slap you? 6 
Did arresting officer(s) kick you? 7 
Did arresting officer(s) throw you to the ground? 12 
Did arresting officer(s) shove you? 15 
Did arresting officer(s) spray you (e.g. pepper spray)? 4 
Did arresting officer(s) shoot you with a stun gun? 1 
Number experiencing one or more forms of force 58 
Percent experiencing one or more forms of force 20 
Number Type Definition Number 
of Cases of Cases 
Force 58 Justified' Resistance leads to force 8 
Excessive Force despite no resistance 49 
Missing on resist 1 
No Force 239 Routine No resistance or force 219 
Restraint No force despite resistance 19 
Missing on resist 1 


Missing on Force 3 


Panel B: Detention 


Forms of Force Percent Yes 
Did detention officer(s) hit or punch you? 7 
Did detention officer(s) slap you? 7 
Did detention officer(s) kick you? 5 
Did detention officer(s) throw you to the ground? 14 
Did detention officer(s) shove you? 9 
Did detention officer(s) spray you (e.g. pepper spray)? 3 
Did detention officer(s) shoot you with a stun gun? 2 
Number experiencing one or more forms of force 34 
Percent experiencing one or more forms of force 11 
Number Type Definition Number 
of Cases of Cases 
Force 34 Justified' Resistance leads to force 2 
Excessive Force despite no resistance 32 
Missing on resist 0 
No Force 262 Routine No resistance or force 252 
Restraint No force despite resistance 3 
Missing on resist 7 


Missing on Force 4 


1 'The opposite causal order is also possible: force leads to resistance. To give officers the benefit 
of the doubt, we code force as justified if resistance and force occur during the same incident. 
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than citizens to report actual force during arrest — 20 percent of déportees vs. 14 
percent of citizens. 

However, the true deportee-citizen force differential might be greater. Among 
citizens, 37 percent of the actual force incidents involved being pushed or grabbed 
in a manner that did not inflict pain (BJS 2001:25). If such incidents are excluded, 
then 9 percent of citizens reported force that inflicted pain. How does that 
compare to deportees? The question cannot be answered definitively because 
we did not ask the deportees about pain. But it seems reasonable to assume that 
the following forms of force in our data would inflict pain: being hit, punched, 
slapped, kicked, thrown to the ground, sprayed with pepper gas, or shot with a 
stun gun. The only form of force in our data that might not cause pain is being 
shoved. If our assumptions are granted, and shove incidents are excluded, then 
16 percent of deportees reported force that inflicted pain (excluding all shove 
incidents creates a conservative estimate of the force differential because some 
shove incidents presumably did cause pain). Using this approach, deportees are 
almost twice as likely as citizens to report force during arrest that inflicts pain 
— 16 percent of deportees versus 9 percent of citizens. 

Turning to the causal mechanisms that produce force, Table 5 presents odds 
ratios from the logistic regression of force during arrest/detention on situational, 
organizational and ecological predictors. 


Table 5: Odds Ratios from the Logistic Regression of Force on Independent Variables 


Odds Ratio 

Panel A: Arrest 
Situational 

Bottom Quartile of Age Zao 

Male .58 

Drinking .80 

Resist Arrest 1.89 

Arrested Out of Public View .88 
Organizational 

Arrested by Police 21495% 
Ecological 

Arrested in Border State .96 
(n) (276) 
Panel B: Detention 
Situational 

Bottom Quartile of Age 2.06* 

Male AT 

Legal Counsel in Detention .54 

Long Detention 2.39** 
Organizational 

Detained in INS Facility 10 
Ecological 

Detained in Border State 1.48 


(n) (266) 


"p 0. PELOS ASEO 
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Panel A examines arrest. Considering situational predictors, the age of the 
deportee has a strong influence on force: the odds of force are 2.3 times greater 
for deportees who are in the bottom quartile of the age distribution, compared 
to all other deportees (in contrast to prior research the relationship between 
age and force remains robust in the face of controls). The remaining situational 
components are non-significant. (Resistance is in the expected direction and 
comes closer to significance than the other situational factors.) Considering the 
organizational predictors, the results suggest that arresting authorities also shape 
force: the odds of force are 2.5 times greater for deportees who are arrested by 
local authorities rather than federal authorities. In fact, police use of force accounts 
for the entire deportee-citizen force differential during arrest, suggesting that 
federal authorities (such as the Border Patrol) are doing a better job than common 
criticisms would suggest." Recall that 14 percent of citizens were subject to 
force at the hands of the police. Among deportees arrested by INS, 13 percent 
were subject to force - no different than citizens. Among deportees arrested by 
the police, 26 percent were subject to force — almost double the rate for citizens. 
Yet the police are not inherently more forceful than INS; police use the same 
rate of force against citizens as INS uses against deportees. So the police are 
only more forceful when dealing with deportees, a population the police are not 
sufficiently trained to handle. Finally, ecological influences — concentrations of 
immigrants in border states — do not shape force during arrest. '? 

Panel B examines detention. The situational components of detention are 
fateful. Consistent with the impact of age on force during arrest, being in the 
bottom quartile of the age distribution doubles the odds of force during detention. 
Exposure also matters: the odds of force are 2.4 times greater for deportees 
detained longer than the median of three months. Hiring legal counsel appears to 
reduce the odds of force. However the impact of legal counsel must be considered 
provisional because the p value of .13 does not reach the generous threshold of .10. 
Nonetheless, the difference is sufficient to warrant further investigation: 9 percent 
of deportees who had legal counsel experienced force during detention, compared 
to 15 percent of those who did not. The organizational and ecological components 
of detention are inconsequential. Although the police use more force than federal 
officers during arrest, jail personnel do not use more force than federal detention 
personnel. Consistent with the prior results regarding arrest, concentrations of 
immigrants in border states do not influence force during detention. 


Conclusion 


The recent passage of IIRIRA , AEDPA and the USA PATRIOT Act has produced 
a dramatic rise in the arrest, detention and removal of immigrants from the 
United States, while simultaneously limiting judicial review of the process. The 
sheer number of deportees who are now at risk of human rights and civil rights 
violations underscores the need to monitor treatment during arrest and detention. 
But policymakers have ignored the task in the wake of 9/11 as immigration 
issues are increasingly tied to national security concerns. Researchers have also 
remained silent. Criminologists continue to focus on the treatment of citizens at 
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the hands of criminal justice officials, and demographers continue to focus on 
other dimensions of the migration experience. 

Drawing on a random sample of Salvadoran deportees, our research indicates 
that substantial numbers of deportees are subject to force during arrest. Indeed, 
the rate of force against Salvadoran deportees is higher than the rate against U.S. 
citizens. Moreover, force tends to be excessive, occurring despite the absence 
of resistance. The most important predictors of force during arrest are age and 
arresting authorities; young deportees and those arrested by the police are 
more likely to be subject to force. The mistreatment of Salvadoran deportees 
continues in detention. Not only does the use of excessive force continue, albeit 
at a diminished pace, but numerous deportees did not have adequate food and 
water or access to a phone. Here, the most important predictors of force were 
the age of the deportee and the length of detention. 

Criminologists and demographers should join forces to examine the treatment 
of deportees, considering, for example, whether the treatment of Salvadoran 
deportees is indicative of the treatment of all deportees or anomalous. Such 
scholarship could provide needed insights regarding the state's expanding use 
of removal as a form of social control; burgeoning detentions and removals 
demonstrate the power to regulate social membership through exclusion, 
including the exclusion of residents with established work and familia histories 
in the United States. Studies of deportation can also illuminate how state 
action shapes the social and demographic structures of local communities and 
institutional settings. Research on the treatment of deportees even has the 
potential to help ensure an equitable balance between protecting legitimate 
interests of national security and protecting the civil rights of deportees. 


Notes 


1. In 2003, INS was merged with the U.S. Customs Service to form the Bureau 
of Immigration and Customs Enforcement (BICE) and relocated to the De- 
partment of Homeland Security (DHS). 


2. Approximately one million legal immigrants were admitted to the United 
States annually during the 1990s. Another 200,000-350,000 migrants arrived 
annually without authorization (DHS 2004). 


3. A fourth category considers police officers’ individual characteristics, such 
as education and marital status. Ideally, we would have collected data on 
the characteristics of each officer. But our inability to do so is not a fatal 
flaw. Despite conventional wisdom regarding "good cops" and "bad cops," 
the search for individual predictors of force remains elusive (Riksheim and 
Chermak 1993). 


4. Information about BAC comes from organization documents and meetings 
with the Director. 


5. Responserates varied across the two stages of the research: among deport- 
ees who were sent information postcards about the job placement meet- 


10. 


T2: 


13. 
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ings, 55 percent attended a meeting; among deportees who were asked for 
an interview, 80 percent agreed. 


Complete data regarding the comparison of the sample to the population 
can be obtained from the authors upon request. 


One case is missing data on whether the deportee resisted arrest. 
Again, one case is missing data on whether the deportee resisted arrest. 


Here, seven cases are missing data on whether the deportee resisted orders 
during detention. 


The distribution of excessive force incidents totals to 80 (30-- 124- 19-- 19) 
rather than 81 because in 1 case the respondent experienced excessive 
force during detention, but it was impossible to determine whether he expe- 
rienced excessive force during arrest (he did experience force during arrest, 
but he did not answer the question about whether he resisted arrest). 


. Other methodological strategies have also been used to estimate the rate 


of force against citizens during arrest, including observation and officer self- 
report. Neither approach provides a suitable comparison to deportee self- 
report data. Observation data are obtrusive and tend to focus on encounters 
rather than arrests. Officer self-reports focus on arrests, but officers and 
citizens often do not share the same interpretation of an event. Such differ- 
ences tend to deflate the estimate of force against citizens, relative to citizen 
self-reports. 


Part of the explanation might revolve around communication. The local po- 
lice are less apt to have an officer at the scene of the arrest who speaks 
Spanish: according to the deportees, a Spanish speaking officer was pres- 
ent at 41 percent of local arrests compared to 73 percent of federal arrests. 
Moreover, the presence of a Spanish speaking officer appears to reduce the 
chance of force: in the presence of a Spanish speaking officer 16 percent of 
arrests escalated to force, but in the absence of a Spanish speaking officer 
23 percent of arrests escalated to force (the effect of a Spanish speaking of- 
ficer is in the expected direction but non-significant in multivariate models). 


In an earlier project, we found that the police are more likely than INS to use 
force during arrest, but that the difference disappears after controlling for 
whether the deportee was arrested for a crime or an immigration violation 
(Phillips, Rodriguez, and Hagan 2002). Here, the police-INS difference does 
not disappear after controlling for the basis of the arrest (models available 
upon request). Nonetheless, we do not include the basis of the arrest in the 
current models because of substantial overlap between the basis of arrest 
and the arresting authority (police officers make most arrests for crimes and 
INS officers make most arrests for immigration violations). 
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Whites' Opposition to Affirmative Action: 
Rejection of Group-based Preferences as well as Rejection of Blacks 


Thomas C. Wilson, Florida Atlantic University 


Abstract 

This study addresses whether whites’ rejection of affirmative action reflects an opposition 
to group-based preferences per se, independent of their attitudes toward blacks. Analysis 
of 1996 General Social Survey data shows that whites’ attitude toward preferential hiring 
and promotion of blacks is predicted by their attitude toward preferential hiring and 
promotion of women. This finding remains unchanged when controlling for racial and 
other attitudes, and it holds regardless of gender. The conclusion is that whites’ rejection of 
racial preferences reflects both rejection of blacks and rejection of group-based preferences 
in general, regardless of the target group. 


Introduction 


White Americans generally oppose preferential treatment for African Americans 
in the areas of education and employment (Krysan 2000). As an attitude object, 
racial preference programs reflect an amalgam of two objects: the black target 
group and the preference policy itself, independent of its intended beneficiary's 
identity (Krysan 2000). Past research shows that whites’ rejection of race-based 
preferences reflects their rejection of blacks. In this study | focus on whether 
opposition to race-based preferences in hiring and promotion is also based 
on whites’ rejection of group-based preferences per se, that is, in general, 
independent of their attitudes toward blacks. 


Explaining Whites’ Policy Views 


Research clearly shows that whites’ opposition to race-based preferences is 
motivated by “new, but not old-fashioned” racism. It has little to do with traditional 
prejudice expressed by stereotyping, the assumption of biological inferiority, and 
by supporting segregation and discrimination (McConahay 1982, Sears 1988, 
Schuman et al. 1997, Sears et al. 2000). On the other hand it has been linked 
to various formulations of what is considered a new form of racism. The first, 
best developed and most thoroughly tested of these is “symbolic racism,” by 
which is meant a blend of anti-black affect and the kind of traditional American 
moral values embodied in the Protestant ethic (Kinder and Sears 1981). Symbolic 
racism is expressed by the belief that “racial discrimination is largely a thing of 
the past, that blacks should just work harder to overcome their disadvantages, 
and that blacks are making excessive demands for special treatment and get 
too much attention from elites, so their gains are often undeserved.” (Sears 
et al. 1997; Sears, Henry and Kosterman 2000). Other similar formulations are 
termed "modern racism," (McConahay 1986), "racial resentment," (Kinder and 
Sanders 1996), "subtle prejudice," (Pettigrew and Meertens 1995) and "laissez- 
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faire racism," (Bobo, Kluegel and Smith 1997), all of which are theoretically 
distinguishable from but operationally similar to symbolic racism (Sears et al. 
2000, Kinder and Mendelberg 2000, Schuman 2000, Krysan 2000). 

New racism has been found to predict whites' racial policy views, including 
rejection of race-based preferences ( Kinder and Sears 1981; Sears and Kinder 
1971; Kinder and Sanders 1996; Sears, Henry and Kosterman 2000; Kluegel and 
Bobo 2001). Studies have also found that whites' views are predicted by attitudes 
and values containing no manifest racial content and which, on their face, are 
seemingly racially neutral and, if anything, more reflective of rejection of group- 
based preferences in general than rejection of blacks in particular. Such attitudes 
include political ideology, attitudes toward the role of government, individualism, 
egalitarianism, stratification ideologies and justice concerns (Kinder and Sears 
1981; Kluegel and Smith 1983; Sears et al. 1997; Sears, Sidanius and Bobo 2000; 
Kinder and Mendelberg, 2000; Sears, Henry and Kosterman 2000; Federico and 
Sidanius 2002a, 2002b; Krysan 2000). However, these attitudes' race-neutrality 
has not been conclusively documented and, in fact, certain of them have been 
found to be correlated to whites' rejection of blacks (Bobocel et al. 1998; Federico 
and Sidanius 2002a, 2002b ). Further, even if these attitudes are racially neutral, 
rejection of policy and not target group can only be inferred from the empirical 
link between them and racial policy views. To my knowledge no empirical study 
has tested this inference. 

Prior research shows that whites' rejection of race-based preferences is to 
some extent reflective of their rejection of blacks as expressed by indicators 
of new racism, and to some extent reflective of seemingly race-neutral values 
suggesting rejection of group-based preferences in general. In the following 
study | more directly test the role of general rejection of preferences, doing 
so by focusing on the relationship between whites' opposition to race-based 
preferences and their views on an apparently race-neutral but otherwise identical 
policy providing gender-based preferences. 


A Study of Race- and Gender-Based Preferences 


Data are taken from the 1996 General Social Survey (Davis and Smith 2003), which 
is the only GSS data set containing variables that tap attitudes toward preferential 
hiring and promotion of blacks and women. Their frequency distributions 
are presented in Table 1.' Whites’ rejection of race-based preferences is 
overwhelming, with 92.3 percent of men and 87.3 percent of women opposed or 
strongly opposed. Rejection of gender-based preferences is nearly as great, with 
82.7 percent of men and 78.2 percent of women opposed or strongly opposed. 

In what follows, | assume that whites’ rejection of race-based preferences is 
based on rejection of group-based preferences, which are generally independent 
of the target group to the extent that the following conditions are met. First, 
the two policy views in Table 1 are directly related. Second, the relationship is 
replicated when controlling for attitudes toward blacks and women and toward 
other racial- and gender-policy views not involving preferences. The rationale 
here is that one's view on a preference policy can reflect both evaluations of the 
policy and the target group (Krysan 2000). So, a correlation between views on 
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Table 1: Whites' Opposition to Preferential Hiring and Promotion of Blacks and of Women, 
by Gender 


Men (N = 325) Women (N = 409) 





Preferences* for Blacks Women Blacks Women 
Percentages: 

1. strongly support 4.9 8.9 5.1 13.9 

2. support 2.8 8.3 7.6 7.8 

3. oppose 23.7 25.2 26.4 28.1 

4. strongly oppose 68.6 575 60.9 50.1 


*Some people say that because of past discrimination, (blacks/ women) should be given prefer- 
ence in hiring and promotion. Others say that such preference in hiring and promotion of 
(blacks/ women) is wrong because it discriminates against (whites/ men). What do you think? 
Are you for or against preferential hiring and promotion of (blacks/ women)? 


race and gender policies may merely reflect respondents' tendency to evaluate 
blacks and women similarly. With attitudes toward blacks and women controlled, 
any residual correlation between race- and gender-policy views indicates that 
respondents evaluate policies similarly, independent of target group. Further, if 
the correlation is maintained when also controlling for views on similar policies 
not entailing preferences, the implication is that respondents evaluated the race- 
and gender-based preference policies similarly based on their general attitude 
toward preferences per se. A third condition is that the relationship is replicated 
both among white men and white women. The rationale here is that white men's 
tendency to evaluate these preference policies similarly may be based on self- 
interest leading them to oppose any policy offering no benefit to themselves. 
However, the same tendency among white women is interpretable as based on 
their general views on preferences, regardless not only of target group, but also 
regardless of whether they themselves benefit. 

My statistical method is OLS regression of race-preference views on gender- 
preference views, and control variables. Controls tapping attitudes toward 
blacks include two indicators of old-fashioned, racism: the attribution of blacks’ 
socioeconomic condition to their lesser ability to learn and reluctance to vote 
for a black presidential candidate. Also included are five indicators of symbolic 
racism, perhaps the most prominent formulation of new racism. Available 
symbolic racism indicators correspond to two of four dimensions of the concept 
identified by Henry and Sears (2002): work ethic and responsibility, and continued 
discrimination. The indicators are the views that blacks lack motivation, that they 
should do "without any special favors, that current discrimination is minimal, 
that blacks have adequate educational opportunity, and that conditions for blacks 
have recently improved.”? 

Six controls tap attitudes toward women: the views that women should take 
care of their homes first, that wives shouldn't work outside the home, that women 
are unsuited emotionally for politics, that a husband's career is more important 
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than a wife's career, that it is better if the man is the achiever outside the home, 
and reluctance to vote for a women presidential candidate. Two controls tap 
views on policies not involving preferences. The first asks whether the country 
is spending too much to improve the condition of blacks, and the second taps 
views on "special efforts" to hire and promote women. | will also control for 
several demographic variables: age, education in years, household income (a 21- 
point ordinal scale), and region coded for Southern residence? 


Findings 


Findings for white men appear in the left-most columns of table 2. Bivariate 
correlations between rejection of race preferences and the independent variables 
show a very strong tendency for those opposing gender preferences to also 
oppose race preferences ( r = .63). Racial attitudes associated with old-fashioned 
racism are unrelated to views on racial preferences but all new racism indicators 
are directly related to opposing racial preferences. Attitudes toward women 
are consistently unrelated to race-preference views. Those who oppose other 
policies favoring blacks and women but not involving preferences tend also to 
oppose race preferences. 

The regression analysis shows for white men that even when attitudes 
toward blacks, women and policies not involving preferences are controlled, 
gender-preference views remain a robust predictor of views on race-based 
preferences and, indeed, its strength is only marginally reduced compared to 
that at the zero-order level ( beta = .54 v r = .63). As | noted above, this residual 
relationship between views on gender- and race-preferences strongly suggests 
that respondents tend to evaluate group-preference policies similarly, regardless 

of the identity of the groups targeted by the policies, and that their evaluations 
reflect their general attitude toward group-base preferences per se. 

There is an alternative interpretation: race- and gender-preference views may 
be related among white males not because both policies entail group-based 
preferences, but because they entail group-based preferences that do not benefit 
white men. That is, white men's rejection of both policies may be based on their 
own self-interests. If so, | would expect a much weaker relationship between 
white women's gender- and race-preference views compared to those just 
discussed for white men. This explanation is tested in the right-most columns 
of Table 2. 

Results for white women are for the most part quite similar to those for white 
men. There is a strong tendency for those opposing gender preferences to also 
oppose racial preferences (r — .47). Racial attitudes associated with new but not 
old- fashioned racism are related to opposing racial preferences, as are attitudes 
toward non-preference racial and gender policies. None of the gender attitudes 
is related to racial-preference views. 

The regression shows that, controlling for racial and gender attitudes along 
with other policy attitudes, gender-preference views remains a robust predictor 
of opposition to race-based preferences, with the magnitude of its effect reduced 
only slightly compared to the bivariate level (beta =.35 vs. r = .47). 
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Table 2: Determinants of Opposition to Race-based Preferences in Hiring and Promotion 
———— —————Ó————— 


White Men White Women 

r beta' r beta' 
Opposes Preferential Hiring of Women .63* .94* AT* A 
Old-fashioned Racism: 
Blacks less able to learn 08 09* - 01 02 
Wouldn't vote for black candidate 05 08 02 -.02 
New Racism: 
No current discrimination 31 02 30* 10* 
Blacks lack motivation HRE -.03 2185 .06 
Blacks have educational opportunity 20* 02 21* 03 
Conditions for blacks have improved 20° 07 ELL -.00 
No special favors for blacks At" e 41* 29% 
Gender Attitudes: 
Women care for home, not country 03 -01 - 04 - 04 
Women not suited for politics 05 02 - 04 a 
Women shouldn't work 20 .00 .03 02 
Men work, Women tend home 06 -.03 .08 -.06 
Wouldn't vote for woman candidate .08 202 02 01 
Women help husband first 07 05 09 08 
Too much spending on assistance to blacks 31* .08 Om -.03 
Opposes special efforts to hire women Kl 04 .28* 10* 
R? .497 .356 


! Net also controls for respondent's education, household income, age and residence in the 
southern states. 
*p « .05 


In all, findings are similar regardless of gender and show two things: (1) 
opposition to race-based preferences is predicted by rejection of gender-based 
preferences; (2) the strength of the prediction is little attenuated with attitudes 
toward blacks, women and other policies controlled. The only notable difference 
in findings by gender is that the relationship between views on gender- and race- 
preferences is clearly stronger for men (beta = .54 vs. beta = .36).* A potential 
explanation for this was mentioned earlier: for white men, views on gender- and 
race-preferences may be correlated not only because both policies involve group- 
based preferences but also because both policies offer benefits to others (women 
and blacks), but not to white men. For white women, who stand to benefit from 
gender preferences, views on gender- and race-preferences can be correlated 
only because both policies involve group-based preferences (that is, after controls 
for the other attitudes included in the regression). If this is so, then findings for 
women are a better indicator of the extent to which rejection of race-preferences 


116 * Social Forces Volume 85, Number 1 * September 2006 


are based on rejection of group-based preferences generally, independent not 
only of the other attitudes included in the analysis, but independent also of self- 
interest or group-interest. 


Discussion 


In a recent literature review Krysan (2000:142) observed that "The debate about 
the meaning of racial policy attitudes comes down to assertions about which. 
of a number of attitudinal variables is responsible for generating support or 
opposition." To date, research on views toward racial preferences has shown 
that opposition among whites is consistently and robustly generated by racial 
attitudes indicating whites' rejection of blacks. Results have shown that it is also 
generated by attitudes and values that are arguably race-neutral, and from which 
opposition to group-based preferences in general and regardless of target group 
might reasonably be inferred. 

The approach taken in this study has been to focus on the statistical relationship 
between whites' views on race-based preferences in employment and promotion 
and their views on similar gender-based preferences. The rationale has been 
that to the extent to which this relationship is observed among white men and 
women alike, even when controlling for racial and gender attitudes, and other 
racial and gender policy views, it is interpretable as the extent to which rejection 
of race-based preferences is based on rejection of group-based preferences 
per se, independent of target group's identity. Results show that regardless of 
gender, whites' opposition to race-based preferences is to a considerable extent 
interpretable as their rejection of group-based preference policies per se. 

Some caution is called for in assessing these results. First, | have used the only 
data set which contains policy-view data on both race- and gender-preference 
policies. The data set is taken from the well-respected General Social Survey, 
but its limitations should not be overlooked. The data are from a single year 
(1996) and the sample is regrettably small (N = 325 men and 405 women). Only 
views on employment preferences are available, and so there is no way to tell 
whether this study's findings apply also to race-based admissions policies at 
colleges and universities. 

Second, indicators of new racism available in the GSS correspond to only two 
of four dimensions of symbolic racism (Henry and Sears 2002), and symbolic 
racism Is only one of several formulations of new racism. Most of these alternative 
formulations are similar to symbolic racism, but at least one is not. Dovidio et al. 
(2002), distinguish between explicit consciously-held racial attitudes of the sort 
considered in this study, and implicit racial attitudes, termed “aversive racism", 
which operate in an unconscious and unintentional fashion and are therefore 
virtually impossible to measure in survey research and must be addressed 
instead by using experimental designs (Dovidio and Gaertner 1986). Studies have 
found aversive racism to influence whites’ behavior and judgment in interracial 
contexts including hiring decisions. While aversive racism is not expected to 
motivate whites’ views on racial policies (Dovidio and Gaertner 1986) whether or 
not it does in fact has never been tested. To the extent that aversive racism may 


Whites' Opposition to Affirmative Action * 117 


influence white's policy views, this study may have underestimated the impact of 
racism on attitudes toward race-based affirmative action. 

Third, the correlations between racial- and gender-preference views may have 
been overestimated here. An anonymous reviewer suggested that the correlation 
may have been inflated by the norm of even-handedness or reciprocity, which 
provides that respondents be consistent and therefore bring a response to an 
item asked later in a survey into line with a response to an similar item asked 
earlier (Schuman and Presser 1996:28). Specifically, if one expressed opposition 
to preferences for blacks (asked earlier in the GSS), one would tend also to 
express opposition to gender-based preferences (asked later in the interview) in 
order to appear fair and consistent. Studies of such effects show that they are 
not particularly common and that they often do not occur even when they are 
most expected (Schuman and Presser 1996). The GSS tests for question-order 
effects of this sort using split-ballot designs in which the order of items is varied. 
Unfortunately, no such test has been conducted with the racial- and gender- 
policy attitudes used here. However, results have shown that the operation of 
question-order effects is quite rare in GSS data, amounting to no more than 
12 probable effects out of 500 variables tested (Smith 1990). Furthermore, to 
minimize context effects similar items are often separated in surveys by buffers 
of intervening items (Wanke and Schwarz 1997). In the GSS data used here the 
buffer between the racial- and gender-preference policy views was substantial, 
amounting to dozens of questions (the exact number depending on skips) taking 
10 to 15 minutes to cover (Smith 2004). In all, the operation of the norm of 
even-handedness cannot be ruled out, but it seems unlikely to have more than 
modestly enhanced the robust correlations reported here. 

Given these limitations, results are best regarded as provisional. Further 
research in this area, including the collection of data on a fuller range of new 
racism measures and a broader variety of preference policies is much needed. 
But with these cautions in mind, | draw the following implication from this 
study. Whites' opposition to race-based preferences is motivated by the sort of 
rejection of blacks that has come to be called new racism. But itis also motivated 
by rejection of group-based preferences in general, regardless of whom they 
are intended to benefit, and regardless even of whether oneself is among the 
intended beneficiaries. 

This latter motivation appears to be race-neutral, that is, quite distinct from 
hostility or antipathy toward blacks. 


Notes 


1. Non-whites are excluded from this analysis as are whites with missing data 
on these or other variables used in the analysis. 


2. There are no available GSS items corresponding to the other two symbolic 
racism dimensions, excessive demands and undeserved advantage, nor to 
other formulations of new racism. The racial attitudes used here are neither 
ideal nor exhaustive, but they are typical of old-fashioned and new racism 
indicators used in previous research. Item wordings for these and other 
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items in the analysis can be found in Davis and Smith (2003) and online 
at http:/webapp.icpsr.umich.edu/GSS/. The respective variable names 
are RACDIF2, RACPRES, RACDIF4, WRKWAYUP RACDIF1, RACDIF3 and 
BLKSMIP These items were recoded as necessary to give higher scores for 
attitudes unfavorable to blacks. 


3. Respective variable names for the gender attitudes are FEHOME, FEWORK, 
FEPOL, FEHELP FEFAM and FEPRES. Names of the policy views are NATRACE 
and FEHIRE. The demographics are AGE, EDUC, INCOME91 and REGION. 


4. Comparing standardized regression coefficients between groups can be 
misleading if variables’ variances differ between groups. Comparison of non- 
standardized regression coefficients is preferred (Pedhazur 1982), and here 
also shows a significantly stronger relationship for men than women (B = 
437 v B = .278; t = 2.84, p « .05). 
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Abstract 

We examine the assortative mating patterns of new parents who are married, cohabiting, 
romantically involved and no longer romantically involved. Using data from the Fragile 
Families and Child Wellbeing study, we find that relationship status at the time of a birth 
depends mainly on father's race rather than on whether mother and father's race/ethnicity 
differ. Crossing race/ethnic lines does not appear to have much effect on relationship 
transitions following a birth. Rather, parents are less likely to marry after a birth if one 
parent is black, and the relationships of Hispanic couples are particularly stable. Crossing 
educational lines has little effect on relationship status at birth, but same-education couples 
had a slightly lower risk of divorce following the birth. 


Introduction 


Intermarriage patterns are a classic measure of social distance between ethnic 
and racial groups as well as a classic indicator of the assimilation of social groups 
(Kalmijn 1998, Merton 1941). Patterns of assortative mating with respect to 
socioeconomic characteristics have also been of interest because of what they 
reveal about the evolution of the institution of marriage and gender roles and 
because of their implications for our stratification system (Mare 1991). Married 
couples, however, representa shrinking share both of American couples in general 
and of couples having children. Recently, the study of assortative patterns of 
marriage has been extended to include cohabiting couples (Blackwell and Lichter 
2000, 2004; Garfinkel, Glei and McLanahan 2002; Schoen and Weinick 1993). 
These recent studies have used the degree of homogamy to understand some 
of the differences and similarities between marriage and cohabitation as social 
institutions. 

In this article we expand the focus of assortative pairing from co-resident . 
couples to all parents of newly-born children. Taking advantage of the new 
data on the parents of a recent birth cohort from the Fragile Families and Child 
Wellbeing study, we are able to look at the assortative pairings of mothers and 


The authors gratefully acknowledge advice and suggestions from Sara McLanahan, Ger- 
man Rodriguez, Michael Hout, Vin Tu, Jean Knab, Kelly Musick and Lawrence Wu. A 
prior version of this article was presented at the Annual Meeting of the Population Associa- 
tion of America in Minneapolis, Minnesota on May 3, 2003. The first author's research for 
this article was funded by a grant from NICHD, R03 HD043084. Direct correspondence 
to Joshua Goldstein, Department of Sociology, Princeton University, 257 Wallace Hall, 
Princeton, NJ 08544, 609-258-5513, e-mail: josh@Princeton.edu; or to Kristen Harknett, 
Department of Sociology, University of Pennsylvania, 271 McNeil Building, 3718 Locust 
Walk, Philadelphia, PA 19104, 215-573-9388; e-mail: harknett@sas.upenn.edu. 


© The University of North Carolina Press Social Forces, Volume 85, Number 1, September 2006 


122 * Social Forces Volume 85, Number 1 * September 2006 


fathers who are (1) married, (2) cohabiting, (3) romantically involved but not living 
together, and (4) no longer romantically involved. The Fragile Families study took 
place in 20 U.S. cities and oversampled births to unmarried parents, allowing 
for an analysis of assortative pairing among a relatively low-income and racially 
diverse population. 

By focusing on a study of new parents, we are limiting our analysis to the 
assortative pairings of couples who have children together. However, the 
assortative patterns of parents are of particular importance because of their 
consequences for the transmission of inequality and the composition of the next 
generation. The rapid increase in multiracial children makes this population of 
particular interest. Additionally, the study's overrepresentation of low-income 
parents provides an opportunity to explore patterns of couple formation and 
disruption in the context of the disappearance of men's advantage in educational 
attainment and the general decline in the earnings capacity of working class men 
relative to women (DiPrete and Buchmann 2006, Jacobs 1996, Wilson 1996). 

In addition to looking at a wider variety of relationships, the Fragile Families 
study enables us to follow couples longitudinally to observe the way in which 
relationships that cross racial and class lines evolve overtime. Do such differences 
appear to discourage marriage? Do they increase the chance that the couple 
will break up? This longitudinal dimension has sometimes been studied in the 
context of marriage and divorce, but this study is the first to look at longitudinal 
transitions among a broader set of relationship types. 


Prior Research 


The large literature on assortative marriage patterns in the United States has 
been interested in interracial marriage as a measure of social distance between 
racial groups, in the exchange of racial for educational and occupational status, 
and in the evolution of educational homogamy over time (Kalmijn 1993, 1998; 
Mare 1991; Merton 1941; Qian 1997). 

Almost four decades have elapsed since the legal prohibition against interracial 
marriage was abolished, yet marriages between whites and African Americans 
remain uncommon, indicating the persistence of social distance between 
these groups (Jacobs and Labov 2002; Kalmijn 1993, 1998; Qian 1997). Using 
intermarriage as a measure of social distance, some research has found that 
the gulf between Mexican Americans and African Americans is similar to that 
between whites and African Americans, and that less distance exists between 
whites and Mexican Americans (Rosenfeld 2002). 

In recent decades, women's educational attainment has caught up with and 
even surpassed men’s, leading to new patterns of assortative mating in terms of 
education (Jacobs 1996, Mare 1995). Marriages and cohabitation in which the 
woman is better educated than the man are now more common than relationships 
in which the woman has less education than her male partner (Qian 1998). A 
great deal of research has found a strong positive relationship between men's 
earnings capacity and marriage (Bennett, Bloom and Craig 1989; Lichter et al. 
1992; Sweeney 2002; Wilson 1987, 1996). At the same time, recent literature has 
suggested that women's earning potential has become increasingly attractive to 
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male partners (Kalmijn 1998, Oppenheimer 1988, Sweeney and Cancian 2004). 
Therefore, it is possible either that male educational advantages continue to 
facilitate marriage or that a couple's collective human capital resources is more 
important (Gibson-Davis, Edin and McLanahan 2005; Kalmijn 1998; Mare 1991). 

Building upon the research on racial and educational assortative mating, 
recent research has examined assortative mating patterns as a way to reveal 
underlying differences between the institutions of marriage and cohabitation. 
Schoen and Weinick (1993) found that married couples in the National Survey of 
Families and Households (NSFH) were more likely to be close in age and religion 
but somewhat less likely to have the same educational attainment than were 
cohabiting couples. They concluded that this supported the idea that cohabitation 
and marriage were distinct institutions. |n particular, cohabitation was seen to 
be a "looser bond" than marriage in which economic characteristics were more 
important than social and cultural characteristics. 

The NSFH sample size did not allow Schoen and Weinick an in-depth look at 
interracial couples. Blackwell and Lichter (2000) used the 1990 Census to obtain 
racial and educational characteristics on nearly half a million couples. They used 
log-linear models to describe educational and racial assortative patterns of 
married and cohabiting couples. Blackwell and Lichter found that although racial 
homogamy was very strong for both married and cohabiting couples, it was even 
stronger for married couples. Unlike Schoen and Weinick, they did not find that 
educational homogamy was stronger for cohabitors than for married couples. 
Blackwell and Lichter did find that women were more likely to “marry up” in 
education than they were to "cohabit up." 

Whereas Schoen and Weinick emphasized the competing institutional nature 
of marriage and cohabitation, Blackwell and Lichter argued that the differences 
in homogamy between marriages and cohabitations were probably due in large 
part to the differential selection of couples out of cohabitation into marriage. 
Couples that "fit" together better were more likely to marry. In a subsequent 
article, Blackwell and Lichter (2004) found limited support for this “winnowing 
hypothesis." Using the National Survey of Family Growth, Blackwell and Lichter 
(2004) found slightly more homogamy in marital relationships than in cohabiting 
or dating relationships, but also found substantial education, racial and religious 
homogamy among dating and cohabiting couples. 

The winnowing hypothesis is particularly valuable in our study of Fragile 
Families, in which the current status and evolution of couples can be seen as 
a chain of selective transitions. The framework we envision is one in which 
simultaneous and sequential decisions are being made based on a combination 
of factors involving partner's characteristics, relationship type, the decision to 
conceive a child and bring it to term, the decision to change relationship types 
between conception and birth, and the decision to change relationship types 
following the birth of the child. It is clear that the relationships are not simple. 
For example, partner preferences might initially be based on preferences for 
relationship type, but relationship type preferences are surely also a function 
of the partner that one is able to find. Similarly, the decision to conceive a child 
must depend in part on who one is partnered with and what kind of relationship 
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one has with that partner. There is also a large element of chance in conception, 
and in some cases it is the pregnancy that influences relationship type as much 
if not more than the relationship type influences the decision to conceive and 
carry to term. 

Our article departs from most prior research by focusing on parents. We are 
aware of one other study that analyzed assortative mating patterns of parents 
in particular. Garfinkel, Glei and McLanahan (2002) used the National Survey of 
Family Growth to compare assortative mating patterns of married and unmarried `: 
parents with the aim of estimating the ability of unmarried fathers to pay child 
support. They find that unmarried parents are less homogamous with respect 
to education and race than married parents are. They also find that educational 
heterogamy is associated with instability of marital and non-marital relationships. 
Their study focuses on three groups of parents: married, cohabiting and not 
cohabiting; and their analysis focuses on the percent of parents marrying 
within racial or ethnic group and the mean differences in parents’ educational 
attainment and age. Our study departs from theirs by focusing on parents of 
a recent birth cohort, including longitudinal analyses, looking at partnering up 
or down in education rather than mean differences, and providing more detail 
on the relationship between particular racial and ethnic pairings and relationship 
status. Nevertheless, our findings are broadly consistent with theirs. 

Based on prior research, we derive the following set of questions that motivate ` 
Our analysis: 


1. Are married parents more assortative than unmarried 
parents? If cohabitation is a substitute for marriage, then 
we might expect not. In particular, it may well be that some 
of the differences observed in past studies between the 
assortative pairing of married and cohabiting couples are 
due to intentions about childbearing. In a sense, our sample 
of new parents controls for this difference since all of the 
couples have chosen to have a child. Alternatively, it may 
be that differences between partners are an obstacle to 
marriage, and more generally may discourage more serious 
relationships if we view the categorization of relationships as 
ordered from marriage (the most serious) to not romantically 
involved (the least serious). If this were the case then we 
might expect progressively more heterogamy as we move 
toward less serious relationships. 


2. Are some racial or ethnic differences more salient 
than others? Because of the relatively high proportion of 
interracial couples in the Fragile Families sample, we have 
a chance to see if the major color line is between black 
and non-black, white and non-white, or Hispanic and non- 
Hispanic parents. 


Parenting Across Racial and Class Lines * 125 


3. For new parents, do traditional sex roles in which the 
father is more educated than the mother encourage 
marriage and relationship stability? This question will be 
increasingly important in the future, as women (particularly 
African-American women) have higher educational 
attainment than their male counterparts. Alternatively, some 
research suggests that, at least for cohabitors, similarities 
in educational attainment may encourage relationship 
stability and that women's educational advantage may 
have a destabilizing effect on relationships (Brines and 
Joyner 1999). 


4. Is the effect of assortative characteristics symmetric for 
entry into marriage and for disruption? We would hypothesize 
that educational differences might have a larger effect on 
disruption than on entry into marriage, especially when 
contrasted with racial differences. This is because racial 
differences are strongly sanctioned against at the time of 
marriage, whereas challenges stemming from educational 
differences may emerge with time. 


Data and Methods 


The data for this article come from the Fragile Families and Child Wellbeing study. 
Fragile Families is a longitudinal study that follows a birth cohort of approximately 
3,700 children born to unmarried parents with a comparison sample of 1,200 
children born to married parents. The unmarried sample is representative of non- 
marital births in large cities. Focusing on urban areas allows us to gain unusual 
detail on the lives of poor and minority parents. About 35 percent of the sample 
received welfare or food stamps in the year before the birth, 62 percent of births 
were covered by Medicaid, and almost 80 percent of the sample is non-white. 
In our analyses, we apply weights to adjust for the oversampling of non-marital 
births in the Fragile Families study. ' 

Baseline interviews with mothers were completed in the hospital shortly after 
the birth. Fathers were also interviewed soon after the birth, either at the hospital 
or as soon as possible thereafter. Contacting parents soon after the birth of their 
child produced a relatively high response rate for fathers, even those who were 
not married to the mothers. Baseline interviews with fathers were completed for 
89 percent of married couples, 90 percent of cohabiting couples, 73 percent of 
romantic couples, and 40 percent of not involved couples. Baseline interviews 
provide data on mother and father’s race, ethnicity and educational attainment. 
When father interviews were not available, we relied upon mother's reports of 
father's race, ethnicity and educational attainment. Follow-up interviews were 
administered at 12 and 30 months after the birth. We rely on mother reports 
of relationship status at the 30-month follow-up in our longitudinal analysis. 
The longitudinal analysis includes a smaller sample than the baseline analysis 
because of attrition. The response rate to the 30-month survey was 86 percent. 
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Parents' race and ethnicity were based on two questions on the baseline 
survey: "Which of these categories best describes your race?" and "Are you of 
Hispanic or Latino origin or descent?" On the initial question, respondents were 
permitted to mark only one category. We classified parents into three racial/ethnic 
groups: Black (non-Hispanic), Hispanic and White (non-Hispanic).? Other racial 
and ethnic groups were not sufficiently large to analyze. Parents were grouped 
into four educational categories: less than high school, high school only, some 
college, and college graduate — all based on reports on the baseline survey. 

The dependent variables in our analysis are measures of relationship status 
at different points in time based on mother reports. We analyze the relationship 
between parent's characteristics and their relationship status at the time of birth 
(baseline) and at the 30-month follow-up. Table 1 shows descriptive statistics for 
the dependent variables of interest: parents' relationship at birth and relationship 
at 30 months. As shown, about half of parents were married at birth, about one- 
quarter were cohabiting, and the other quarter were either dating or not romantically 
involved. Between baseline and 30 months, some marriages ended and some 
new marriages formed. At the 30-month follow-up point, 53 percent of parents 
were married, 27 percent were not romantically involved, and the remainder were 
cohabiting or dating. Our analysis focuses on the parental relationship and does 
not consider marriages to new partners, which were relatively infrequent. About 2 
percent of mothers married a new partner and 10 percent were living with a new ` 
partner at the follow-up. These mothers are included in the category indicating 
that the parents were not romantically involved. 

Our analysis examines whether parents' individual and joint characteristics 
help to explain variations in relationships at birth and in relationship transitions 
between baseline and 30 months. We use descriptive distributions and 
multinomial logistic regression to analyze parents' relationship status at the 
time of birth. For our longitudinal analysis, we use logistic regression models 
to analyze relationship transitions between birth and the 30-month follow-up. 
We analyze three relationship transitions between birth and 30 months: from 
cohabiting or romantically involved to married, from cohabiting or romantically 
involved to not romantically involved, and from married to no longer married. 

Table 1 also contains descriptive statistics on parents' individual and joint 
race and education characteristics. About 90 percent of couples were racially 
homogamous, i.e., the mother and father self-reported belonging to the same 
race/ethnic group. The other 10 percent of couples had crossed race/ethnic lines 
in their relationships. Parents were much more likely to differ in their educational 
attainment than they were in their race/ethnicity. More than half of couples 
consisted of mothers and fathers with similar levels of educational attainment. 
When parents’ educational attainment differed, mothers were more likely to 
have the higher level of education: fathers had more education in 20 percent of 
couples and mothers had more education in 27 percent of couples. 

In our baseline and longitudinal analyses, we incorporate a series of control 
variables that may be related both to the assortative race/class pairing of parents 
and to parents’ relationship outcomes. Not all parents in our sample were first- 
time parents, so we control for whether parents had previous children together 
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Table 1: Sample Characteristics 


e 


Percentage (weighted 
Dependent Variables 
Parents' relationship at birth 


Married 52.0 

Cohabiting 24.8 

Romantically involved 17.4 

Not romantically involved 5.8 
Selected relationships at 30 months 

Married 53.4 

Not romantically involved 27.1 

Independent Variables 
Mother is Black 40.5 
Mother is Hispanic 32.0 
Mother is White 275 
Father is Black 41.2 
Father is Hispanic 33.3 
Father is White 25.5 
Couple is racially homogamous 90.3 
Mother less than high school 27.0 
Mother high school 24.2 
Mother some college 28.0 
Mother college degree 20.7 
Father less than high school 28.3 
Father high school 28.4 
Father some college 25.6 
Father college degree 17.8 
Couple is educationally homogamous 52.6 
Father has more education 20.4 
Mother has more education 21.1 
Control Variables 

Parents have previous children together 46.4 
Mom has child with another partner 26.3 
Dad has child with another partner 29.1 
Mom worked last year 69.4 
Dad worked last week 83.0 
Birth was twins 1.8 
Mother was previously married 10.0 
Mother age 14-19 years 10.5 
Mother age 20-24 years 28.1 
Mother age 25-29 years 25.9 
Mother age 30-34 years 21.2 
Mother age 35 plus years 14.2 
Mother older than father by 5 years or more 4.0 
Father older than mother by 5 years or more 27.6 


Father older than mother by 10 years or more le 


Notes: n = 4373. Percentages are weighted to correct for the undersampling of marital births in the 
Fragile Families and Child Wellbeing study. 
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and for whether parents had children with other partners. It is also worth noting 
that about 25 percent of our sample had subsequent children during the 30- 
month follow-up, which could be both a cause and effect of parents' relationships 
status. Because of the endogeneity of subsequent children with respect to 
relationship status, we do not include subsequent fertility as a control variable. 
We do include additional control variables, displayed in Table 1, for mother and 
father's employment, twin births, previous marriage by mother, mother's age 
group and the difference between the parents' ages. 

For the analysis of how racial/ethnic assortative pairing affects relationship 
status at birth and 30 months later, we estimate predicted probabilities of 
relationship status for couples with each possible combination of mother 
and father's race/ethnicity. These predicted probabilities are estimated for a 
hypothetical couple with the following modal characteristics: both parents have 
a high school degree and no further education, mother and father are first-time 
parents, mother did not give birth to twins, mother has not been previously 
married, mother was employed in the year prior to the baby's birth, father was 
employed in the week prior to the baby's birth, mother is between 20 and 24 years 
old, and father is within five years of mother's age. Fixing the control variables 
at their modal values allows us to estimate the effects of race/ethnic pairings on 
relationships net of potential confounding characteristics such as parents' age, 
education or employment. 

For the analysis of how educational assortative pairing affects relationships, 
we estimate predicted probabilities of relationship status for couples with each 
possible combination of the mother and father's educational attainment group. 
These predicted probabilities are estimated for a hypothetical couple similar to the 
couple described above except that mother and father's race is held constant at 
white and educational characteristics are allowed to vary at their observed values. 


Analysis and Results 
Assortative Pairing of Parents at the Time of Birth 


Racial Homogamy 

Figure 1 shows the distribution of relationship type at the time of the birth 
by the race of the father and the mother. The bars represent the predicted 
probabilities of marriage, cohabitation, romantic involvement and being split 
up, holding constant everything but race. Predicted probabilities are estimated 
using a multinomial logistic regression model that predicts relationship type as a 
function of mother and father's race, the interaction of mother and father's race, 
and the control variables.? We focus on the predicted probabilities rather than the 
observed proportions, because we are interested in the effects of parents' race 
on relationship status net of potential confounding variables. 

The diagonal panels of Figure 1 show the distributions of relationship status 
for homogamous white, black, and Hispanic couples. The modal relationship at 
time of birth differs by race: married for white couples, romantically involved for 
black couples, and cohabiting for Hispanic couples. 

Next, we consider whether parents' relationship status is affected when 
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parents partner with someone from a different race/ethnic group. The figure 
suggests that, from the mothers' perspective, crossing race or ethnic lines has a 
large effect on the relationship with father at the time of birth. For white mothers, 
crossing race/ethnic lines reduces the seriousness of relationship with father at 
birth. For black mothers, crossing race/ethnic lines increases the seriousness of 
relationship with father at birth. For Hispanic mothers, crossing race/ethnic lines 
increases the seriousness of the relationship if the father is white and decreases 
the seriousness of the relationship if the father is black. 

We see a very different pattern of results when we take the fathers' 
perspective. For fathers, crossing race/ethnic lines does not have much effect 
on the seriousness of relationship with mother at birth. The distribution of 
relationships for white fathers with white mothers is similar to the distribution for 
white fathers with Hispanic mothers. (Although the distribution for white fathers 
with black mothers looks much different, this group only consists of 20 couples 
and is too small to provide reliable estimates.) For black and Hispanic fathers, the 
distribution of relationship types is roughly similar whether mothers are white, 
black or Hispanic. 

The figure suggests that relationship status at baseline depends mainly on 
father's race rather than on whether mother and father's race/ethnicity differs. 
We conducted statistical tests to determine whether a model with father's race/ 
ethnicity-only fit the data as well as a model with the full set of main effects and 
interaction terms for mother and father's race/ethnicity. Goodness of fit tests 
found that there is no statistically significant pattern beyond fathers' race with 
the exception of the father white and mother black category, a category for which 
we have so few observations that we are reluctant to generalize. 

Our findings help explain those of Garfinkel, Glei and McLanahan (2002), who 
found that white mothers were less likely to be with white fathers when they 
were cohabiting than when they were married. Garfinkel, Glei and McLanahan 
(2002) did not find a statistically significant difference in homogamy between 
married and unmarried Hispanic or black mothers. Our findings suggest that for 
Hispanic mothers this may conceal offsetting effects: Hispanic mothers married 
less when they had children with black fathers, but married more when they 
had children with white fathers. There may also have been offsetting effects for 
black mothers, but the small sample size for the black mothers with white fathers 
makes it difficult to draw a firm conclusion. 

Our findings differ somewhat from Blackwell and Lichter's analysis (2000) 
of homogamy parameters. Like us, they found that the tendency of whites to 
have a same-race partner was stronger for those who married than those who 
cohabited. However, they also found greater homogamy among married than 
cohabiting blacks. This consistency allows them to conclude that "cohabiting 
couples are significantly less racially endogamous than married ones." (Blackwell 
and Lichter 2000:286) Our analysis, however, suggests that this is not generally 
true for new parents. Indeed, for fathers, the race/ethnicity of their partner has no 
apparent influence on relationship status. 

The difference between Blackwell and Lichter's analysis and ours is not simply 
that we are dealing with parents as opposed to couples in general, but also 
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because our approach has a different underlying theoretical model. When we 
use their method and estimate homogamy parameters for the Fragile Families 
population, we also find more homogamy for blacks who marry than for blacks 
who were in less serious relationships. The interpretation of results thus depends 
on whether one relies on homogamy parameters or the predicted probabilities 
of same-race unions. The homogamy models control for the relationship-specific 
marginal distributions of race, implicitly assuming that individuals are sorted into 
relationship types even before they have found a partner. The behavioral model 
we have in mind, on the other hand, is one in which relationship type is at least in 
part determined by whom one finds as a partner. We thus view the relationship 
type as a dependent variable driven in part by homogamy, as opposed to the log- 
linear models used by Blackwell and Lichter, which allow homogamy estimates 
to be driven by the racial composition of those in each relationship type. 


Educational Assorting 

Next, we examine how partnering patterns by education differ by relationship 
type. In particular, we are interested in whether different educational attainment 
levels are an obstacle to couples forming stronger relationships. We conducted 
a similar analysis using income categories of mothers and fathers, which yielded 
consistent findings. Because of the consistency in income and education results, 
we present only the results of our education analysis. 

Figure 2 shows the distribution of relationship types by education of the 
mother and father. The clearest pattern is that the proportions marrying increase 
as the educational attainment of either the mother or the father increases, which 
can be seen as we move from the upper left to lower right of the figure. At lower 
educational levels of mother and father, cohabitation is as common as marriage. 
At higher educational levels marriage is almost universal. 

By graphing all of the educational combinations we can also see that the 
diagonal does not stand out in any way. This means that there is no obvious 
interaction between mother and father's educational attainment. Looking off the 
diagonals we see further that the effect of mother's and father's education is 
largely symmetric. In particular, we see that when father's education is higher 
than mother's it does not tend to make the relationships more serious than when 
mother's education is higher than fathers. In fact, marriage is more common 
when mother has some college or a college degree and father has a high school 
degree than when the father has the equivalent education advantage. 


Longitudinal Results 


Our longitudinal analysis examines whether couples with similar characteristics 
are more likely to marry and less likely to break up than couples whose 
characteristics are dissimilar and determines which differences represent the 
greatest impediment to relationships. 

We begin by analyzing transitions to marriage or to splitting up among couples 
who were cohabiting or romantically involved at birth. We exclude parents who 
were no longer romantically involved at the time of birth from those at risk of 
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marriage or relationship dissolution. We use logistic regression models in which 
we regress married at follow-up (or split up at follow-up) on parents' individual and 
joint race/ethnic and education characteristics and an array of control variables. 
Then, we analyze transitions to splitting up among couples that were married 
when their babies were born. For the married sample, we regress split up at 
follow-up on parents' individual and joint race/ethnic and education characteristics 
and control variables. 

Prior research has shown that parents' individual race and class attributes 
have a large influence on whether they marry or separate following a non- 
marital birth (Carlson, McLanahan and England 2004; Harknett and McLanahan 
2004). Because the main focus of our analysis is on the effects of parents' joint 
characteristics, we do not discuss in detail the effects of individual race and 
class characteristics or display these results in tables. However, consistent with 
the analysis of relationship at the time of birth, being white or Hispanic and 
having a college education were associated with marriage and staying together, 
whereas being black or having lower educational attainment were associated 
with remaining unmarried or breaking up. 


Racial Homogamy 

Table 2 shows the association between racial homogamy and relationship 
transitions between birth and 30 months. The first panel of Table 2 shows the 
predicted probability of marriage at follow-up for couples that were cohabiting 
or romantically involved at baseline. The rows represent six possible racial/ethnic 
pairings: homogamous black, Hispanic and white couples, and heterogamous 
couples consisting of one black and one Hispanic parent, one black and one white 
parent, and one Hispanic and one white parent. The tables do not make distinctions 
based on mother as opposed to father's race or ethnicity, because separate 
analyses determined that the effects of crossing racial or ethnic lines were largely 
symmetric with respect to gender in the period following the baby's birth. Although 
father's race seemed to dominate in affecting relationship status at the time of the 
birth, this no longer seemed to be the case after babies were born. 

The first column shows the predicted probability of marriage between birth and 
30 months with education held constant at high school and the control variables 
fixed at their modal values. The second column in the table shows the odds of 
marriage for couples that have crossed racial/ethnic lines relative to homogamous 
couples. For example, the second row of Table 2 indicates that the odds of marriage 
for black/Hispanic couples are 1.36 times that of homogamous black couples 
(not significantly different). Each of the three types of heterogamous couples 
are repeated in two separate rows so they can be compared with homogamous 
couples from the racial or ethnic group of each parent. 

The first panel of Table 2 shows that, for black parents, crossing racial lines 
does not lower the probability of marrying following a non-marital birth. Couples 
in which either parent was black were at least as likely to marry as couples in 
which both parents were black. For white and Hispanic parents, crossing racial 
lines lowers the probability of marriage when one parent in the couple is black. 
The odds of marriage for couples consisting of one Hispanic and one black 
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Table 2: Parental Transitions to Marriage or Splitting Up between Birth and 30 Months by 


Race of Mother and Father 


Transitions from Cohabiting or Romantically involved to Marriage 


Parents' Joint Predicted Odds relative 

Characteristics probability’ ^— to homogamy z-stat n 
Black-Black .13 — — 1340 
Black-Hispanic A 1.36 (1.07) 122 
Black-White Mz 1.36 (.79) 64 
Hispanic-Hispanic .28 — — 584 
Black-Hispanic H^ Doi (2.13) 122 
Hispanic-White .26 .90 (.38) 92 
White-White .28 — — 243 
Black-White A7 155 (1.47) 64 
Hispanic-White .26 OS .26 92 


Transitions from Cohabiting or Romantically involved to Split Up 


Parents' Joint Predicted Odds relative 

Characteristics probability’ to homogamy z-stat n 
Black-Black A3 — — 1340 
Black-Hispanic 43 1.03 (.14) 122 
Black-White E .64 (1.61) 64 
Hispanic-Hispanic .26 — — 584 
Black-Hispanic EN 2.14** (3.51) 122 
Hisp-White 42 20955 (2.97) 92 
White-White ez — — 243 
Black-White .32 1.01 (.03) 64 
Hispanic-White 42 1.53 (1.64) 92 

Transitions from Married to Split Up 

Parents’ Joint Predicted Odds relative 

Characteristics probability’ to homogamy z-stat n 
Black-Black .30 — — 235 
Black-Hispanic .39 1.48 (.75) 26 
Black-White .30 .98 (.02) 12 
Hispanic-Hispanic gä — —- 204 
Hispanic-Black .39 4.41** (2.61) 26 
Hispanic-White 32 3.34" 2.37) 54 
White-White 24 — — 413 
White-Black .30 dee (.34) 12 
White-Hispanic .32 152 .92 54 





Notes: "Control variables (other children, children with other partners, twins, previous marriag- 
es, mother employment, father employment, age group and age difference) fixed at modal values 
and education fixed at high school. 

BEE 
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parent were half that of couples in which both parents were Hispanic. Although 
white and black parents who crossed race/ethnic lines appear to be less likely 
to marry than their counterparts in racially homogamous white relationships, this 
difference was not statistically significant perhaps because of limited sample 
size. Crossing the white-Hispanic line does not reduce the odds of marriage 
relative to homogamous white or Hispanic couples. 

The middle panel of Table 2 shows that unmarried Hispanic parents who 
crossed racial or ethnic lines were more likely to break up by the 30-month 
follow-up compared with homogamous Hispanic couples. Although crossing the 
white-Hispanic line did not discourage marriage, crossing this line increased the 
odds that an unmarried couple would break up. Black parents who crossed racial 
or ethnic lines were no more likely to break up than racially homogamous black 
parents. Perhaps surprisingly, the predicted probability of breaking up for black- 
white couples was relatively low and was similar to the predicted probability 
of breaking up for white couples. We suspect this may be a selection effect; 
black-white couples who have children together may have particularly resilient 
relationships. However, we are cautious in interpreting the findings for black- 
white couples because of the small sample size (n = 64) for this group. 

We also examined whether parents' joint race and ethnic characteristics 
affected their marital stability for the sample of parents who were married when 
their babies were born. Mirroring the results on unmarried parents splitting up 
after the birth, we find that married Hispanic parents have lower odds of splitting 
up than Hispanic-black or Hispanic-white couples. The effect of crossing race/ 
ethnic lines for married Hispanics was dramatic. The odds of breaking up for 
Hispanic parents who crossed race/ethnic lines were three to four times that 
of Hispanic-Hispanic couples. Racial homogamy was not significantly related to 
marital stability for black or white parents. 

The effects of racial homogamy on transitions to marriage and on transitions 
to breaking up were mostly consistent. In each case, crossing race or ethnic lines 
did not affect the relationship transitions of black parents, and in each case the 
effects of homogamy in encouraging marriage and discouraging break ups were 
largest for Hispanic parents. Although crossing the white-Hispanic line was not 
an impediment to marriage formation, it was associated with break ups. 


Educational Assorting 
In Table 3, we examine the association between parents' joint educational 
characteristics and relationship transitions. Unmarried parents' joint education 
characteristics had only weak effects on marriage formation. Unmarried mothers 
who "partnered up" with respect to education were slightly more likely to marry 
than unmarried mothers whose partners had the same education, but the 
difference did not achieve statistical significance. Whether the couple had the 
same level of education, the mother had more education, or the father had more 
education, the predicted probability of marriage was similar and statistically 
indistinguishable. Unmarried parents' joint education characteristics also had no 
discernible effect on splitting up. 

We also analyzed the effects of educational homogamy and hypergamy on 
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Table 3: Parental Transitions to Marriage or Splitting Up between Birth and 30 months by 
Education of Mother and Father 


—c-s s € —————————— — 


Transitions from Cohabiting or Romantically involved to Marriage 
Predicted Odds relative 


Parents’ Joint Characteristics probability’ to same z-stat n 
Father more education .34 1.19 (1.21) 628 
Same education sel — — 1167 
Mother more education 202 1.08 .56 650 


Transitions from Cohabiting or Romantically involved to Split Up 
Predicted Odds relative 


Parents’ Joint Characteristics probability’ to same z-stat n 
Father more education .30 .99 (.11) 628 
Same education Bi — — 1167 
Mother more education .29 .95 Sal 650 


Transitions from Married to Split Up 
Predicted Odds relative 


Parents’ Joint Characteristics probability’ to same z-stat n 
Father more education E 1.24 (.73) 178 
Same education .14 = = 923 ` 
Mother more education .20 1.59 (1.76) 243 


Notes: 'Control variables (other children, children with other partners, twins, previous mar- 
riages, mother employment, father employment, age group and age difference) fixed at modal 
values and race fixed at White. 

“p< 05 2p < 01 


marital stability for parents who were married at birth. Couples with the same 
level of education had the lowest probability of marital dissolution. Marriages 
in which the mother had more education than the father were the least stable: 
the probability of marital dissolution was .20 for these couples compared with 
.14 for couples with the same education and .17 for couples in which the father 
had more education. Although differences across these categories were not 
statistically significant, the pattern of results is suggestive. These findings are 
in part consistent and in part inconsistent with expectations based on traditional 
gender roles in marriage. On the one hand, the marriages that are least traditional 
in terms of parents' educational characteristics, those in which mothers' 
educational attainment exceeds fathers, are the least stable. On the other hand, 
marriages in which parents are on equal footing in terms of education are the 
most stable. The most traditional couples, in which fathers have the educational 
advantage, fall somewhere in between. Similar to Brines and Joyner's (1999) 
findings for cohabiting couples, our results are consistent with the idea that 
gender roles have shifted to the point where women are permitted to have the 
same education as men, but relationships are strained when women have more 
education than their partners. Still, further research with larger samples is needed 
to confirm whether this pattern holds for the population as a whole. 
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Conclusion 


In this article, we have examined the potential racial and class barriers that might 
prevent new parents from marrying using new data from the Fragile Families and 
Child Wellbeing Study in 20 U.S. cities. In the study, about 10 percent of couples 
who recently had a baby together had crossed racial or ethnic lines. Almost half 
had a baby with someone who had a different level of educational attainment. We 
examined whether crossing race and ethnic or educational lines was associated 
with less serious relationships between parents at the time of a baby's birth, and 
whether crossing these lines discouraged subsequent transitions to marriage 
or were associated with relationship dissolution. Because we are looking at a 
sample of new parents in particular, these results have direct implications for the 
composition of the next generation and for the reproduction of inequality. 

In spite of social pressures for like to marry like, we find that differences 
between a mother's and a father's educational attainment among new parents 
do not discourage marriage. It appears that the sum of the mother's and father's 
educational attainment is much more important in influencing marriage than are 
disparities between the two parents. Higher educational attainment increases 
relationship seriousness whether or not one's partner has higher educational 
attainment. In particular, contrary to what one might expect from the traditional 
breadwinner role that men have historically occupied, father's education has 
no more effect than mother's education on relationship seriousness. Instead, 
it appears that the total amount of resources matter much more than provider's 
gender. This is consistent with the qualitative evidence from Fragile Families in 
which unmarried couples report they want and plan to get married, but have not 
already done so because of a shortage of economic resources (Gibson-Davis, 
Edin and McLanahan, 2005). 

In recent decades, female educational attainment has been rising faster 
than male educational attainment. In minority populations, women are rapidly 
achieving educational parity and even surpassing men. Our findings suggest that, 
at least among those couples who have children, an absence of a male education 
advantage will not necessarily be an obstacle to marriage formation. 

However, we hypothesized that educational differences would have a larger 
effect on marital disruption than on entry into marriage, because difficulties 
stemming from educational differences may worsen with time. Although 
differences across educationally homogamous and heterogamous groups were 
not statistically significant, the direction of effects was consistent with this 
hypothesis. Married couples with the same level of educational attainment were 
more likely to stay married than educationally heterogamous couples. Couples 
in which mother's education exceeded father's education were the least stable, 
suggesting relationship strain when typical gender roles are reversed. 

With regard to race and ethnicity, we did not find across the board that being 
with a partner of a different race was a barrier to a more serious relationship. 
Instead, the effect of racial/ethnic heterogamy varied depending on the specific 
racial and ethnic combination and on the point of view from which it was 
measured. For fathers, the relationship status at baseline depended mainly on 
their own race/ethnicity rather than on the race/ethnicity of their partner. But, 
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for mothers, the relationship status varied depending on the race/ethnicity of 
the father. White or Hispanic mothers were much less likely to be married if 
they crossed race/ethnic lines and their partner was black. For black mothers, 
crossing race/ethnic lines had the opposite effect. Black mothers were more 
likely to be married if the father was Hispanic. Relationships at birth appear to be 
dictated by the race of the father. 

These patterns were fairly consistent when we shifted our focus to transitions 
to marriage following a non-marital birth. Again, crossing racial lines per se did not 
seem to explain the differences in transitions to marriage. Instead, the important 
distinction in transition to marriage was between couples in which either partner 
was black and couples in which neither partner was black. The former group was 
less likely to marry than the latter. 

We also found evidence that crossing race/ethnic lines was associated with 
relationship instability in particular for Hispanic parents. Interestingly, although 
Hispanic-white differences did not seem to prevent marriage, they did seem to 
lead to a higher rate of relationship instability. This signifies to us that the social 
obstacles to marriage among Hispanic-white parents may be small but the practical 
difficulties of staying together may not be. Notably, out of the nine racial/ethnic 
pairings we examined, the relationships between Hispanic mothers and fathers 
seemed particularly stable whether or not they were married. Hispanic couples 
were the least likely of any group to break up after their babies were born. ) 

Because this article focused on new parents, the findings have implications 
for the intergenerational reproduction of inequality. Research has shown that 
children who grow up with stably married parents appear to be at an advantage 
relative to their peers in unmarried or unstable households. If crossing race and 
class lines interferes with marriage and relationship stability, then we can expect 
children of multiracial and interclass parents to be disadvantaged. However, our 
article finds that crossing race and class lines leads to less marriage and less 
stability only in select instances. 

At most, inter-class parental relationships seem to only slightly increase 
the chance that children will grow up with a single parent. Children whose 
parents have different levels of education are equally likely to be born into a 
married household compared with children whose parents have similar levels of 
education. Our results provide a hint of evidence that children with educationally 
dissimilar parents may be more likely to experience the disruption of their 
parents' relationship. The main story, however, is that the overall level of parental 
educational resources matters much more for children than the similarity or 
dissimilarity of mother and father's education. 

Typically, multiracial children are no more likely than their counterparts to live 
in unmarried or unstable families. At birth, children with a black father are less 
likely to have married parents than their counterparts with white or Hispanic 
fathers regardless of the race/ethnicity of the mother. Among children born to 
unmarried parents, children who have at least one black parent are less likely to 
see their parents subsequently marry, also regardless of whether these children 
are multiracial. Couples in which one parent was Hispanic and the other non- 
Hispanic are more likely to break-up than couples in which both parents are 
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Hispanic, but this appears to be a protective feature of having two Hispanic 
parents, who are less likely to break up than other same-race/ethnicity couples, 
rather than instability brought on by crossing ethnic lines. 

Although social pressures for like to marry like are very strong, we find that 
racial and class divides among new parents are not as much of a barrier to 
marriage as one might expect. 


Notes 


1. We use uniform weights within each of the 20 urban sampling areas, which 
correct for the undersampling of married births. For the married sample 
members, the weights are calculated as the number of marital births in a city 
divided by the number of marital births in the Fragile Families sample in that 
city. For the unmarried sample, the weights are calculated as the number of 
non-marital births in a city divided by the number of non-marital births in the 
Fragile Families sample in that city. After applying our weights, our sample 
approximately represents new parents in urban areas in the United States. 


2. Our classification of black/Hispanic parents as Hispanic follows the conven- 
tion of similar studies. It would be interesting to analyze the black/Hispanic 
parents as a separate group, but sample sizes were not large enough to per- 
mit a separate analysis (57 mothers and 58 fathers reported that they were 
both black and Hispanic). 


3. The detailed multinomial logistic regression results appear in Appendix Table A. 
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Appendix Table A: Coefficients from Multinomial Logistic Regression of Parents” 

Relationship Status at Birth on Parents' Race, Education and Control Variables 
Relationship at Birth (relative to married 
Cohabitation Romantic Split up 


Mother Black 2.54* 2.54 2.97* 
(247) (1.87) (2.34) 

Mother Hispanic AN 1.24* -.13 
(1.05) (2.11) (.12) 

Father Black thea 2.59** 1.30 
(3.14) (4.55) (1.68) 

Father Hispanic aon 141% 1.02* 
(2.54) (3.38) (2.05) 

Mother/Father Black -2.61* -2.15 -2.63 
(2.31) (1.47) (1.79) 

Mother Black, Father Hispanic -2.33* -2.28 -3.00* 
(2.11) (1.58) (2.14) 

Mother Hispanic, Father Black 41 -.10 .26 
(.57) (.11) (.18) 

Mother/Father Hispanic -.38 -1.52* -44 
(.75) (2.19) (.39) 

Mother high school -.07 14 -43 
(.37) (.60) (1.27) 

Mother some college -.09 .10 -.09 
(.39) (.36) (.25) 

Mother college degree -3.81** -1.93** -1.74 
(3.19) (3.06) (1.93) 

Father high school -.22 -.11 -.03 
(1.06) (.48) (.10) 

Father some college -.96** -1.20** -.45 
(4.10) (3.96) (1.26) 

Father college degree -3.19 A4 .37 
(1.62) (.56) (.43) 

Mother/Father high school -.33 -.54 -.46 
(1.12) (1.63) (1.00) 

Mother high school/ Father 99 .68 .48 
some college (2.95) (1.57) (.85) 
Mother high school/ Father 2.79 -1.22 .66 
college degree (1.38) (1.20) (.56) 
Mother some college/ Father -.86** Jones -1.09* 
high school (2.63) (2.96) (2.19) 
Mother/ Father some college -.23 -.06 -1.29* 
(.70) (.14) (2.39) 

Mother some college/ Father 1.50 -2.88** -1.35 
college degree (.75) (3.08) (1.34) 
Mother college degree/ Father 2.14 .86 .00 
high school (1.69) (1.13) (.00) 
Mother college degree/ Father 2.46* 1.11 -1.45 
some college (1.98) (1.50) (1.15) 
Mother/Father college degree 3:59 -.81 -243 
(1.54) (.79) (1.54) 

Mother and father have prior - 78" -1.56** -2.04** 


children together (7.26) (11.67) (9.61) 
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Appendix Table A (continued) 


Mother has child with another TO jaye" 40 
partner (6.31) (3.62) (1.95) 
Father has child with another aie 18553 (SOS 
partner (4.02) (6.44) (6.96) 
Mother worked last year e .01 Sp 
(5.39) (.09) (1.80) 
Father worked last week -.56™* -1.01% -2.07** 
(4.18) (6.89) (11.54) 
Birth was twins .64 112: -.36 
(1.72) (2.57) (.36) 
Mother was previously .69^* .64** E 
married (4.02) (2.94) (3.52) 
Mother less than 20 years old Elle TIS POS 
(2.80) (3.80) (2.78) 
Mother 25-29 years old -.99** DË e -.79** 
(8.09) (9.29) (3.68) 
Mother 30-34 years old -1.51* -1.84** -1.50** 
(1.43) (9.97) (5.42) 
Mother 35 years or older sy -1.44** -1.31** 
(7.61) (6.70) (3.97) 
Mother 5 or more years older .30 .30 62 
than father (1.33) (1.06) (1.67) 
Father 5 or more years older -.09 -.28 21 
than mother (.75) (1.95) (1.13) 
Father 10 or more years older 15 .02 -.60 
than mother (.80) (.10) (1.88) 
Intercept .04 -.39 -.25 
(.17) (1.26) (.66) 
N 4373 
Likelihood ratio chi-square 2899.36 
P-value .000 
Pseudo R-squared .2872 


A ars P A Lu Du 
*p « .05 **p « .01 
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The Linking of Sociology and Biology 


Guest Editor: Guang Guo, University of North Carolina at Chapel Hill 
Why Biology? 


In the academic world, sociology and biology have long been separate sciences. 
The benefits of combining the two seem few. 

Biological influences are potentially important to certain areas of sociological 
inquiry — especially areas that involve individual attributes, traits and behaviors 
such as delinquency and crime, educational attainment and social stratification. 
These influences, however, are rarely considered explicitly. One of the implicit 
assumptions in sociological inquiry is that individuals are the same at birth; 
the differences among them are then attributed to the position each occupies 
in a social hierarchy. At least two factors contribute to the absence of explicit 
discussion on biological influences: (1) our discipline's key theoretical emphasis 
on a group's social-structural position and (2) the unavailability of reliable measures 
of relevant biological influences. Both can be illustrated by the development of 
sociological theories of delinquency and crime. 

One dominant sociological theory of delinquency and crime is the theory of 
social control (Durkheim 1897, Hirschi 1969, Sampson and Laub 1993) which 
emphasizes the social bond between an individual and society and suggests that 
a crime is more likely to be committed when the bond is weakened or broken. 
Gottfredson and Hirschi later (1990) expressed a keen awareness that individuals 
differ in propensity for delinquency and crime. The researchers considered crime- 
prone individuals to lack self-control and to be impulsive, insensitive, physical 
and risk-taking. 

The propensity has proved difficult to measure. Gottfredson and Hirschi 
suggested a number of indicators of the crime-prone propensity: the urge to 
gratify desires immediately, lack of diligence and persistence in a course of action, 
lack of commitment to job, marriage, and children, lack of skills and planning, and 
the tendency to drink excessively, use illegal drugs, and gamble. While these 
individual characteristics are, indeed, correlated with delinquency and crime, 
they are the manifestations of the propensity rather than the propensity itself 
and may, in fact, have their beginnings in biological rather than social differences. 
However, in the 1980s, measuring criminal propensity at the molecular genetic 
level was hardly thinkable. 

Two developments, one recent and one historical, suggest that the fields of 
sociology and biology would benefit from the linking of the two fields. The first 
is the spectacular advance in molecular genetics and molecular biology over the 
past three decades. Second, relative to earlier societies, genetic influences are 
becoming more prominent in modern industrial societies. | will return to the first 
development after describing my argument for the second development. 

Viewed in Lenski's (2005) lens, the long history of human society can be 
divided into four distinct stages: the hunter-gatherer society, horticultural society, 
agrarian society and industrial society. | believe that the level of genetic influences 
has a U shape across the four historical stages (Figure 1) for some sociological 
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outcomes (e.g., status attainment), starting high in the HG society, reaching the 
bottom in the agrarian society, and becoming more important in the industrial, 
especially the contemporary industrial democratic society. In a hunter-gatherer 
society, the level of technology was minimal and the social structure was much 
simpler and more flexible. Genetic endowment likely played a prominent role in 
determining an individual's social status. 


Figure 1. Genetic Influences more Prominent in Current Developed Societies 


Level of Genetic Influences 


High 


Low 





Hunting & Gathering Horticultural Agrarian Early Industrial Current 


The most rigid social hierarchies were probably formed in agrarian societies. 
In the extreme case of the Indian caste system, an individual's occupational and 
marital prospects were often fixed at birth. In a contemporary developed society, 
genetic influences are growing more important again. This is not to deny the 
enormous importance of social forces in contemporary industrial societies. Social 
forces are important in any society, but genetic influences are becoming relatively 
more prominent as a society become more egalitarian in terms of opportunities. 

Recent advances in molecular genetics have created opportunities and 
challenges for both sociology and biology. For sociology, it is beginning to be 
possible to measure genetic differences across individuals relevant to sociological 
inquiry. Animal studies are already engaged in a comprehensive understanding of 
the molecular basis of social life (see review by Robinson et al. 2005). Research 
is identifying genes that influence animal social behavior. Human studies have 
reported genetic effects that interact with social environment to influence, for 
example, delinquent and criminal behavior (Caspi et al. 2002). 

Biomedical researchers have increasingly recognized that "few diseases 
or conditions are caused purely by genetic factors; most are the result of 
interactions between genetics and environmental factors (IOM 2006)." Intense 
research in molecular genetics over the past 20 years has discovered more than a 
thousand genes responsible for Mendelian human outcomes - outcomes mostly 
determined by alleles of a single gene (Risch 2000, Botstein and Risch 2003). 
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Examples of such human outcomes include Huntington's disease, cystic fibrosis, 
hereditary non-polyposis colon cancer and heritable breast cancers. 

Molecular genetic efforts have been much less successful on Non-Mendelian or 
complex human outcomes. Many of these outcomes, including reading disability, 
smoking, alcohol use, drug use and obesity, are subjects of interest to sociologists. 
The links between genetic heritage and complex human outcomes are enormously 
complicated, typically involving multiple genes, multiple environmental factors 
and the interaction between the two. Inconsistent findings from genetic studies of 
complex outcomes are not uncommon The immensely complicated environment 
is frequently an essential source of the inconsistencies. For example, different 
samples, which are often small and convenient, are often subject to different 
environments, which may enhance or suppress a genetic effect. 

There has been an increasing consensus that social scientists' expertise in 
social context is necessary for understanding many of complex human outcomes 
(IOM 2006). The success of the Human Genome Project (Collins et al. 2003a ) 
and the HapMap Project (The International HapMap Consortium 2003, 2005) 
is improving the design and effectiveness of genetic studies. These advances, 
however, do not lessen the need to understand the environmental part of the 
puzzle. On the contrary, inadequate understanding of environmental influences 
has increasingly become the bottleneck for understanding human outcomes at 
the genetic molecular level. Recently, the HapMap project', the National Human 
Genome Research Institute, and the Committee on Assessing Interactions Among 
Social, Behavioral, and Genetic Factors in Health called for heavy investigation 
into lifestyle factors and environmental exposures. 


Contributions to the Special Issue 


The article by Alan Booth, Douglas A. Granger, Allan Mazur and Katie T. Kivlighan 
is a review on testosterone and social behavior in the area of social status, 
aggression and anti-social behavior, depression, and family and friendship 
relationships. Booth and colleagues highlight the new paradigm shift from 
biological determinism to a biosocial model in which social environment often 
plays a key mediating role in the hormones-behavior associations. They suggest 
that the technical development of measuring hormones via non-invasive saliva 
could make it easier to incorporate hormonal measures into sociology. Booth et 
al. also reviews Udry's work in these areas. Like Booth and his colleagues, over 
the past 20 years or so, Udry and his colleagues have sustained a highly fruitful 
research program on hormones and social behavior and have set a high standard 
for all who are interested in the empirical work linking sociology and biology. 

Sapolsky's article provides a counterbalance over the current fascination 
in biological sciences in the genes and genome as the ultimate levels of 
explanations of life. Sapolski describes a case of behavioral plasticity among non- 
human primates. For example, the social hierarchy of the savanna baboons has 
been the textbook example of fierce aggression and sharp stratification. Yet, in a 
period of a few years, they transformed into a culture of low aggression and high 
affiliation. To Sapolski, the evidence points to a highly flexible non-human primate 
culture and suggests a limited role for genetic heritage. 
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Sapolski is also well-known for his work on social rank, stress and health 
outcomes among animals (Sapolski 2005). His work has shown that the 
rank in social species can greatly influence the quality of life. Lower ranked 
individuals tend to be more stressful; long-term stress can heighten the risk of 
numerous diseases or exacerbate such preexisting diseases as hypertension, 
atherosclerosis, insulin-resistant diabetes, immune suppression, reproductive 
impairments and affective disorders. 

Blute's article is more theoretically oriented. The co-evolution between genes 
and culture is not a new orientation. However, Blute attempts to more effectively 
conceptualize gene-culture co-evolution in an effort to capture the fundamental 
kinds of relationships that are possible between genes and culture. 

Nielsen's article addresses the central sociological theme of status attainment. 
Nielsen is primarily concerned with the "confounding of environmental and genetic 
influences," that is, if the environmental variables are substantially correlated 
with genetic influences, the influences of environment should be exaggerated. 
He uses a behavior genetic model based on data with six types of adolescent 
sibling pairs living in the same household. The analysis of three outcomes (verbal 
IQ, grade point average and college plan) all yielded a large genetic component, 
a relatively small shared environmental component, and a large unshared 
environmental component. To the author, the evidence for genetic influences on 
educational outcomes is obvious; incorporating genetic influences will improve . 
the classic sociological status attainment model. Like similar work done before 
Nielsen, the genetic influences were estimated from genetic relatedness among 
siblings; this would make it more difficult to test the idea of estimating ‘purer’ 
environmental influences while controlling for genetic influences. 


Final Remarks 


The classic sociological thoughts were developed in a historical time when the 
Capitalist society was much more sharply stratified, individuality as contrasted 
with group was much less important, and social ranking was more determined 
by a person's position in a social hierarchy than in a contemporary democratic 
industrial society. In such a historical time, inherent individual difference could 
often be safely ignored in sociological consideration. Social hierarchy remains 
important today, but in many cases, biological differences across individuals 
may become too important to ignore. Taking genetic heritage or other biological 
factors into account promises a fuller understanding of social outcomes and a 
more precise understanding of the roles of social context. Recent advances in 
molecular biology are making it possible to explore how the interactions of social, 
behavioral and genetic factors affect sociological outcomes. 


Note 


1. The International HapMap Project is a collaborative effort by scientists and 
funding agencies from Japan, the United Kingdom, Canada, China, Nigeria 
and the United States. The purpose of the project is to identify and catalog 
genetic similarities and differences in human beings. The information from 
the HapMap will facilitate researchers in finding genes that affect health, 
disease and individual responses to medications and environmental factors. 
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Gene-Culture Coevolutionary Games 


Marion Blute, University of Toronto at Mississauga 


Abstract 

Gene-culture interactions have largely been modelled employing population genetic-type 
models. Moreover, in the most notable application to date, the “interactive” modes have 
been one way rather than bidirectional. This paper suggests using game theoretic, fully 
interactive models. Employing the logic utilized in population ecology for coevolution 
between two species, the basic kinds of coevolutionary interactions possible between 
biological and sociocultural strategies are mutualism, in which both are favored to match; 
competition, in which they are favored to unmatch; and antagonism, in which one is 
favored to match and the other to unmatch, in two possible ways. Hypothetical examples 
based on life history characteristics, ones emphasizing the importance of the mode of 
cultural transmission, and possible complications are discussed. 


Few thoughtful life or social scientists still believe that human behavior, mind, 
culture and social organization can be explained by purely biological or purely 
sociocultural processes. Of course there are some holdouts. On the social 
science side, some continue to doubt that biologically inherited differences can 
have any relevance to matters of interest to them. More than 20 years ago, the 
late John Maynard Smith responded to this objection as well as anyone has 
before or since. 


"There is no single case in which it is known thata difference 
in a human gene causes a difference in behaviour. This is 
at the same time true (if one excludes cases of physical or 
mental defect) and irrelevant. All that is needed if models 
of this kind are to be relevant is that there should be some 
additive genetic variance for the behavioural trait in question. 
This, too, is hard to establish, basically because one cannot 
treat man like a laboratory animal. It would, however, be 
very odd if it were not true; experience suggests that there is 
additive genetic variance for almost every characteristic that 
has been studied (including behavioural ones), and | see no 
reason why our species should be an exception. However, 
| can see good reasons why the heritability of human 
behavioural traits may prove to be small." (1982a:21) 


On the life science side, some have difficulty grasping that the existence of 
a "second inheritance system" (Boyd and Richerson 1985) in the form of social 
learning (in micro terms) or culture (in macro terms) implies that variation and 
selection are inevitable, and hence that culture, quite literally, evolves. One of the 
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best remedies for that skepticism is a brief glance at historical linguistics with its 
some 200 language families, the more recently discovered super families and so 
on. It all reveals "trees," products of "descent with modification," as impressive 
a set of phylogenies and ones understood in at least as much detail as any 
biological phylogeny. It is also worth noting that most of the great evolutionary 
biologists of our time have been aware of the potential significance of cultural 
evolution (Dawkins 1976:191-201, 322-31; Maynard, Smith and Warren 1982; 
Williams 1992:15-16, 18-19; Maynard, Smith and Szathmary 1995:309) and sorne 
have even made it a focus of their work (Cavalli-Sforza and Feldman 1981). In 
short, as Whiten (2001:359) put it: 


"That there are just two forms of behavioral evolution, 
occurring through genetic and cultural transmission 
respectively must rank among the most exciting and 
fundamental discoveries . . . achieved over the last century 
and a half." 


Despite the reality of both forms of evolution, it is still not obvious that genes 
and culture necessarily coevolve, i.e., evolve in interaction with each other to any 
significant extent. Perhaps biological processes were largely responsible for the 
evolution of human behavior early in the history of our species and sociocultural ` 
processes more recently, leaving a quite traditional division of labour between 
the life and social sciences (despite the fact that genetic adaptation to our early 
environment may have left its mark as much research in evolutionary psychology 
suggests). That scenario cannot be true in its simplest form, however, because 
cultural inheritance and evolution is widespread in birds and mammals and hence 
definitely antedates the existence of our species. (For overviews, see McGrew 
1992; Whiten et. al.1999; and McGrew 2004 on chimpanzees; and Rendell and 
Whitehhead 2001 on Cetaceans.) 

Cultural transmission and evolution have been with us from the beginning. 
On the other hand, early forms of such were undoubtedly limited, based as they 
were on social learning by observation, largely an analogue process. Cumulative 
evolution requires the inheritance of a large number of possible states (Maynard 
Smith and Szathmary's 1995 "unlimited inheritance"), i.e., requires digitally 
encoded information (Dawkins 1995). For culture, this essentially means language 
in which utterances are composed of distinct units of sound, phonemes. It is 
possible, therefore, that gene-culture coevolution was most characteristic of an 
intermediate phase in our history — as Deacon (1998) has suggested that the 
brain and language coevolved. More recently, the low genetic heritability of 
human behavioral traits combined with rapid horizontal cultural transmission 
has resulted in a predominance of the sociocultural. Thus, "the more biological 
blends into the coevolutionary blends into the more sociocultural" would be a 
modified version of the sequential hypothesis. Whatever may be the truth of this 
matter, whenever genes and culture did or do coevolve, the considerations that 
follow about that Coevolutionary process are relevant. 
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Because culture constitutes a second inheritance system in the human species 
and variably beyond it, the possibility that genes and culture interact and even 
co-evolve has been extensively discussed (for general reviews, see Blute 
1987; Janicki and Krebs 1998; Odling-Smee, Laland and Feldman 2003:246-50) 
and sometimes modelled, largely employing population-genetic type models 
(Lumsden and Wilson 1981; Boyd and Richerson 1985; Feldman and Laland 
1996 and references; Higgs 2000). Indeed, gene-culture coevolution has been 
defined by some as a branch of (theoretical) population genetics (Feldman 
and Laland 1996). Beginning with the pioneering works of Maynard-Smith and 
Price (1973) and Maynard Smith (1982b) game theory-type models have been 
applied extensively in evolutionary theory (Nowak and Sigmund 2004) and have 
been extended as well to inter-species interactions in population ecology. The 
essential difference is that population genetics attempts to model underlying 
informational structures, whether genetic or memetic, while optimization (for 
non-social situations) and game theory (for social situations) model surface 
or observable characteristics, including behavior, which are commonly called 
"strategies." The relationship between the two can be complex, because just 
as there is no necessary one-to-one relationship between thought and action, 
there is no necessary one-to-one relationship between gene and phenotypic 
characteristic. Hence, modelling informational structures model what is inherited 
but may not necessarily be a potential target of selection, while modelling surface 
or observable characteristics model what is potentially a target of selection but 
may not necessarily be inherited. Despite this dilemma, modelling surface or 
observable characteristics, what Grafen (1991) called "the phenotypic gambit," is 
widely viewed as desirable in evolutionary ecology today. This is particularly the 
case because it has been shown that results can approximate those of population 
genetic-type models even more closely than was once thought (Hammerstein 
1996; Grafen 2002; for something of a social history see Schwartz 2002). The 
approach taken here follows in that tradition, taking specifically a game theoretic 
approach to gene-culture coevolution. 

In the most notable and widely cited application of gene-culture coevolution 
theory to date, Durham (1991) described and provided illustrative case 
studies of five modes of the relationship between genes and culture — three 
“comparative modes” and two “interactive modes.” The three comparative 
modes - enhancement, neutrality and opposition, pertain to the effect of culture 
on biological fitness. Culture may increase, be neutral with respect to, or reduce 
biological fitness. For example, incest taboos are generally thought to enhance 
fitness whereas ritual cannibalism which spread the neuro-degenerative disease 
kuru among the Fore had a negative impact. Durham's interactive modes were 
"genetic mediation" — essentially genes select among cultural alternatives, and 
"cultural mediation" — essentially cultural elements select among genes. He 
considered colour vision and Tibetan fraternal polyandry possible examples of 
the former, and malaria resistance, with its side-effect of sickle cell anaemia 
(selected for by slash-and-burn agriculture) and lactose absorption beyond 
childhood (selected for by dairying cultures) possible examples of the latter. While 
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extremely interesting and widely cited, neither genetic nor cultural mediation in 
Durham's senses are “interactive” in the strict sense of the meaning of that word. 
Rather, in each case there is a unidirectional selective action — genes on culture 
or culture on genes. On the other hand, both together would be truly interactive 
and would conform more to the logic of the concept of coevolution as it is used 
in biology more widely. Since the pioneering paper of Ehrlich and Raven (1964), 
coevolution is usually defined in ways similar to the following: 


“Coevolution. The process by which members of two 
(or more) species contribute reciprocally to the forces 
of natural selection that they exert on each other, e.g. 
parasites and their hosts." (Begon, Harper and Townsend 
1996:848, italics added) 


“Coevolution is the interdependent evolution of two or more 
species having an obvious ecological interaction.” (Ricklefs 
and Miller 2000:639, italics added) 


Sometimes this strict sense is distinguished from the looser sense more akin to 
that employed by Durham: 


“Coevolution. Strictly, the joint evolution of two (or more) 
ecologically interacting species, each of which evolves in 
response to selection imposed by the other. Sometimes 
used loosely to refer to evolution of one species caused by 
its interaction with another." (Futuyma 1998:766) 


Of course in the gene-culture coevolutionary case, rather than genetic 
populations coevolving in interaction with each other, there would be one or 
more biological and one or more cultural populations coevolving. 


Gene-Culture Coevolutionary Games 


Ecologically, there are three possible types of relationships between members of 
different species and therefore three possible types of coevolution — cooperation 
or mutualism, in which both parties benefit from the interaction (a plus/plus 
relationship); competition in which both parties lose (a minus/minus relationship); 
and conflict or antagonism, in which one party benefits and the other loses (a plus/ 
minus relationship). Others have been proposed but probably represent more 
of a mathematical than a realistic possibility (e.g., commensalism, a plus/zero 
relationship) or reduce to one of the three (e.g., Batesian mimicry representing 
antagonism between members of a poisonous model and a similarly appearing 
mimetic species and Mullerian mimicry representing mutualism between 
members of species which serve simultaneously as models for and mimics of 
each other). 

Consider the simplest possible case of interactions of each of the three basic 
types then — all involving haploid, asexual biological and cultural populations. 
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There are two biological alternatives G and g and two cultural alternatives C and 
c, dominance not implied. Each pair may represent distinct kinds of individuals 
(which could in the simplest case be carriers of alternative alleles for the biological 
case and of alternative "allomemes" in Durham's terminology for the cultural case) 
or could represent a single kind of individual who behaves flexibly, conditional 
upon perceptions of what members of the other population are doing. 


Cooperation (both genes and culture are favored to match) 


If genes are G it favors culture to be C and vice versa. However, if genes are g it 
favors culture to be c and vice versa. All other things being equal, the population 
is composed of matched gene-culture types, whether GC, gc or some proportion 
of each (Table 1). 


Competition (both genes and culture are favored to unmatch) 


If genes are G it favors culture to be c and vice versa. However, if genes are g it 
favors culture to be C and vice versa. All other things being equal, the population is 
composed of unmatched gene-culture types, whether Gc, gC or some proportion 
of each. (Pay-offs in Table 1 are reversed.) 


Table 1: Pay-off Matrix for Gene-Culture Matching Game 





Notes: h = high, | = low. By convention, in each box the upper entry is the payoff to the column 
strategy given the row strategy and the lower entry is the payoff to the row strategy given the 
column strategy. In unmatching games, pay-offs are reversed. 


Conflict (one is favored to match and the other to avoid matching, two ways) 


Culture is favored to match and genes to avoid matching. If genes are G it favors 
culture to be C; if culture is C it favors genes to be g; if genes are g it favors 
culture to be c; if culture is c it favors genes to be G . Under the simplest set of 
assumptions, dynamically the populations continues to co-evolve cyclically with 
culture (metaphorically) chasing genes with a lag and genes running away, like a 
predator population tracking a prey population. (See Table 2. For biological examples 
and formal treatments see ecology texts such as Ricklefs 2000, Chapter 23.) 


Genes are favored to match and culture to avoid matching. If culture is C it favors 
genes to be G; if genes are G it favors culture to be c; if culture is c it favors 
genes to be g; if genes are g it favors culture to be C. Under the simplest set of 


156 ° Social Forces Volume 85, Number 1 * September 2006 


assumptions, dynamically the populations continue to co-evolve cyclically with 
genes chasing culture with a lag and culture running away, also like a predator 
population tracking a prey population. (Pay-offs in Table 2 are reversed.) 


Table 2: Pay-off Matrix for Gene-Culture Antagonistic Game 


C 
C 


Notes: h = high, | = low. By convention, in each box the upper entry is the pay-off to the column 
strategy given the row strategy and the lower entry is the payoff to the row strategy given the 
column strategy. As illustrated, culture is preying on genes but the reverse can also be the case. 


Hypothetical Examples 


Although the concept of coevolution has been applied to a variety of phenomena 
(for a list and references see Odling-Smee, Laland and Feldman 2003:247), life- 
history characteristics are the ideal traits to use because they are the most 
fundamental. One of the most general life history distinctions is between devoting 
resources to spending on growth here and now vs. investing in space, time or 
niche space by means of motility, maintenance or mutability. (For discussion and 
references see Blute 2003.) Human foragers acquire resources if they are plentiful 
but invest in consumption elsewhere by moving on if they have become locally 
depleted. Human agriculturalists switch to maintenance rather than motility — 
processing, preserving and storing their crop for future consumption. Members 
of industrial societies are apt to innovate under similar circumstances. 

Additionally, resources may be devoted to somatic vs. reproductive functions 
and the same spending vs. investing distinction which is applicable somatically 
is applicable reproductively, i.e, one can produce more offspring or fewer higher 
quality ones, ones better equipped to move, last or innovate. Other important 
life history distinctions include those between resources devoted naturally and 
sexually, and to other forms of the ecological vs. the social. 


Mutualism Under Vertical Transmission 


Cavalli-Sforza and Feldman (1981) introduced the terms “vertical,” “oblique” and 
“horizontal” modes of cultural transmission with the direction specified relative 
to the biological, i.e., from parents to offspring, parents to the offspring of others, 
and among (generally unrelated) members of the same generation respectively. 
By analogy from epidemiology, modes of cultural transmission are well known 
to affect the nature of gene-culture relationships. For obvious reasons, vertically 
transmitted parasites tend to evolve to be benign or relatively harmless while the 
horizontally transmitted tend to evolve to be more virulent or harmful. 
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While it has long been recognized that vertically-transmitted culture should 
evolve to be biologically adaptive, it has not always been recognized that the 
reverse is also the case. Under vertical cultural transmission, genes should 
similarly evolve to adapt themselves to culture. In a game theoretic approach 
this implies mutualism with both genes and culture evolving to match. If G 
represents a propensity to spend and g to invest (as above) whether somatically 
or reproductively, and C and c represent similar cultural norms — "just do it" vs. 
prudently providing for the future — then under vertical transmission it is likely 
that genetic and cultural populations would exert selective pressures on each 
other towards matching. Cultural norms celebrating the propensity to live in 
the present, or alternatively, to be future oriented, norms embodied in sayings, 
myths, songs, etc. would find it easier to spread by social learning in a population 
similarly genetically predisposed. (For an overview of the heritability of personality 
traits, see Ridley 2003, Chapter 3.) 

As Lumsden and Wilson (1981) emphasized, genes can act as a selective 
force, favoring the learning of, preference for or utilization of certain cultural 
traits over others. As Durham (1991) emphasized however, the reverse is also 
the case. Imagine the fate of genes trying to swim against the cultural tide so- 
to-speak. Imagine a reproductively spending (i.e., quantity-oriented "pro-natal") 
pre-demographic transition society which is relatively egalitarian. It spreads 
the available food, housing, etc. across the members of large families. In such 
circumstances, a genetic mutant predisposing one towards offspring quality 
would be unlikely to fare well. Families with few offspring in such a society 
would simply be foregoing much of their share of societal resources. In a post- 
demographic transition society with more inequality, however, the reverse would 
be the case. A genetic mutant, going against the cultural tide by having a large 
family, would be unlikely to produce any offspring of high enough quality to 
compete for their share of societal resources with their peers from small families, 
with their lavishly cared for one or two children. 

Culture can act as a selective force, favoring some genes over others. Hence 
under vertical cultural transmission, mutualism and therefore matching are 
likely. If spending strategies are common biologically, it favors cultural spending 
strategies, and if cultural spending strategies are common, it favors biological 
spending strategies. However, if investing strategies are common biologically, it 
favors cultural investing strategies, and if cultural investing strategies are common, 
it favors biological investing strategies. It is worth noting that it is theoretically 
possible for cooperation to be based on specialization and exchange rather than 
economies of scale in which case "matching" would imply doing the opposite. 
For example, in the many bird species in which males contribute to parental care, 
there is a biological division of labor between guarding and feeding, commonly 
based on turn taking. However | am not aware of any such relationship between 
genes and culture. 


Competition Under Oblique Transmission 


If vertical cultural transmission normally gives rise to a mutualistic situation in 
which genes and culture are favored to match, what of oblique transmission? 
It is often mentioned as a kind of halfway house between the vertical and the 
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horizontal but not carefully analyzed. To understand the consequences of pure 
oblique transmission, realize that if transmission is oblique for ego it is oblique 
for alter. Imagine a kinship system which has never existed. Two clans or lineages 
are endogamous (i.e., they marry-in, wolves marry wolves and bears marry 
bears). However cultural transmission is oblique, so that the children of wolves 
acquire the social identity of bears and children of bears, that of wolves. Under 
these conditions, genetic and culture allocations of effort would compete with 
each other so that effort allocated to spending (i.e., on bearing biological and 
on socializing cultural offspring) compete with each other and effort allocated 
to investing (i.e., to caring for biological offspring and to grooming high quality 
cultural. descendants) compete with each other. Genes and culture would be 
favored to unmatch. The competition engendered between genes and culture is 
probably the reason why such a system has not coevolved. 

A more typical kinship system is one in which lineages or clans are exogamous 
— wolves marry bears and bears marry wolves. Unilineal descent is common, 
the patrilineal more so than the matrilineal, and even in systems with bilineal 
descent, matri and patrilineal descent are commonly both present in a sociological 
sense, distinguished by function. In a simple system of marriage exchange 
with patrilineal descent, male wolves marry female bears and their children are 
wolves while male bears marry female wolves and their children are bears. Such 
systems are a balance of vertical and oblique transmission and hence of gene- ` 
culture cooperation and competition. Males inherit the social identities of their 
biological fathers but females inherit the social identities, not of their biological 
mothers, but of their father's sisters. Male-limited genes and culture (i.e., those 
expressed in males) are favored to match, while female-limited genes and culture 
are favored to unmatch. 

In matrilineal systems, females inherit the social identity of their biological 
mothers and are favored to match, but males inherit that of their mother's brothers 
and are favored to unmatch. This balance of cooperation and competition (overall, 
not within genders) surely has something to do with why such systems have 
coevolved although power differences are required to explain why the patrilineal 
system in which male-limited genes and culture cooperate is so common, while 
the matrilineal one is less so. While such systems may seem artificially simple, 
they are far from being irrelevant. 

The great French anthropologist, Claude Levi-Strauss (1969) believed such 
simple systems of direct marriage exchange were the original form of marriage, 
and they have left their marks on systems of kinship semantics (for naming and 
addressing relatives) in many languages around the world. In such systems 
"parallel" relatives (e.g., mother's sister and father's brother) and other same- 
side clan mates are grouped together with the narrowly lineal (e.g., mother and 
father) and "cross" relatives (e.g., mother's brother and father's sister) and other 
other-side clan mates are grouped together, and similarly for other generations 
- all of which would make sense if they were, or had in the past, practiced direct 
marriage exchange. Some systems of kinship semantics are simpler than these 
"Iroquois" systems and some more complex (for an overview see Fox 1967, 
1983), but the latter typically can be seen as complications built on (i.e., evolved 
from) such a system. 
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Antagonism Under Horizontal Transmission 


One kind of antagonistic gene-culture coevolution which is increasingly well 
understood takes place with the culture in one species, ours and the genes in 
others. Palumbi (2001) surveyed how evolution in cultural practices — in medicine, 
agriculture and fishing — have caused explosive genetic evolution in a host of 
other species, including antibiotic resistance in bacteria, resistance to herbicides 
and pesticides in agriculture, and alterations in the life histories of fish species. 
Darwin himself recognized this in domesticated animals, a phenomenon he 
called "artificial" selection, which was one of the sources of inspiration for his 
theory of "natural" selection. It took a while to recognize these as coevolutionary 
processes, however. There is nothing artificial about it when the cultural practices 
are also evolving. 

Taking only large individuals, thus giving the young a chance to mature and 
presumably breed, was for long thought to be a sound fish and crustacean 
harvesting strategy. For a while, the predominance which developed of small 
individuals in catches of some species was thought to be simply a result of 
overharvesting, until it was realized that in some cases, populations subject 
to such harvesting had actually responded by evolving genetically to unmatch 
the imposed size-selective mortality — switching their life history to grow more 
slowly and mature at a smaller size. It was not that only small individuals were 
left, but that cultural practices were actually causing them to evolve to be small. 
Just as genes can respond by unmatching with respect to antagonistic culture, 
culture can respond by unmatching with respect to antagonistic genes. The 
problem of antibiotic resistance in bacteria, particularly in hospitals but also in 
community settings, has favored a concerted medical movement in industrialized 
countries away from indiscriminate and towards more discriminating prescribing 
practices. 

Antagonistic gene-culture coevolution does not justtake place between species, 
but also within the human species. Just as it is well known epidemiologically that 
horizontally transmitted parasites tend towards virulence, horizontally-transmitted 
culture is liable to do so as well. While some are not fond of the parasite analogy 
for horizontally-transmitted culture, in many respects it is very apt. Viruses, being 
unable to metabolize independently, like culture, had to have an origin in some 
organism(s). But once infectious, we do not argue that their hosts evolve so as to 
harbor them. Rather, the two are involved in a coevolutionary arms race. Boyd and 
Richerson (1985) briefly considered the possibility of an arms race between genes 
and culture (191, 194-7). Some, but not all memetic theorists (those who employ 
the term "meme," analogous to gene, for the basic units of cultural transmission 
or replication) have emphasized the likelihood of horizontally-transmitted culture 
parasitizing genes (compare Dennett 1995; Brodie 1996; Lynch 1996; Blackmore 
1999; Aunger 2000, 2002; and Shennan 2002). 

To take an extreme example, the "memplexes" that characterize membership 
in sects or cults, ruthlessly exploit the biology of their members, convincing them 
with mythical appeals to "family" to isolate themselves from their real families 
and dedicate their all to the survival and spread of the social identity of cult 
membership. As parasites, they would be favored to match (i.e., to track genetic 
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strategies), but as hosts, genetic alternatives would also be favored to avoid 
such matching. Hypothetically, with the majority genetically predisposed toward 
somatic functions, it would pay cultural parasites to target them, to solicit material 
donations. However, if genes are being drained of resources in that way, it would 
pay them to devote their resources to reproductive functions. And if genes are 
doing that, then it would pay culture to target those functions instead — appealing 
for time and energy to be spent on recruitment, for example. 

There is little doubt that much culture in human societies is transmitted 
horizontally (Harris 1998). Similarly, there is little doubt that much of it can and 
does parasitize genes; think not only of the way proselytizing religions create 
mythical families but the way the processed food industry exploits our taste for 
nutritious, ripe fruit by providing the taste without the nutrition. There is an issue, 
however, about whether genes are capable of responding and hence whether 
full coevolution actually takes place. The contemporary obesity epidemic, for 
example, suggests genes are helpless against the onslaught. This issue requires 
a consideration of several points. 

First, the common assumption that cultural evolution necessarily proceeds 
faster than the biological is mistaken as Hull (2001), Richards (2002) and others 
have pointed out. Technically, according to Fisher's fundamental theorem 
of natural selection, the rate of evolution is equal to the genetic variance of 
fitness (Bell :1997:72-3). Hence if one or the other population varies more, or 
its characteristics are more heritable,or have more of an impact on fitness, that * 
population will evolve faster. At the same time, however, there is a basis for the 
common intuition that the generation time is relevant, because most innovation 
takes place at the time of transmission. With vertical and even oblique cultural 
transmission however, genetic and cultural generation times are necessarily equal, 
and all else being equal, rates of genetic and cultural evolution are necessarily 
identical. 

With horizontal transmission, on the other hand, the generation time of the 
culturally transmitted can be very short relative to the biological (e.g., fads and 
fashions), but not necessarily (e.9., new horizontally acquired religious or national 
identities can last a lifetime). Hence depending upon the explicit or implicit 
assumptions being made about rates of innovation and recombination, the 
accuracy of transmission, the effect on biological and sociocultural fitness and 
relative generation times, some have concluded that such coevolution is possible 
or even likely, while others tend to assume that most culture is biologically 
adaptive whether to current conditions (human evolutionary ecology) or to past 
ones (evolutionary psychology). Examples of those who have concluded that it is 
possible include the following. Burley (1979) suggested that cryptic ovulation in 
humans evolved as a genetic defense against the culture spread of the desire to 
have fewer children among women. Blute (1987) suggested that cultural parasites 
could equally well play the role that biological ones do in Hamilton's theory of the 
maintenance of sex. Kumm, Laland and Feldman (1994) suggested that cultural 
sex-biasing practices in places like China and India which do not change the 
number of offspring produced may evoke a genetic sex-ratio response in the 
opposite direction. (See also Laland, Kumm and Feldman 1995; Kumm and 
Feldman 1997). The issues involved in such questions are complex and obviously 
merit further research. 
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| have emphasized different modes of cultural transmission to illustrate all three 
modes of gene-culture coevolution because by analogy from epidemiology, they 
are well known to affect the nature of gene-culture relationships. (For a summary, 
see Table 3.) In light of such considerations, Williams went so far as to define 
conventional genetics as "that branch of epidemiology that deals with infectious 
elements transmitted exclusively from parent to offspring." (1992:15) There are 
obviously other factors influencing gene-culture relationships beyond the mode 
of cultural transmission and continuing to derive inspiration from epidemiology 
can suggest what some of them are. Parasites that infect a host singly or at a 
low frequency tend to be more benign, while those that infect multiply or at a 
high frequency tend to be more virulent. This is because with a low multiplicity 
of infection, any restraint shown today by a parasite will benefit them tomorrow, 
whereas with a high multiplicity of infection, that is not the case. By analogy, 
some social identities like national, religious and occupational identities which 
tend to preclude others should tend to evolve to be more benign while other 
cultural elements like fads and fashions which can infect one multiply should 
tend to evolve to be more virulent. 

Biological parasites transmitted by direct contact tend to be more benign than 
those which are vector borne such as by insects (Ewald 2000). This is because 
the former need to leave their victims in good enough condition to get about 
to have themselves transmitted. By analogy, cultural information conveyed 
personally should tend to be more benign than that transmitted by via media 
which, like insect-borne diseases, can get to victims even when they are down 
and unable to circulate. In summary, culture transmitted horizontally, en mass 
and via media, should tend to be more virulent and hence potentially evoke gene- 
culture antagonistic coevolution. 


Table 3: Typical Consequences of the Mode of Cultural Transmission for Gene-Culture 
Coevolution 





Mode of Cultural Gene-Culture Relationship Genes and Culture are 
Transmission Favored to: 
Vertical Mutualistic or Cooperative Match 
Oblique Competitive Unmatch 
Horizontal Antagonistic One Match, the Other 
Unmatch (two ways) 
Complications 


The simple games described can be made endlessly complex if qualifications 
such as "all other things being equal" and "underthe simplest set of assumptions" 
do not hold. Most obviously, more than two alternatives could be present in 
either or both populations. To say that at equilibrium a cooperative gene-culture 
population is composed of matched gene-culture types or that a competitive one 
is composed of unmatched ones does not specify in what proportions. Factors 
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that can influence outcomes include differing energetic costs, ecological benefits 
or quality of information. All other things being equal, equilibrium frequencies 
should be directly proportional to ecological benefits and quality of information 
and inversely proportional to relative energy costs. 

For example, spending strategies are more ecologically beneficial under low 
densities (i.e., uncrowded circumstances) relative to resources and investing 
ones under high. If we predicted equal frequencies at equilibrium, we would 
be implicitly assuming intermediate densities, but if that does not obtain then 
frequencies should be directly proportional to their relative ecological benefits, 
as well as being influenced by the social interactions taken into account in the 
game theoretical approach. 

The concept of trade-offs is fundamental to life history theory. Energy allocated 
one way cannot be allocated another way and vice versa. Evidence of a negative 
correlation is evidence of a trade-off but the absence of such evidence is not 
definitive evidence that there is not a trade-off. The simple games assume that 
the information underlying the biological or cultural alternatives is of similar 
quality, which is not necessarily the case. Some individuals and populations 
may simply be able to do more of both of whatever. The fundamental point is 
the favoring of matched or unmatched gene-culture types respectively, not the 
particular frequencies of the two kinds in either case. 

Evolutionary models, whether of the population genetics or game theory type, ` 
generally apply whether we are talking about changes through time in a single 
population or differences between populations which have been subject to a 
different history of selection. This is why phrasing such as “at” (for the former) 
or “with a history of” (for the latter) selection favors, etc. is commonly used. 
However, the simple games assume deterministic situations. More complex 
cases in which sampling error is present ideally require a stochastic treatment. 
In some cases, initial conditions can make a dramatic difference. For example, 
chaotic rather than simple cyclical dynamics can emerge in the antagonistic 
case. In short, modelling the details of particular situations requires estimates 
of circumstances and parameters that go well beyond the basics of the games 
summarized here. At the same time, this in no way detracts from the fundamental 
theoretical point that if a game theoretic as opposed to a population genetic- 
style approach is to be taken and if gene-culture coevolution is seriously under 
consideration, then the three basic kinds of interaction described are the starting 
point from which any such effort needs to begin. 


Conclusion 


Lumsden and Wilson (1981) first applied the term “coevolution” to the 
interaction between genes and culture. Boyd and Richerson, in exploring the 
population genetics of “dual inheritance”, largely viewed culture as biologically 
adaptive but briefly raised the possibility of an arms race between genes and 
culture (1985:191,194-7). Durham (1991) provided illustrative case studies of five 
modes of the relationship between genes and culture. Most recently, Richerson 
and Boyd have described coevolving pools of cultural and genetic information 
as engaged in a “swirling waltz” (2005:192). These works were pathbreaking. 
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However, none fully conceptualized gene-culture coevolution in the way that 
would seem most natural in population ecology - as interspecific mutualism, 
competition and antagonism. The three simple games (matching, unmatching and 
one matching with the other unmatching) provide insight into the fundamental 
kinds of relationships that are possible between genes and culture, a framework 
within which their interaction can be understood, and hence a base on which 
further theoretical and empirical efforts can be built. 
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Abstract 

Popular perceptions of the effect of testosterone on “manly” behavior are inaccurate. We 
need to move away from such simplistic notions by treating testosterone as one component 
along with other physiological, psychological and sociological variables in interactive and 
reciprocal models of behavior. Several hormones can now be measured in saliva, removing 
the need for blood samples. Conceptual shifts have moved research from biological 
determinism to biosocial models in which the social environment plays a key role in 
understanding behavior-hormones associations. As a result, more social scientists are 
incorporating testosterone in their studies. Following a primer on testosterone, we describe 
testosterone's link to (a) gaining, maintaining and losing social status, (b) aggression and 
antisocial behavior, (c) peer and family relationships, and (d) gender similarities and 
differences. Research needed to take us to the next level of understanding is outlined. 


"Testosterone poisoning," now part of the language, is a popular explanation for 
excessive "manly" behaviors such as boasting, violence and pugnaciousness. 
Middle-aged and older men think sagging libido and fatigue stem from their 
declining testosterone. 

In fact, there is little empirical support for these popular assertions. We cannot 
say that they are all false because research literature is not conclusive. But it is 
already clear that there is no simple one-to-one relationship between testosterone 
and machoism or aggressiveness or sexuality (Mazur and Booth 1998). It seems 
wiser to view testosterone as one component in a confluence of interacting 
physiological, psychological and social influences that affect behavior. 

The focus of this research is variables of interest to sociologists. Gaining, 
maintaining and losing status is a theme found in a great deal of sociological 
research. Many testosterone studies have focused on status change under the 
rubric of competition. Sociologists who focus on deviance and social integration 
will be interested in testosterone studies that focus on problem behavior, and 
the nature and quality of peer and family relations. Those who work in the area of 
health and social behavior will find studies of testosterone's link to depression. 
Those who study gendered behavior will be interested in the differences and 
symmetries in the links between testosterone and social behavior for males and 
females. Finally, sociologists will be interested to learn that the social environment 
has a key role in defining testosterone-behavioral links. 


This article is dedicated to the memory of James M. Dabbs, a pioneer in the field of testos- 
terone research who greatly enriched our knowledge methodologically and substantively. 
Direct correspondence to Alan Booth, Department of Sociology, 211 Oswald Tower, Penn- 
sylvania State University, University Park, PA 16801, 814-863-1141. E-mail: AXB24@ 
psu.edu. 


© The University of North Carolina Press Social Forces, Volume 85, Number 1, September 2006 


168 * Social Forces Volume 85, Number 1 * September 2006 


Following a primer on testosterone, we review the reciprocal linkage 
between testosterone and competition, deviant or norm-breaking behavior, and 
depression. We then take up the hormone's role in peer and family relationships 
and gendered behavior. Throughout, we stress the need for reciprocal biosocial 
models that take account of psychological variables and social influences. Finally, 
we suggest promising avenues for future research. 


Primer on Testosterone 


Hormones are chemical messengers that regulate, integrate and control 
bodily functions. The endocrine system produces several hundred hormones 
in response to nerve and chemical signals and to perceptual cues, which 
interact with each other and with the nervous system. Hormones regulate 
short-term processes, such as the nearly immediate responses to an external 
threat, and longer-term processes, such as sex differentiation, maturation and 
reproduction. Males produce testosterone mainly in the Leydig cells of the 
testes. Women also produce testosterone, but in smaller amounts, by converting 
dehydroepiandrosterone (DHEA) produced in the adrenal glands. Testosterone 
is one of several androgenic hormones, so called because they produce male 
characteristics. Baseline testosterone has fairly high heritability. The majority of 
estimates are between 50 percent and 70 percent with a mean of 60 percent ` 
(e.g., Gagnon et al. 2001; Harris and Vernon et al. 1998). 

The human fetus, whether genetically male (XY) or female (XX), is otherwise 
sexually undifferentiated in the first weeks of life. Testosterone secreted by newly 
forming testes of an XY fetus causes its body and brain to become masculinized. 
Lacking testosterone, a fetus develops as a female and when born looks like a 
female infant. XY fetuses that abnormally fail to produce testosterone may be 
mistaken for girls at birth. Research on non-humans convincingly shows that 
genetic females exhibit male features if exposed to high testosterone in utero and 
genetic males show female features if deprived of the hormone (Breedlove 1992). 


During the preschool and early elementary school years testosterone levels 
are low in boys and girls. During adolescence testosterone production surges, on 
average 10-fold in males and two- or three-fold in females (see Figure 1). These 
changes are implicated in the emergence of secondary sexual characteristics that 
include ripening sexual apparatus, male musculature, body hair and deepening 
voice. Fully adequate male sexual behavior requires a minimal amount of 
testosterone, but beyond that, variations in hormone level are not reliably related 
to sexuality (or to homosexuality). 

Testosterone levels reach a peak in young adult males. Men who maintain a 
constant body weight also maintain their testosterone levels as they age. More 
typically in American society, men add weight and lose testosterone as they 
grow older. Older men often maintain age-appropriate sexual behavior (assuming 
good health and partner availability) even at very low testosterone levels. If male 
testosterone is exceedingly low, therapeutic testosterone supplementation can 
improve sexuality, increase muscle mass and reduce body fat. 
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Figure 1. Testosterone by Age and Sex 
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Much of our knowledge is based on studies of men and boys because 
males produce far more testosterone than females, so the hormone is easier to 
measure. Also, the hormone's effects are clearer in males than females. We will 
review the gender consistencies and inconsistencies in behavior-testosterone 
research in a subsequent section. 

After childhood, testosterone is diurnal, being highest in the morning and 
declining across the afternoon and evening. Possibly diurnal variation is related 
to patterns of problem or psychopathological behavior (Granger, Shirtcliff, 
Zahn-Waxler, Usher, Klimes-Dougan and Hastings 2003). Controlling for diurnal 
variation, testosterone levels are highly correlated within individuals across 
hours, days and years (Granger, Shirtcliff, Booth, Kivlighan and Schwartz 2004). 
This fact has led many investigators to assume that testosterone level is a trait 
that explains differences between individuals with respect to aggression and 
violence. However, there is little empirical support for this position. 

During the past two decades, the improving ability to measure testosterone 
in saliva (rather than requiring blood samples) has created opportunities to test 
biosocial models in naturalistic settings (Dabbs 1993; Granger, Schwartz, Booth, 
Curran and Zakaria, 1999; but see Granger et al. 2004). 

A series of conceptual shifts has placed new emphasis on the contributions 
of both nature and nurture to individual development (Magnusson and Cairns 
1996: Plomin and Rutter 1998; Rutter et al. 1997). Leading this paradigm shift 
are social scientists who are testing “biosocial” alternatives to traditional models 
of individual differences and intra-individual change in behavior (e.g., Booth, 
Johnson, Granger, Crouter and McHale, 2003; Foshee et al., forthcoming; Udry 
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1988). Individual differences in behavior are considered to be interactive products 
of genetic, environmental and physiological processes over time (Cicchetti and 
Dawson 2002). 

The nature of the link between testosterone and behavior is not simply a 
biological "cause-and-effect" mechanism. It is better described as a bi-directional 
relationship that is highly dependent on intrinsic individual differences in social 
perception, previous experience and propensity for specific behavior, as well as 
the demand or "press" of the social context for particular behaviors (Sapolsky 
1997). We do not assume that testosterone is a mechanism in and of itself that 
causes or creates behavior. Instead, we assume that testosterone increases 
the likelihood that certain behaviors will be expressed, if the propensity for 
that behavior already exists, and the expression of that behavior is consistent 
with social contextual demands. Characteristics of the social landscape permit, 
stimulate, suppress or set the stage for the expression of specific testosterone- 
behavior relationships. 

One important aspect of the social landscape that affects testosterone 
production is stress. A number of studies have shown that extended periods 
of stress suppress testosterone production. For example, testosterone levels 
were low during the early (and degrading) weeks in Officer Candidate School, 
but returned to normal just before graduation, a time when achievements are ` 
recognized (Kreuz, Rose and Jennings 1972). Testosterone also drops right after 
incarceration (Thompson, Dabbs and Frady 1990). Soon after the Americans who 
had been held hostage at the American Embassy in Iran were freed, testosterone 
increased as a result of their newly gained status as free individuals (Rahe, Karson, 
Howard, Fubin and Poland 1990). Research indicates that elevations in cortisol 
(a stress hormone) lower testosterone, and declines in cortisol are followed by 
a rise in testosterone. Research focused on behavior-testosterone associations 
should incorporate environmental sources of stress as a potentially important 
player in such studies. 


Testosterone, Dominance, Aggression and Related Behaviors 


The word most often associated with testosterone in everyday parlance is 
aggression. Although in some species (e.g., rats) higher testosterone is associated 
with aggression in everyday encounters (Monaghan and Glickman 1992), in 
humans (and other primates) higher testosterone is association with the type of 
dominance that sometimes entails aggression. In our view, aggression is behavior 
intended to inflict physical injury on another individual. Dominance is behavior 
intended to gain or maintain status. Status confers influence, power or valued 
resources (Mazur and Booth 1998). The vast majority of dominance episodes do 
not involve the intent to physically harm others. Furthermore, testosterone levels 
associated with dominance behavior that entails the intent to injure others appear 
to be no higher than testosterone levels associated with dominance behavior that 
does not entail physical harm. For example, there is no significant difference in 
the testosterone levels of socially dominant but non-aggressive prisoners and of 
aggressive prisoners (Ehrenkranz, Bliss and Sheard 1974). 
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Studies used as evidence that higher testosterone associated with pubertal 
development is linked to aggression in youth are not clear cut. Two studies by 
Olweus and his colleagues (1980, 1988) and another by Mattsson, Schalling, 
Olweus, Low and Svensson (1980) are often interpreted as evidence of a 
testosterone-aggression link. But their data do not support such a conclusion. 
A comparison of samples of institutionalized delinquent boys and non- 
delinquent high school students revealed that testosterone was slightly higher 
in the delinquent sample, but the difference was not statistically significant. The 
delinquent sample of youth who were incarcerated for violent crimes had slightly 
higher testosterone levels than those institutionalized for non-violent offenses, 
but again the difference was not statistically significant. Staff and professional 
evaluations of aggressiveness were also unrelated to testosterone levels. 

Testosterone-related differences in aggression in the non-delinquent sample 
were studied as well. None were statistically significant. The only difference 
manifested was that adolescents with higher testosterone were more likely to 
respond more vigorously in response to challenges from teachers and peers. 
The vigorous response finding is consistent with our assertion that testosterone 
is linked with aggression only when it is part of dominance behavior. 

Another study pertinent to the link among testosterone, dominance and 
aggression is a longitudinal study of boys investigated when they entered 
kindergarten and thereafter until 13 years of age (Tremblay, Schaal, Boulerice, 
Arseneauth, Sousssignan and Perusse 1998; Tremblay, Schaal, Boulerice, 
Arseneault, Soussignan, Paquette and Laurent 1998; Tremblay 2000). Using 
measures that incorporated self, peer and teacher ratings, Tremblay and his 
colleagues discovered that testosterone levels at the start of puberty were linked 
to social dominance a year later but not to physical aggression. Dominance was 
not related to current aggression or aggression over the previous three years. 
On the other hand, body mass was a predictor of physical aggression. The 
extant research suggests that the rise in testosterone associated with pubertal 
development does not lead directly to aggressive behavior. 

Another term often associated with testosterone is antisocial behavior. We 
view this as another form of dominance behavior that occurs in settings where 
authority figures require behavior to conform closely to rigid standards such 
as in schools, prisons, the military and sometimes families. Individuals who 
are predisposed to dominate behavior but hold subordinate positions in such 
organizations are likely to break the rules in order to prevail over the constraining 
environment. Udry and colleagues observed an association between norm- 
violating behavior and testosterone (Udry 1988, 1990). Much of such behavior is 
expressed through vandalism and status violations rather than aggression (Rutter, 
Giller and Hagell 1998). Such rebellious behavior is termed antisocial behavior by 
those in authority and those who study such behavior. 

In addition to aggression and antisocial acts, two other types of behavior 
are often associated with testosterone. One is risky behavior, and the other is 
initiative. Both are important in achieving and maintaining status. Risk-taking 
behavior refers to acts that at the same time increase the chances of success and 
the odds of failure in maintaining or increasing status. Daltzman and Zuckerman 
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(1980) were among the first to observe a link between sensation seeking and 
testosterone. Recent studies have linked testosterone-related risk behavior to 
health deficits (Booth et al. 19994). 

Initiative is less often associated with the testosterone-dominance link, but is 
an important factor in understanding testosterone-dominance links. A dominate 
act requires initiative to challenge someone else's status or to protect one's own 
status. As the same time a testosterone-related initiative is necessary to build 
friendship networks, marriages and other close relationships where dominance 
behavior occurs less often, but is still crucial to the operation of the group. 

Although research provides considerable evidence that testosterone is 
associated with dominant behavior, correlation does not prove causation. If 
the administration of testosterone was followed by an increase in dominant 
behavior, we would have a stronger case for asserting a causal relationship. 
Two experiments support the idea that the link is causal. In one study with a 
double-blind, randomized, crossover design, young men were given doses of 
testosterone or a placebo. Subjects were paired with a fictitious subject and told 
that each member of the pair could, by pushing a button, reduce the cash flowing 
to the other member. The subject was told that the other individual was reducing 
the cash that was flowing to the subject. Subjects receiving testosterone rather 
than the placebo pushed the button significantly more times (Kouri, Lukas, Pope 
and Oliva 1995). A second study with the same design was conducted with ` 
men aged 20 to 50 years (Pope, Kouri and Hudson 2000). This time testosterone 
was administered over a six-week period. Subjects participated in the same 
experiment. Results indicated that those who had had the treatment pushed the 
button many more times. These studies put us in a much stronger position to 
claim that testosterone stimulates dominant behavior. 


Testosterone and Status Hierarchies: What We Learn from Competition Studies? 


Dominance hierarchies occur only in face-to-face groups where members 
interact with one another. (See Mazur 2005 for a detailed development of the 
dominance hierarchies concept.) They may be as small as a dyad or consist 
of 100 or so members. They persist over time and are characterized by the 
unequal ranking of members in terms of power, influence and access to valued 
resources and prerogatives. High ranking members influence others in the group 
and subordinate individuals have little influence and limited resources and are 
constrained in their choices. Members know where they fit into the hierarchy 
and regard it as legitimate. Rankings change as youth mature and those in power 
pass their prime, membership changes, or environmental challenges alter the 
needs of the group. 

Virtually everyone lives and works in groups that have dominance hierarchies 
whether they are families, peer groups, work groups, civic Organizations, 
religious groups or play groups. Large organizations such as corporations 
and governments are overlain with many face-to-face groups, each operating 
somewhat autonomously. Within every hierarchy there are shifts in the status 
ranking over time. Status changes are preceded by competition. 


Testosterone and Social Behavior ° 173 


Face-to-face status competition often occurs during polite conversation, 
though it may additionally include assertive eye contact or posturing, orthe display 
of culturally elaborated status symbols, or rarely, argumentative and physical 
tussling. When these competitions occur, each adversary attempts to out-stress 
his opponent. This is often done without violence or overt aggressiveness, and 
indeed may occur so gently that the participants and other observers are barely 
aware of the competition. Generally, after one or a series of such encounters, 
one adversary concedes higher rank to the other, thus relieving the tension of 
the contest and establishing relative rankings in the group. Based on studies 
of change in status in competitive events, testosterone would appear to play a 
fundamental role in the face-to-face competition at least in males (Mazur 1985). 

Sporting events are excellent venues for studying the hormonal correlates 
of face-to-face dominance episodes because they are highly organized and 
monitored, the rules are clear, and measures of performance, such as winning and 
losing, are well defined. Research has focused on three phases of competition: 
pre-event, event and post-event. In general, men's testosterone levels increase 
prior to competition and rise during the contest itself. Afterward, testosterone 
levels of winners often remain higher than the testosterone of losers. 

These results come mostly from studies of male athletes (e.g., Mazur and 
Booth 1998). There are only three studies of women, each suggesting a different 
profile than is found in men (Bateup, Booth, Shirtcliff and Granger 2002; Kivilighan, 
Granger and Booth 2005; Mazur, Susman and Edelbrock 1997). Another caveat is 
that the vast majority of studies have focused on experienced competitors (e.g., 
Bateup et al. 2002; Booth et al. 1989; Elias 1981; Mazur, Booth and Dabbs 1992; 
Salvador et al. 1999). In the only study comparing varsity with novice competitors, 
testosterone behavior patterns were different for novices than for experienced 
players (Kivlighan et al. 2005). 

The pre-event rise in testosterone possibly improves psychomotor function 
and coordination (Herrmann and Beach 1976), and mental activity (Herrmann, 
McDonald and Bozak 1976; Klaiber et al. 1971), and makes people more willing 
to use risky strategies (Daltzman and Zuckerman 1980), although none of 
these effects are conclusively demonstrated. Men appear to adjust their pre- 
competition rise in testosterone to the perceived strength of the opponent and 
the importance of the event. The greater the challenge, the greater the hormone 
increase (Booth et al. 1989; Gonzalez-Bono et al. 1999; Mazur, Booth and Dabbs 
1992: Mazur and Lamb 1980; Salvador et al. 1987; Neave and Wolfson 2003). 

The link between testosterone and performance during an event has received 
very little attention because of the difficulty in collecting saliva samples and 
other measures during the event itself. Studies have been limited to studying 
the associations between the change in testosterone (typically a rise) during the 
event and performance as measured after the event. One study of male rowers 
suggests that elevation in testosterone during the event is associated with 
superior performance, as indicated by the amount of time it takes to row 2000 
meters (Kivlighan et al. 2005). 

Postevent research indicates that winners have higher testosterone than 
losers. Not all studies show this effect (e.g., Suay et al. 1999), but it has been 
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replicated several times, not only in physically vigorous competitions but in chess 
competitions (Mazur, Booth and Dabbs 1992). The extent of post-match rise in 
the hormone seems to depend on the importance of the win and the participants’ 
evaluations of their own performances (e.g., Booth et al. 1989; Elias 1981; Mazur 
and Lamb 1980). When male competitors believe they have won by luck rather 
than personal effort, there is little if any post win rise in testosterone (McCaul, 
Gladue and Joppa 1992). 

As indicated at the outset, we propose that bio-social models are reciprocal 
where testosterone affects dominance, and also that changes in dominance 
change behavior (or social status), which changes testosterone levels as 
demonstrated in studies of primates (Rose, Bernstein and Gordon 1975). It is 
theoretically tempting to presume bi-directional effects in humans: testosterone 
increases competitiveness, and competitiveness increases testosterone. 
The studies noted above indicate that winning a competition increases one's 
testosterone, which increases the likelihood of engaging in and winning more 
competitions, which increases or maintains high testosterone, and so on. Losing 
a competition would depress testosterone, which would encourage avoidance 
of further competition. This would neatly account for the self-reinforcing effects 
of winning and losing streaks. However, the evidence is weak and additional 
research is needed to reach a firm conclusion about reciprocity. A 

How are the testosterone changes noted in formal competition related 
to changes in other face-to-face groups? We expect that any face-to-face 
negotiation that potentially entails an important change in ranking would involve 
changes in testosterone. If there was sufficient time (20 minutes or more prior to 
negotiation), an anticipatory rise in testosterone would occur. During negotiations 
testosterone increases in those who think they have a chance of gaining or 
maintaining status, and it decreases among those who think they are about to 
lose. At the conclusion of the negotiation the winner would experience a rise in 
testosterone, whereas the loser would experience a decline that could affect 
performance at subsequent dominance episodes. 

For future research, perhaps the most important question is the extent to 
which the competition findings apply to dominance episodes occurring outside 
short term sporting events such as negotiating a commercial contract, engaging 
in a political campaign, and being a party to litigation, as well as to family and peer 
interaction. If so, we need to continue to explore gender differences and the links 
between testosterone and the amount of experience and skill that participants 
bring to a dominance episode. Skill may lead to confidence that may signal a 
rise in testosterone (Mazur and Booth 1998). We suggest future research should 
also include culture and individual development (when studies focus on youth). 
When dominance episodes entail groups of individuals the rank of the individual 
members within the group and extent of bonding between members are also 
important considerations. Individual feelings of efficaciousness and social skills 
may also advance or impede the testosterone production. 

For dominance episodes that last for hours, days or weeks, we need to 
understand more about how the ebb and flow of successful performance is linked 
to testosterone production. To date, research has focused on one competitor in a 
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dominance episode but not the opponent. It would be useful to have information 
about the opponent's testosterone throughout the event, the opponents' self 
evaluation, and his/her evaluation of the opponent. Given the intensity of the 
competitor-opponent relationship, information on the opponent is bound to 
increase our understanding of the dynamics of dominance episodes. 


Problem Behavior and Psychopathology 


Unlike the research on competition, which centers on testosterone changes taking 
place over a few minutes or hours, much of the research on problem behavior 
(as well as on peer and family relationships) is cross-sectional and sometimes 
involves reports that cover much longer periods of time (weeks, months and 
years). We also switch to considering baseline testosterone, the characteristics 
level of the hormone that we have upon arising each morning. Research on the link 
between testosterone and problem behavior includes adult crime and depression 
as well as children's internalizing (depression) and externalizing (risky behavior). 
In keeping with contemporary thinking about hormones and the interests of 
sociologists, we rely heavily on research using the bio-social model, studies that 
examine the role of the social environment as moderating or mediating the link 
between testosterone and behavior. 


Crime 


Men with higher levels of testosterone are more likely to be arrested for offenses 
other than traffic violations, to buy and sell stolen property, incur bad debts and 
use a weapon in fights (Booth and Osgood 1993). Individuals with testosterone 
more than one standard deviation above the mean were 28 percent more likely to 
engage in criminal behavior than those one standard deviation below the mean. 
There is no evidence of a threshold of testosterone where the propensity to commit 
crime increases at a higher rate. The association between testosterone and crime 
is explained in part by a record of juvenile delinquency. The remainder of the 
testosterone-crime association is accounted for by participation in conventional 
social roles. Conventional social roles include holding a steady job, being married 
and belonging to community organizations. The greater the testosterone, the 
more important social roles were to preventing criminal activities. 


Depression 


The largest study of the link between testosterone and depression revealed an 
unusual finding. The study, which involved more than 4,000 men ages 32 to 48, 
found that a parabolic model best fits the data (see Figure 2). Men with above- 
and below-average testosterone levels reported more symptoms of depression 
(Booth et al. 1999b). The link between high testosterone men and depression is 
the inverse for men with below-average testosterone. The relationship disappears 
for those with above average testosterone when controls for antisocial and risk 
behaviors and the absence of protective factors such as marriage and steady 
employment are in the equation. Anti-social and risky behavior brings people into 
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contact with the criminal justice system and puts individuals at odds with relatives 
and friends. Incarceration, being sued or paying fines, and social isolation increase 
the risk of depression. The association is unchanged by these factors for those 
with below-average testosterone. It is possible that low testosterone is linked to 
neurotransmitters that are associated with depression (Zhang, Ma, Barker and 
Rubinow 1999). Further analysis revealed that testosterone was not linked to 
other forms of psychopathology (bipolar disorder, paranoid schizophrenia, panic 
attacks, post traumatic stress disorder and several phobias). When exploring the 
links between testosterone and behavior it is important to keep in mind that the 
relationship may not be linear. 


Figure 2. Lowess Curves of Testosterone and Depression 
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Military Combat 


An analysis of factors that predict exposure to military combat reveals that high 
testosterone increases the likelihood of exposure (Gimble and Booth 1996). It 
is unclear whether individual or environmental variables influenced participation 
in combat. Soldiers with high testosterone may have taken an active role in 
Seeking out combat. Or those in command may have recognized qualities that 
will make the individual a better combatant and assign him accordingly. It is also 
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possible that high testosterone individuals are antisocial enough to get combat 
assignments as punishment. In any case, the model shows impressive predictive 
ability despite the fact that we cannot determine the causal direction. 


Risk Behavior and Depressive Symptoms in Youth 


In a study of 654 normally developing youth (ages 7-17, M = 13.5), there was 
no evidence of direct testosterone-behavior effects (Booth et al. 2003). Rather, 
the expression of testosterone-related behavior was dependent on the quality 
of parent-child relations. That is, when parent-child relations were poor, high- 
testosterone sons (comparedto low-testosterone sons) were more likelyto engage 
in antisocial behavior, and low-testosterone sons (compared to high-testosterone 
sons) were more likely to report symptoms of depression. In daughters a different 
picture emerged. When daughter-mother relations were poor, low-testosterone 
daughters (compared to high-testosterone daughters) were more likely to express 
risky and antisocial behavior. When father-daughter relations were poor, low- 
testosterone daughters (compared to high-testosterone daughters) were more 
likely to be depressed. In short, testosterone's relationship with risk behavior and 
depressive symptoms was conditional on the quality of parent-child relations. 
As parent-child relationship quality increases, testosterone-related adjustment 
problems are less evident. Interestingly, unlike sons' testosterone, daughter's 
testosterone is negatively related to risk behavior. 

It is clear from these studies that the biosocial model is a promising one for 
exploring problem behavior. Exploring the social environment as a source of 
variables that have the potential to moderate the hormone-behavior association 
will greatly expand our knowledge. The social environment also has potential 
for affecting hormones as well as behavior. When studying youth, the potential 
confounding influences of cognitive maturation and pubertal development add 
another layer of factors to take into account in developing research questions. As 
suggested by Mazur and Booth (1998), around puberty, the effect of testosterone 
on behavior may work through increased size, muscle mass and appearance of 
secondary sexual characteristics. These changes produce dramatic change in 
the way individuals are treated by peers, parents, siblings and authorities. 

Also, future biosocial models should incorporate a wider range of variables 
in the analysis. For example, Figure 3 suggests that the parent-child relationship 
quality not only moderates the link between testosterone and problem behavior, 
but may also moderate the link between developmental phase and problem 
behavior and have a direct effect on problem behavior. The model also suggests 
that parents’ testosterone may influence the quality of parent-child relationships. 
The phase of child development, especially the early adolescent tendency to 
engage in more conflict with parents and the late adolescence tendency to 
decrease the amount of time spent with parents in favor of peers, may also affect 
the testosterone-problem behavior association (Silk and Steinberg 2003). 

A series of studies alerts us to the importance of the timing of measuring 
hormones and behavior in future studies of both youth and adults. A series of 
studies by Udry and his colleagues (1988, 1990) are particularly interesting with 
respect to timing issues. In a cross-sectional analysis of 12- to 13-year old boys, 
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Figure 3. Testosterone and Youth Behavior Problems 
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correlations were observed between testosterone and norm-violating behavior. 
In a prospective analysis of these boys over a three-year period, no associations 
between testosterone and behavior were observed (Drigotas and Udry 1993; 
Halpern, Udry and Suchindran 1997, 1998). Many interpreted this to mean that . 
testosterone is largely organizational and not activational during adolescence (e.g., 
Mazur and Booth 1998). When the investigators conducted a third study in which 
the interval between data collection points was substantially shorter, they found 
that increases in testosterone were linked to first intercourse at a much earlier age 
in both boys and girls (Halpern, Udry and Suchindran 1997, 1998). Investigation 
showed that recall of sexual intercourse initiation over longer intervals was 
inaccurate and masked the relation between testosterone and problem behavior. 
Measurement timing deserves serious attention subsequent studies. 


Peer and Family Relationships 


While the vast majority of studies have focused on associations between 
testosterone and negative behavior, a handful of small studies suggest that higher 
levels of testosterone have positive links with interpersonal associations in some 
circumstances. In a study of college students, males and females with higher 
testosterone had a more forward and confident interaction style than those with 
low testosterone (Dabbs et al. 2001). Above average levels of testosterone have 
been linked to college students being helpful, engaging and outgoing (Dabbs and 
Ruback 1988). In adult women, above-average testosterone is related to being 
confident (Baucom, Besch and Callaan 1985). 


Peer Relationships 


A study of adolescent peers using the biosocial model indicates that testosterone 
may be linked to positive as well as negative behavior depending on the nature 
of the social moderator. Rowe, Maughan, Worthman, Costello and Angold (2004) 
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documented the association between testosterone and behavior in boys and 
showed that it was highly dependent on the nature of the immediate peer context. 
When peers were judged as positive social influences, high testosterone levels 
in boys were associated with assertive and dominant behaviors characteristic 
of leadership. When peers were engaged in deviant behavior (rule violation, 
substance use), high testosterone levels in boys predicted non-aggressive 
symptoms of conduct disorder. 

Another study of peers reveals that the quality of the relationship adolescent 
boys have with their mothers or older sisters is important for understanding 
testosterone's links to peer relationships (Updegraff, Booth and Bahr 2006). 
Above average levels of testosterone are positively related to boys' perceptions 
of their best friends as popular, to their own perceived competence in peer and 
romantic relationships, and to the time they spend alone with a female. But this 
occurs only if a boy has a positive relationship with his mother or an older sister. 
It appears that female members of the family play an important role in socializing 
adolescent boys with respect to developing and maintaining meaningful 
relationships with peers. 

In a classic study, Udry (1988) found that testosterone is linked to interest in 
sexual behavior, but that the interest is expressed behaviorally only in cases of 
father absence and low participation in team sports in the case of females. In 
males, however, the behavioral expression of testosterone's link to interest in sex 
is not moderated by any of the variables studied. 


Marital Relationship Quality 


Research on marital quality suggests that high levels of testosterone may result 
in either high or low marital quality depending on perceptions of the social 
environment. In a sample of established working- and middle-class families with 
school-age children, neither husbands' nor wives' testosterone showed a direct 
connection with marital quality (Booth, Johnson and Granger 2005). In contrast, 
the association between husbands' testosterone and positive and negative marital 
quality (as evaluated by both spouses) was conditional on husbands' perception 
of role overload. When perceptions of men's role overload were elevated, higher 
testosterone levels were associated with lower levels of marital quality. When 
role overload was low, higher testosterone levels were linked to higher levels of 
marital quality. The study offers additional support to the notion that depending 
on perceptions of the social context, testosterone enables positive behavior in 
some situations and negative behavior in others. What is the mechanism by which 
men's perceptions of the demands of the environment moderate the association 
between testosterone and marital quality? While we know of no research bearing 
on this question, we suggest that when stress is low, higher testosterone men 
may devote more attention to their spouses, take the initiative in expressing 
positive interaction, and make a greater overall investment in the relationship. 
When high testosterone men are under considerable stress they are more likely 
to disagree, be less responsive to affective cues, and take risks (e.g., infidelity, 
drug or alcohol use) that may threaten marital relationships. 
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An earlier study of men who were veterans indicated testosterone had direct 
negative effects on marital quality (Booth and Dabbs 1993). The men in the two 
samples were approximately the same age. What may account for the difference 
is that only some of the servicemen were in stable long-term marriages that 
involved children whereas the entire sample of men in the study reported above 
were in such unions. Maintenance of such long-term unions may mean that very 
few men in the recent study were predisposed to express low levels of affect and 
high levels of conflict whereas a much higher proportion of the men from a more 
diverse population are likely to manifest such behavior. These studies indicate 
the importance of taking into account early experiences and other background 
factors in studies linking testosterone and behavior. 


Women's Orientation toward Family Relationships 


Again we note the limited number of studies linking testosterone to female peer 
and marital relationships. One study that poses a challenge for future research 
explored gendered behavior in 250 women aged 27-30. Udry, Morris and Kovenock 
(1995) found that women with higher levels of testosterone were less likely 
to marry and assigned a lower priority to marrying. They also found that such 
women were less interested in children and had fewer children. Cashdan (1995) 
found that women with higher testosterone-associated dominance behavior felt ' 
less need for a partner. Whether testosterone-linked low interest in marriage and 
children is translated into poorer marital quality and lower parent-parent child 
relationship quality is unknown and is an important question for future research. 


Men's Orientation to Family Relationships 


Usually research focuses on the impact of higher levels of testosterone on behavior. 
Much less attention has been devoted to declines in testosterone and the role of 
the social milieu in that decline. For example, testosterone declines when men 
marry (Grey et al. 2002, Mazur and Michalek 1998) and declines further when 
they become fathers (Story et al. 2000), but climbs when they divorce (Mazur and 
Michalek 1998). Additionally, fathers with lower testosterone are more attuned to 
their infant's cries (Fleming et al. 2002). The origin and duration of these changes and 
the consequences they have for behavior is unknown. The decline may derive from 
attempts to meet the expectations associated with new roles and expectations, or 
it may have biological roots in pheromones or other stimuli. The mechanisms that 
explain these testosterone changes are ripe for study. 

Examples of questions to be studied include: What behaviors change as a result 
of the declines in testosterone associated with marriage or becoming a parent? 
Whose behavior precedes the change in testosterone? Does the behavior following 
the decline in testosterone then lead to further changes in the hormone? Does 
the process of assuming more and more of the parenting responsibilities lead to 
further declines in testosterone or does it eventually contribute to an increase in 
testosterone that leads to the father becoming more involved with offspring? 

Answering such questions requires knowledge of risk and resilience factors 
that may influence context (Luthar et al. 2000, Masten et al. 1990). For example, 
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the well-known buffer, parent-child relationship quality, may depend on such 
things as whether the parent was raised in a supportive family, the current family 
is disadvantaged economically, and whether the quality of the parents' marriage 
is high or low. 


Gender Differences 


Testosterone's link to behavior in females has some parallels with males and 
some differences. In both female and male children, low levels of testosterone 
are related to depression (Booth et al. 2003). In the same study testosterone 
is negatively related to externalizing behavior in females and positively related 
to such behavior in males. There is too little research on testosterone-behavior 
associations among females to even tentatively account for the gender 
differences, making this an important question to address in future research. 

As noted above, the link between testosterone and sexual behavior among 
youth was found to vary by gender (Udry 1988). Although testosterone was 
positively related to interest in sex, having a father in the household and 
participating in sports inhibited the interest from being expressed in sexual 
intercourse in females but not in males. 

We know that high levels of testosterone are related to peer competence and 
the amount of time spent with peers by males, but not by females (Updegraff et 
al. forthcoming). A similar picture has been observed among adults. In a study 
of marital quality wives' testosterone was unrelated to marital quality whereas 
husbands' testosterone was related both positively and negatively related to 
marital quality depending on the males' role overload at the time marital quality 
was measured (Booth et al. 2005). 

On the other hand, testosterone's link to parenting behavior is similar for males 
and females. Higher testosterone is associated with lower levels of interest in 
marriage and parenting (Cashdan 1995, Udry et al. 1985) among females. In 
men, studies suggest that low levels of testosterone are associated with the 
anticipation of becoming a parent (Story et al. 2000) and being more attuned to 
caring for offspring (Fleming et al. 2002). Low testosterone has a positive role in 
marriage and parenting among both males and females. 

Whether testosterone has links to status, dominance behavior, aggression, 
risk behavior and initiative that is similar in males and females is unclear. Grant 
and France (2001) indicated that high testosterone women are more likely to 
report dominating behavior. Studies of women in prisons report both positive and 
null associations between testosterone and dominance or aggression (Banks 
and Dabbs 1996; Dabbs and Hargrove 1997; Dabbs, Ruback, Frady, Hooper 
and Sgoutas 1988). Some studies report testosterone to be negatively related 
to status (Gladue 1991). Clearly further research is needed to reach conclusions 
regarding gender differences in the links between testosterone and dominance 
and risk behavior. For example, higher levels of testosterone are positively related 
to interest in masculine activities in pre-pubertal females (McHale et al. 2004) 
and to resistance to parental socialization efforts encouraging feminine behavior 


(Udry 2000). 
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In addition, we must continue to explore developmental and Cultural factors. 
For example, Taylor and colleagues suggested that "fight or flight" is not an 
appropriate model to use in explaining behavior during a crisis. They proposed 
"tend and befriend" as a more appropriate model for females and that oxytocin 
is one of the key hormones involved. Drawing on conflict theory, Campbell (1995) 
argued that because of the greater value of mothers' health relative to fathers 
in providing sufficient resources to offspring, women are less likely to involve 
themselves in potentially dangerous conflicts over status. While women are 
less likely to engage in dangerous conflicts, women do engage in aggression to 
compete for scarce resources. Campbell argued that female non-aggressiveness 
is a myth created by men to sanction female aggression. 


Conclusion 


The advent of non-invasive means of measuring hormones and new paradigms 
has opened windows of opportunity to integrate testosterone into sociology. The 
new paradigms have shifted thinking away from biodeterminism and focused 
it on the integral role of the environment for understanding the links between 
biology and behavior. This realization has enabled research attention to focus 
on identifying new sources of individual differences, modeling intra-individual 
change, and exploring interactive and reciprocal effects. 

Social status is a fundamental building block of human groups. The way in 
which status is gained, maintained and lost is fundamental to understanding the 
structure of individual social networks, corporations and governments. Status 
governs the way in which resources are obtained and distributed. It also defines 
the way in which order is maintained. It is apparent that testosterone has a role 
- in organizing status hierarchies. Next steps in the research should incorporate 
females to a significant extent and move beyond the sports venue. 

To date much of the testosterone research on problem behavior has focused 
on aggression and antisocial behavior. Missing from this research is the 
understanding that the behavior of concern is dominance, risk taking and initiative, 
and that only a small portion of these acts involves aggression, defined as the 
intent to physically harm others. One of the reasons high levels of testosterone 
are so seldom associated with aggression is because of the protective influence 
of the social environment and the resilience individuals develop by living in such 
an environment. While much research has focused on externalizing behavior 
(aggression, deviance, crime), it is now apparent that testosterone has a role in 
our understanding of depression as well. The topic most in need of research is 
the process by which the environment moderates the link between testosterone 
and problem behavior. 

Largely ignored for many years is the positive role testosterone may play 
in family and friendship relationships. There are now enough studies to assert 
that this is rich area for research. As in other relationship domains, the social 
environment plays a important role in defining whether testosterone has a 
positive or negative influence on the quality of intimate relationships. Research is 
needed to explore a wide range of environmental factors that have the potential 
for affecting the quality of intimate relationships. 
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From the research to date it is clear that there are important gender differences 
that need explanation. Why should female testosterone count in some areas (e.g., 
parenting) but not others (e.g., adolescent peer relations and marriage)? Yet there 
is symmetry in the way in which low testosterone in both males and females 
enhances parenting. Research is needed to understand gender differences and 
similarities in testosterone's link to behavior. 

The field is still in the early stages of carefully describing the nature of 
testosterone-behavior links in everyday social life, but the establishment of these 
associations is only the beginning. The next generation of biosocial research 
needs to go beyond describing the associations and uncover the fundamental 
mechanisms and processes underlying them (Raine 2002). The appropriate 
measurement tools and statistical strategies necessary to move this endeavor 
toward understanding the nature of biosocial phenomenon are now available. 
In our opinion, the critical need is for sociologists to increase our understanding 
of how social, behavioral, cognitive and biological variables shape individual and 
group processes. 
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Achievement and Ascription in Educational Attainment: 
Genetic and Environmental Influences on Adolescent Schooling 


François Nielsen, University of North Carolina at Chapel Hill 


Abstract 

The classic ("status attainment") model of educational and occupational attainment 
suffers from three related shortcomings when used as a tool for comparative or policy- 
oriented research on social mobility: (1) ambiguity of model parameters as measures of 
opportunity for achievement vs. ascription; (2) vulnerability to incomplete specification of 
family background; and (3) confounding of environmental and genetic influences. These 
issues can be addressed in part by using a ("behavior genetic") model that distinguishes 
variance components associated with genetic endowment, shared (or common) family 
environment, and unshared (or specific) environment. Size of the genetic component 
(heritability) measures opportunity for achievement; size of the shared environment 
component (environmentality) measures social ascription. A multivariate behavior 
genetic model of adolescent verbal IQ, grade point average and college plans is estimated 
using data for six types of adolescent sibling pairs living in the same household: MZ 
twins, DZ twins, full siblings, half siblings, cousins and non-related siblings. Results show 
large genetic components, relatively small shared environmental components, and large 
unshared environmental components for all three outcomes. Parameters of the behavior 
genetic model can be used to compare mobility regimes across social contexts and the model 
therefore provides an important tool for comparative social mobility research. 


Introduction 


Recurring questions in the literature on social stratification and mobility involve 
evaluating the role of ascription vs. opportunity in educational and socio- 
economic attainment: whether some societies are more open than others, 
whether ascription is declining in the course of industrialization, or whether a 
given subgroup of society enjoys fewer opportunities. The status attainment 
model introduced by Blau and Duncan (1967) is one of the principal tools used 
to address such questions in a comparative perspective (Breen and Jonsson 
2005). Opportunity for achievement is typically associated with the effects on 
attainment of variables presumed to reflect inherent individual qualities and effort 
(e.g., cognitive ability, education); ascription is associated with family background 
characteristics (e.g., parental education, family SES). 
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The paper presents a case that the status attainment model is inadequate as 
a tool for comparing social mobility regimes due to ambiguity of interpretations 
of model parameters as representing opportunity vs. ascription, vulnerability to 
bias due to incomplete specification of family background, and confounding 
of environmental and genetic influences. Some of the shortcomings of the 
attainment model can be alleviated by using a behavior genetic model of 
attainment that explicitly distinguishes between genetic and environmental 
influences on educational and socio-economic outcomes. To illustrate this point | 
use data on pairs of siblings who differ in their degree of biological relatedness to 
estimate a model of schooling involving three measures: verbal IO, grade point 
average and college plans. The model decomposes the variance in verbal IQ and 
the educational measures into components corresponding to effects of genetic 
endowment, shared family environment and unshared (individual-specific) 
environment. 

The paper develops an idea that has been expressed earlier by researchers 
in several fields, including sociology, but has not taken root in our field. This is 
the view that behavior genetic models of attainment are an important tool of 
comparative social mobility research because they provide consistent measures 
of opportunity and ascription in a system of stratification: ascription is measured 
as the proportion of variation in attainment attributable to the shared environment 
(environmentality) and opportunity as the proportion of variation attributable to . 
genes (heritability). 

Subsequent sections discuss problems with the attainment model and the 
potential usefulness of a behavior genetic model of attainment. Later sections 
discuss the data, develop the empirical model of adolescent schooling, present 
the results of the analysis, and finally discusses broader implications of the 
findings for comparative social stratification research. Existing literature is used 
to provide a “demo” illustrating genetically-aware social mobility research. 


Achievement & Ascription 


Following Blau and Duncan (1967) sociologists have used sets of recursive 
equations to describe the process of educational and occupational attainment of 
individuals in a system of stratification. A typical model of status attainment (as 
this type of research came to be known) might consist of three equations: 


(1) RslQ = f,(FsOcc, FsEd) 
(2) RsEd = f,(RslQ, FsOcc, FsEd) 
(3) | RsOcc = f,(RsEd, RslO, FsOcc, FsEd) 


where IO denotes a measure of cognitive ability, Ed is educational achievement, 
and Occ is a measure of occupational prestige; R and F refer to the respondent 
and the respondent's father, respectively, and the functions are typically specified 
as linear (Duncan, Featherman and Duncan 1972). The career of the individual 
is envisioned as a process in which each level of attainment is a function of 
previous attainment and characteristics of the family of origin. 

Blau and Duncan (1967) discovered an empirical pattern that would be often 
replicated in later research. First, the direct effect of FsOcc on RsOcc (controlling 
for RsEd) is small, from which they concluded that there is relatively /ittle socíal 
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ascription. Second, the direct effect of RsEd on RsOcc is large; from this they 
concluded that (1) education serves to reproduce inequality (as most of the 
correlation between FsOcc and RsOcc is indirect, through RsEd), and (2) there 
is much opportunity for achievement (as the correlation of RsEd with RsOcc 
is driven by RsEd residuals representing unmeasured individual resources 
independent of family background). Overall Blau and Duncan interpreted their 
findings as reflecting a pattern of low ascription in modern industrial society. 

Stratification researchers adopted the attainment model as a means of 
capturing opportunity for achievement (or openness). Opportunity is associated 
with effects of intermediate achievement variables (such as RsEd); ascription 
with effects of background variables (such as FsOcc). With these substantive 
interpretations of the parameters, the status attainment model can be used as a 
comparative device to evaluate the relative openness of the stratification system 
in different societies or historical periods (Breen and Jonsson 2005; Ganzeboom, 
Treiman and Ultee 1991). The attainment model — resonating with powerful 
themes of social justice — is a centerpiece of the policy-oriented discourse 
on social stratification. In this normative vein, model parameters are used to 
contrast extent of meritocracy vs. enduring strength of social ascription (Olneck 
1977:151). 

The attainment model is unfortunately inadequate as a tool for comparative 
stratification research or for use in normative debates because of shortcomings 
related to (1) interpretations of model parameters in terms of opportunity vs. 
ascription, (2) the possibility of estimation bias due to incomplete specification 
of family background, and (3) confounding of environmental and genetic 
influences. 


Interpretation Issues 


Interpretations of the effects of background (such as family SES) or intermediate 
variables (such as IQ or education) are substantively ambiguous because 
intermediate variables may reflect social inheritance rather than intrinsic talent 
or effort. The difficulty in linking normative concepts to model parameters is 
exposed in sharp relief in the debate surrounding the finding of Herrnstein and 
Murray (1994) that, controlling for SES of family of origin, cognitive ability (IQ) 
has a strong effect on educational and socio-economic outcomes. The authors 
interpret this pattern (together with other evidence) as indicating increasing 
returns to cognitive ability (and opportunity) in contemporary U.S. society. Critics 
have responded by questioning the strength of ability effects and/or arguing that 
IO scores reflect social inheritance rather than native talent, so that IO effects 
reflect social reproduction rather than opportunity (Fischer et al. 1996). 


Model Specification Issues 


The second issue concerns the open-ended specification of family background. 
Herrnstein and Murray (1994) measure family background with a composite 
SES index based on parental education and income. Critics point out that the 
SES composite does not adequately control for all the relevant aspects of 
family background. Leaving important aspects of family environment out of the 
model produces specification bias which artificially inflates the apparent effect 
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of cognitive ability and thus the evidence for opportunity. If all these factors 
were properly controlled, critics claim, the effect of ability would be reduced or 
disappear. Re-estimating some of Herrnstein and Murray's (1994) models with 
more detailed family background measures does reduce (but typically does not 
eliminate) the estimated effects of cognitive ability on educational and socio- 
economic outcomes (Fischer et al. 1996; Korenman and Winship 2000). This 
exchange illustrates a general problem with the classic attainment model: the 
task of controlling for family background with measured variables is inherently 
open-ended as there is no way to guarantee that all relevant aspects of the family 
environment have been measured and included in the model. 


Confounding of Genetic and Environmental Effects 


The role of genes in occupational mobility and the problem of confounding 
environmental and genetic influences were already addressed in the mainstream 
sociological literature almost 40 years ago. In a remarkable paper Eckland (1967) 
argued that the social mobility research of his days (then mainly based on mobility 
tables) was flawed as it assumes, in estimating aggregate mobility, a null model 
in which sons from any category of origin are equally likely to reach any category 
of destination. If the abilities to reach certain destinations are in part genetically 
determined and, as a result, unequally distributed, it follows that sons from certain . 
origins will be more likely to reach certain destinations. Without control for these 
genetic effects, the resulting asymmetry will be falsely attributed to a lack of 
perfect mobility. Thus, Eckland claimed, the degree of social mobility cannot be 
properly estimated without controlling for the association between origin and 
destination due to genetic transmission of abilities. The implication is that in 
measuring social rigidity or ascription, any association between occupational 
achievements of father and son due to genetic causes should be partialled out 
from the overall association; social ascription is identified with non-genetic 
(presumably environmental) causes of inter-generational transmission of status 
(see also Eckland 1979). 

In another landmark article, Scarr and Weinberg (1978) presented results from 
a study of adopted children showing that the correlations between characteristics 
of adoptive parents and children's cognitive outcomes are very small, whereas 
the correlations between these outcomes and cognitive ability of the birth mother 
are larger, a pattern suggesting that effects of family background variables on 
achievement of children in biological families are due to genetic causes rather 
than the environmental mechanisms sociologists surmise. Such results imply 
that the association between parental education and educational attainment of 
a biological child may reflect in part genetic transmission of educational abilities 
from parent to offspring rather than purely environmental influences. Thus the 
effect of parental education is not a good measure of social inheritance. 

The argument underlying both Eckland (1967) and Scarr and Weinberg (1978) is 
that biological inheritance produces associations between background variables 
(e.g., parental education and occupation) and respondent outcomes (e.g., IQ and 
education) that are conceptually distinct from associations due to environmental 
mechanisms of social reproduction. In the usual attainment model estimated 
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from respondents raised in their biological families effects of background 
variables on cognitive, educational and occupational attainments of respondent 
are confounded with genetic influences. 


The Role of Genes 


Even though their articles were published in a mainstream sociology journal, 
Eckland's (1967) and Scarr and Weinberg's (1978) critique of the classical 
attainment model did not have much impact on the field. A literature investigating 
the genetic bases of educational and occupational attainment flourished briefly in 
the 1970s, involving economists and sociologists (see contributions in Taubman 
1977). The work of Jencks and colleagues in sociology was a prominent part 
of that literature (Jencks 1980, 1992; Jencks and Brown 1977; Jencks et al. 
1972). The research tradition persevered in economics (Behrman et al. 1980; 
Behrman, Pollack and Taubman 1995) where it is experiencing a mini-renaissance 
(Bjorklund, Jantti and Solon 2005), but faded in sociology.! Meanwhile since 
the early 1970s behavior genetic research has experienced explosive growth 
in the fields of mental health, child development and cognitive and personality 
psychology (Plomin et al. 1997), producing a sophisticated statistical methodology 
to disentangle genetic and environmental influences on behavior using data 
on twins, adopted children and other relatives (Neale and Maes, forthcoming; 
Rodgers et al. 2001; Rowe and Teachman 2001; Shanahan, Hofer and Shanahan 
2003). Findings on the substantial role of heredity in cognitive ability, personality 
and educational and socio-economic outcomes have also accumulated (Rowe 
1994). These conclusions on the role of genes in behavior are now finding their 
way to a broader public (Pinker 2002). 

Behavior genetic models partition the variance of a measurable trait (a 
phenotype) into a component due to genetic inheritance, a component due to 
the shared (common, between-families) environment of siblings (aspects of 
the family and the larger rearing environment that tend to make siblings reared 
together alike), and a component due to the unshared (specific, unique, within- 
family) environment of a sibling (environmental factors that differ among siblings 
and tend to make them different)? The ratio of each variance component to the 
total phenotypic variance is termed heritability, environmentality and specificity, 
respectively. This decomposition of the phenotypic variance is the key to resolving 
the difficulties of the attainment model discussed earlier. 

The realization has emerged in the behavior genetic literature that the behavior 
genetic model provides a clear conceptual distinction between opportunity for 
achievement and ascription. The shared environment component, as it reflects 
the combined impact of such factors as social class, parental network of 
acquaintances, minority status, neighborhood characteristics and other aspects 
of the rearing environment that constitute the common experience of siblings in 
a family and affect their outcomes (such as educational achievement) in a similar 
way, captures the background characteristics that stratification researchers 
presumably have in mind when they conceptualize mechanisms of social 
reproduction and the ascriptive assignment of status. As Rowe (1994:33) writes: 
"This ratio [of shared environmental variation to total phenotypic variation] has 
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important policy implications, because it indicates how a phenatype might be 
changed by altering the rearing conditions of children with poor phenotypes to 
be like those of children with good ones. The greater the shared rearing estimate, 
the more change can be expected to follow from changing rearing conditions." 
(See also Jencks 1980:734.) The shared environment component of the total 
variance in outcome can thus be interpreted as a measure of ascription (Heath et 
al. 1985, Scarr-Salapatek 1971). 

The genetic component, reflecting the extent to which individuals are able 
to achieve their genetic potential for the trait, can be taken as a measure of 
opportunity for achievement? In this interpretation, the phenotype is viewed 
as resulting from the interaction of genetic endowment with the social 
environment. Different social environments are viewed as more or less restrictive 
of the full expression of genetic potential. Favorable environments, permitting 
fuller expression of potential, are characterized by high heritability. Unfavorable 
environments, inhibiting expression of native talent, are characterized by low 
heritability. Heritability measures realization of genetic potential and therefore 
opportunity for achievement (Guo and Stearns 2002). 

The unshared environment represents a combination of measurement error 
and idiosyncratic environmental influences that affect siblings in different ways; 
examples are birth order, a childhood disease that affects one sibling and not 
another, or association of siblings with different peer groups (Jensen 1997, - 
Turkheimer and Waldron 2000). 

The problem of incomplete specification of family background that plagues 
the attainment model is rendered moot in the behavior genetic model, which 
measures the overall impact of the shared environment in black box fashion, 
without actually measuring, or even identifying, the variables involved. Thus 
it is no longer possible to spuriously overestimate the role of an achievement 
variable such as IQ or underestimate the role of family background by leaving an 
important environmental characteristic out of the model. 

Finally the behavior genetic approach, by distinguishing among variance 
components due to genes, shared environment and unshared environment 
effectively disentangles the confounding of genetic and environmental influences 
that plagues the attainment model. Heritability, environmentality and specificity 
can be compared across social settings, groups or social systems. Such 
comparisons potentially inform a comparative sociology of social stratification 
and mobility, permitting statements on the relative degrees of social openness 
across societies and historical periods, and across groups and social contexts 
within a given society. The components are also meaningful conversation pieces 
in normative debates concerning social inequality. 


Data 


Data are from the first two waves of AddHealth, a school-based longitudinal 
study of adolescents in grades 7 through 12 (Udry 1998). Data were collected 
from 1994 through 1996. Siblings living in the same household were identified, 
and if necessary, a sibling was added to the sample to complete a pair. Pairs 
were classified as monozygotic twins (MZ, N — 170), dizygotic twins (DZ, N — 
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290), full siblings (FS, N = 702), half siblings (HS, N = 242), cousins (CO, N = 
105), and non-related (NR, N = 174). Ns are pairs actually used in the analysis 
(on determination of zygosity see Rowe and Jacobson 1998). All sibling pairs are 
used, even though data on an individual may be repeated when an individual is 
member of more than one pair (on this, see Eaves et al. 1999:67). 

Verbal IQ (VIO) measures verbal cognitive ability; it is the score on a test 
consisting of 87 items from the Peabody Picture Vocabulary Test (PPVT) (Neiss 
and Rowe 2000). Grade Point Average (GPA) is the average answer to four 
questions in each of two waves concerning recent grades in English or language 
arts, mathematics, history or social studies, and science. Co/lege Plans (CPL) is 
the average answer to questions on college aspiration and expectation asked in 
both waves. Only self-identified blacks and non-Hispanic whites were included 
in the analysis and a few outlying observations with VIO scores below 50 were 
excluded. 

Average VIO score is about 12 points less for blacks compared to whites, and 
about 2 points less for females compared to males. There are also significant race 
and sex differences in GPA, and a smaller sex (but not race) difference in CPL. Such 
differences may inflate estimates of shared environmental effects in behavior genetic 
models (Rowe, Jacobson and Van den Oord 1999). To control these differences each 
variable was standardized within each of four race by sex categories (black female, 
black male, white female, white male). Zero-order associations of the measures with 
age are minute, so age was not controlled further. 


Model and Methods 


The school attainment model is depicted in Figure 1. It is a structural equations 
model (SEM) containing both observed variables represented by squares and 
unmeasured, latent variables represented by circles.» Measured variables are 
VIO, GPA and CPL. Straight arrows between VIO and GPA and CPL, and between 
GPA and CPL, represent elements of an educational attainment model relating 
the three variables in a recursive fashion, so that VIO is assumed to affect both 
GPA and college plans; GPA also directly affects CPL. This subset of effects is 
termed the phenotypic part of the model. 

The behavior genetic aspects are embodied in the latent variables. Measured 
variables for a given sibling are functions of three genetic factors (A,, A. A. 
assumed uncorrelated within a given sibling. Genetic factor A, affects all three 
phenotypes VIO, GPA and CPL; A, affects only GPA and CPL; A, affects CPL only. 
This patterning of the paths is called a Cholesky factorization (Neale and Maes, 
forthcoming; see empirical examples in Emde and Hewitt 2001). The Cholesky 
structure is adopted on substantive and methodological grounds further explained 
in Nielsen (2006). 

Each genetic factor for one sibling is correlated with the corresponding factor 
for the other sibling by a quantity k corresponding to their degree of relatedness 
(proportion of genes shared by common descent). MZ twins have identical genes, 
so k is always equal to 1. When mating is random (a strong assumption that is 
relaxed later) DZ twins and full siblings share half their genes, so k = .5. The 
parameter k is .25 for half siblings, .125 for cousins and 0 for non-related siblings. 
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Figure 1. Full (BACE) Model of Adolescent Schooling ` 





Note: VIQ = verbal IQ, GPA = grade point average, CPL = college plans. k fixed to 1.0 (MZ); .5 
(Z ES); 25 (HS); 125 (CGO); 0 (NR). 


Itis the ability to use genetic theory to specify in advance the association between 
genotypes of siblings — an idea going back to Fisher (1918) - that gives behavior 
genetic models the leverage to disentangle genetic from environmental effects. 

Latent factors C,, C, and C, represent the shared or common environment 
of siblings, which corresponds to the rearing environment, consisting of such 
variables as social class or family SES, quality of schools in the community, family 
connections, etc., that affect siblings reared in the same environment in the 
same ways and thus contribute to make them phenotypically similar. The shared 
environment is also decomposed into a Cholesky structure with three orthogonal 
factors (C,, C, and C.). As the shared environment is assumed to affect each sibling 
in the same way, the correlation of each C factor across siblings is fixed to 1. 

Latent factors E,, E, and E, represent the unshared or unique environment 
consisting of sibling-specific experiences that contribute to make siblings 
phenotypically different from each other. Such differentiating environmental 
influences might include parental preference, birth order, influences of different 
teachers or peers, or a disease affecting one sibling but not the other. In this 
model the specific environment also includes errors of measurement in the 
variables. The unshared environment factors constitute a Cholesky structure. 
The sibling-specificity assumption is implemented by fixing to O the correlation 
of each factor across siblings. 
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The model is estimated by deriving mathematically the expected covariance 
matrix of the observed variables for each type of sibling pairs as a function of 
model parameters (see Nielsen 2006 for derivation of the model and Mx script). 
Each matrix has dimension 6 by 6, with rows and columns corresponding to the 
observed phenotypes for each sibling (i.e., VIQ,, GPA,, CPL,, VIQ,, GPAs CPL, 
where subscripts denote siblings in a pair). The parameters of the model are 
then estimated simultaneously for the six types of siblings by minimizing the 
discrepancies between expected and observed covariance matrices according 
to the maximum likelihood (ML) criterion (Bollen 1989, Loehlin 2004). The SEM 
program Mx was used (Neale et al. 2003). Each 6-by-6 covariance matrix provides 
(6x7)/2 = 21 statistics (variances or covariances), so there is a total of 126 statistics 
over the six groups of sibling pairs. The full model contains 21 path coefficients 
to be estimated (six for each Cholesky factorization plus three for the phenotypic 
model), so there are 105 df remaining to test the fit of the model. 


Results 


The model is estimated from covariance matrices rather than correlations (Neale 
and Maes, forthcoming). However correlations for the six types of siblings are 
shown in Table 1, to provide a sense of the information on which estimation 
of the variance components is based. Correlations across siblings for the same 
variables are shown in bold type. Cross-siblings correlations are about twice as 
large for MZ twins (.724, .660 and .663 for VIO, GPA and CPL, respectively) than 
for DZ twins (.356, .332 and .264).° A classic estimator of heritability is twice the 
difference between the correlations of MZ and DZ twins. Thus one can estimate 
heritability as 2(.724 x .356) = .736 for VIO, .656 for GPA and .798 for CPL. 
These estimates are similar to those found in other studies of cognition-related 
outcomes for adolescents and young adults (e.g., Plomin and Petrill 1997). At 
the other extreme of relatedness, correlations for non-related siblings living in 
the same household are estimates of the pure impact of the shared environment 
of siblings on the outcomes; their small sizes (.063, .080 and .190) constitute a 
preliminary hint that the shared environment is not a substantial determinant of 
these variables in this population, except perhaps for CPL. 

The full model is denoted BACE, as it specifies direct paths relating observed 
variables (contained in matrix B), in addition to a full Cholesky structure associated 
with each latent component A, C and E. Fit statistics for the BACE model 
are shown on the first line of Table 2. Unfortunately the B matrix in the BACE 
model is not identified; the reason is that the ACE part of the model completely 
accounts for the observed variables and their correlations, so B cannot improve 
the fit." B can be estimated in a simplified model with no genetic component, a 
single shared environment factor C, affecting the observed variables and three 
uncorrelated latent variables E,, E, and E, representing the unshared environment 
of VIO, GPA and CPL, respectively. (C, is then equivalent to a pair-specific fixed 
effect affecting each observed variable.) The model is labeled BC,E, to indicate 
the shape of the matrices involved, with C reduced to a single column and E to 


a diagonal matrix. 
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Table 1: Correlations for Six Groups of Siblings (N = number of pairs) 


MZ Twins (below diagonal, N = 170) DZ Twins (above diagonal, N = 290) 


VIQ, 
GPA, 
CPL, 
VIQ, 
GPA 
CPL 


2 


2 


VIQ, 
GPA, 
CPL, 
VIQ, 
GPA, 
CPL 


2 


VIQ, 
GPA 
CPL, 
VIQ, 
GPA, 
CPL 


1 


` 


CPL, 
.035 
.062 
.264 
136 
292 


CPL, 
183 
081 
204 
203 


.295 


CPL 
.013 
-.007 
.190 
.155 
.180 


2 


VIQ, GPA, CPL, VIQ, GPA, 
.239 172 .356 .047 
P 2213 .132 .332 
.290 .378 .136 .105 
.724 .308 .239 .245 
.182 .660 322 .308 
.325 .362 .663 .374 .393 
Full Siblings (below diagonal, N = 702) Half Siblings (above diagonal, N = 242) 
VIQ, GPA, CPL, VIQ, GPA, 
.183 236 .310 -.103 
.295 434 .092 .278 
212 .380 111 102 
411 189 148 St 
.163 .360 .245 .265 
.133 .226 .332 252 .406 
Cousins (below diagonal, N 105) Non-related Siblings (above diagonal, N = 174) 
VIQ, GPA, GPL VIQ, GPA, 
292 .182 .063 -.101 
Sh 372 -.066 .080 
.061 .206 .099 .169 
.354 .127 .007 .253 
.090 .104 -.013 .191 
.238 .207 .121 271 224 


2 E d 
Note: Correlations in bold type are for the same variable across siblings: VIQ = verbal IQ, 


GPA = grade point average, CPL = college plans. Subscripts denote siblings. 


The fit statistics for the BC,E, model are shown on line two of Table 2. The 
x? is 555.888 for 117 df (p « .001), which is not a satisfactory fit. (With SEMs 
the goal is to obtain a non-significant model.) To check the significance of B, the 
model on line three of Table 2 drops B from the model. This results in ax? increase 
of 89.657 for three degrees of freedom, a highly significant deterioration in fit. 
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Table 2: Model Comparisons 
ee ee pea Se IT UV, VU UU ITU EE 


Fit Statistics Tests 
Model x df p AIC RMSEA Test Ax Adf p 
1.BACE 153.930 105 .001  -56.070 .042 
2.BC,E, 555.888 117 .000 321.888 A12 
GE 645.546 120 .000 405.546 .1200 | 3vs.2 89.657 3 .000 
4. ACE 153.930 108 .002 -62.070 041 


5. AE 179.069 114 .000 -48.931 .046 5vs.4 . 25.139 6 .000 
6. CE 296.203 114 .000 68.203 .079 6vs.4 142.273 6 .000 
T. ACE 221.945 111 .000 -.055 .064 7vs.4 68.014 3 .000 
8. ACE 188.139 111 .000 -33.861 054 8vs.4 34.209 3 .000 
9. ACE 167.437 111 .000  -54.563 .043 9vs.4 13.507 3 .004 
10.AC,E 158.807 111 .002  -63.193 040 10vs.4 4.877 3 .181 
11.AC¡E 165.435 114 .001 -62.565 044  11vs.4 11.505 6 .074 


11 vs. 10 6.628 J. .085 


Note: B = phenotypic paths; A = genetic paths; C = shared environment paths; E = specific 
environment paths; A,, C,, E,: off diagonal elements of A, C, or E fixed (independent factors 
model); A,, C,: lower triangular matrix A, C reduced to single column vector (common 
factor model). * favored model 


Thus the path coefficients relating the three observed variables of the model are 
significant in the context of this simplified specification of the latent structure 
with no genetic influences. One reason for the poor overall fit of Models 2 and 3 
is that the absence of genetic component implies identical covariance matrices 
for all six groups of sibling pairs, a pattern that is clearly inconsistent with the 
data (Table 1). 

For the full ACE model, x? is 153.930 with 105 df (p = .001), which at first sight 
is not a satisfactory fit. RMSEA is a measure of fit that adjusts for degrees of 
freedom and sample size (McDonald 1989). RMSEA is .042, below the threshold 
of .05 corresponding to a very good fit. Fixing to zero the shared environment 
structure C (model AE) causes a highly significant increase in x? of 25.139 for 
6 df (p « .001); therefore the shared environment structure cannot be dropped 
from the model. Fixing the genetic structure A (model CE on line 6) produces a 
highly significant increases in x? of 142.273 with 6 df (p < .001); thus the genetic 
structure cannot be excluded either. 

Models 7 to 11 test alternative specifications of the internal structure of the 
A, C and E matrices, specifically whether the lower triangular matrix can be 
replaced by a simpler model consisting of a diagonal matrix (three uncorrelated 
factors, each affecting a single outcome) or by a single column (a single factor 
affecting all three outcomes). Reducing A to a single column (model A,CE, the 
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common factor model) or to a diagonal matrix (model AdCE, the independent 
factors model) leads to unacceptably large x? increases (68.014 and 34.209 with 3 
df, respectively). Factors in the A matrix can be thought of as sets of genes. Thus 
the tests on lines seven and eight show that the genetic structure can neither be 
reduced to a single set of genes affecting all three outcomes (A,CE) nor to three 
distinct sets of genes, each affecting a single factor (A,CE). Likewise reducing 
the shared environment structure C to a diagonal matrix representing three 
independent factors (model AC,E) results in a significant increase in x? (13.507 
for 3 df, p = .004). However reducing C to a single column representing a single 
shared environment factor affecting all three variables (model AC'E) produces 
a non-significant increase in x? (4.877 for 3 df, p = .181). Finally, combining a 
common factor structure for the shared environment with a diagonal structure for 
the unshared environment (model AC1Ed) produces a slightly refined model that 
does not fit significantly worse than ACE (x? increase 11.505 for 6 df, p = .074) 
or AC,E (x? increase 6.628 for 3 df, p = .085), and therefore becomes the favored 
model, AC E is favored over AC E for its simplicity, despite a slightly larger AIC 
value: AIC is -62.565 for AC,E, vs. -63.193 for AC,E.° Estimated parameters for 
the two models hardly differ. 

Standardized path coefficients for the favored AC,E, model are shown in 
Figure 2. The model represents shared environmental influences acting on VIQ, 
GPA and CPL as a single latent factor affecting all three outcomes, which one ` 
might perhaps identify with a “privilege” factor capturing the cognitive and 
academic advantage shared by siblings due to their rearing environment. The 
unshared environment is represented by separate factors, each one affecting a 
single measure. It is the behavior one would expect if the unshared environment 
consisted largely of measurement error. This is somewhat surprising since one 
would have expected that some unshared influences (e.g., perinatal damage 
affecting one sibling but not the other) would affect all three variables in similar 
ways. Effects of the genetic factors tend to be the largest (.202 to .738), and 
effects of the shared environment the smallest (-.041 to .371), with effects of the 
unshared environment in between (.572 to .609). 

Table 3 shows the proportions of the total expected variances of the observed 
variables that are explained by the latent factors. Heritability (conventionally 
denoted h?) is the proportion of variance explained by genetic factors. Estimated 
heritability is high for GPA (.669) and for CPL (.600), and somewhat lower for 
VIO (.536). Environmentality (denoted c?) is the proportion of variance explained 
by shared environmental factors. Environmentality is substantial for VIO (.137) 
but almost nil for GPA and CPL (.002 and .03, respectively). Specificity (denoted 
el, or proportion of variance due to unshared influences, is substantial for all 
three measures (.327, .329 and .370 respectively). Recall that the unshared 
environment here includes residual variance? Table 3 also shows tests of 
significance for these parameters using ML-based confidence intervals.'? All 
estimates are significant in the sense that the ML 95 percent confidence interval 
does not include zero except environmentality for GPA. Thus one cannot reject 
the hypothesis that GPA is entirely explained by a combination of genetic and 
unshared environmental factors, with no significant role for the shared family 
environment. Environmentality for CPL also comes close to non-significance. 
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Figure 2. Standardized Path Coefficients for Favored AC,E, Model 





Note: VIQ = verbal IQ, GPA = grade point average, CPL = college plans. A,, A,, A, = genetic 
factors; C, = shared environment factor; E,, E,, E, = unshared environment factors. (One 
sibling shown.) 


These results do not suggest strong impacts of shared environmental factors 
on the schooling process for these adolescents in U.S. schools at the end of the 
20th century." 


Discussion: Heritability, Environmentality and Comparative Startification Research 


Results presented earlier indicate that the three schooling measures are highly 
heritable, relatively unaffected by the shared environment, and substantially 
affected by unshared environments. The shared environment seems to affect all 
three measures as a single latent "privilege" factor, whereas genetic influences 
are better represented as partially independent sets of genes specific to each 
outcome. Such findings are far from isolated. The view that cognitive ability and 
educational success have a substantial genetic basis and are (beyond childhood) 
little affected by shared environments has become commonplace in mainstream 
psychology (Brody 1992, Gottfredson [1994] 1997, Neisser et al. 1996, Sternberg 
and Grigorenko 1997, especially Hunt 1997). As McGue (1997) notes, the scientific 
opposition to these conclusions now consists of studies using behavior-genetic 
models tuned to produce lower heritability estimates (e.g., Daniels, Devlin and 
Roeder 1997; Feldman, Otto and Christiansen 2000). There is also evidence for 
a role of genes in the determination of earnings (Behrman, Pollack and Taubman 
1995; Bjorklund, Jantti and Solon 2005), personality traits and some social 
attitudes (Rowe 1994). These findings have moved Turkheimer (2000) to proclaim 
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Table 3: Maximum-likelihood Confidence Intervals for Heritabilities, Environmentalities 
and Specificities for Favored AC E, Model 


95% Cl 95% Cl 95% Cl 


2 2 


h Low Up c Low Up e Low Up 
VIQ .536 .408 .649 EN .060 217 22 .269 .397 


GPA  .669  .585 125 002  .000 .043 .329 215 .394 


CPL GOU .493 .677 .030 .001 .093 .970 .310 EN 
Note: Heritabilities (h?), Environmentalities (c?) and Specificities (e?). 
VIQ = verbal IQ; GPA = grade point average; CPL = college plans. 


three “laws” of behavior genetics, namely that (1) all human behavioral traits 
are heritable, (2) the effect of being raised in the same family is smaller than 
the effect of the genes, and (3) a substantial portion of the variation in complex 
human behavioral traits is not accounted for by the effects of genes or families. 
Pinker (2002:372-99) reckons that the three laws may be “the most important 
discoveries in the history of psychology (p. 372)." 

The most controversial implication of findings of a major role of genetic 
endowment in educational and socio-economic success is a concern that this 
knowledge could be misinterpreted to make inequality of school outcomes seem ` 
"natural, just, and immutable" and to justify opposition to efforts at improving 
school performance. For this reason Goldberger (1979) suggests abandoning 
the enterprise of estimating genetic variance components of educational and 
socio-economic achievement, but this advice is rejected by Taubman (19953), 
who points out that behavior genetic models are valuable in providing a measure 
of opportunity for achievement. As Behrman and Taubman (1995) write: "The 
share of the observed variation in schooling that is attributable to across-family 
variability in environment [environmentality] provides a measure of inequality of 
schooling opportunity." (p. 250) 

Heritability and environmentality estimates refer to a specific population, 
characterized by an existing range of environments. Thus these estimates give no 
guidance on how difficult it would be to change the trait through environmental 
manipulation outside the existing range (Jencks 1980, 1992:92-1 19).? Thus high 
heritability does not imply that all environmental policies are ineffective (Eckland 
1979). Likewise, it is not because socio-economic success has a genetic basis 
that inequality in socio-economic outcomes is desirable; to conclude otherwise 
is to commit the naturalistic fallacy (Pinker 2002:162-63). Time will tell how 
discoveries about the genetic basis of behavior will be received by sociologists, 
but it seems unlikely that the accumulated evidence for a major role of genes in 
socio-economic outcomes will be easily overthrown or ignored. 

The fact that heritability and environmentality are not fixed properties of a trait 
but population statistics inherently dependent on the specific social context has 
intriguing implications. For comparative social mobility research the parameters of 
the behavior-genetic model may be viewed as macro-social variables describing 
the nature of social mobility in a system of stratification. Heritability indexes 
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opportunity for achievement (realization of native potential) and, conversely, 
environmentality measures inequality of opportunity due to differences in rearing 
environments (ascription or social reproduction). The analysis reported in this 
paper — describing one society at a particular point in history — represents a 
single case from a macro-sociological comparative perspective or for purposes 
of policy-oriented evaluations. The value of behavior genetic models as tools of 
comparative social mobility research lies in their providing comparable heritability 
and environmentality estimates for educational or socio-economic outcomes in 
different social systems or for subgroups within a given social system, allowing 
comparisons among these macro-social units on a scale of ascription versus 
opportunity. Within a given social context further comparisons can be made of 
the ascriptive vs. opportunistic nature of the achievement process for different 
dimensions or stages of socio-economic attainment (e.g. cognitive measures, 
education, occupation, income). While such a systematic comparative corpus 
does not exist at present, one can already combine estimates from a number of 
published studies to construct a "demo" of what genetically informed comparative 
social mobility research might look like. 

A general pattern emerging from the literature is that realization of genetic 
potential (and thus heritability) is lower — and environmentality correspondingly 
higher — in social contexts that are either disadvantaged or less advanced on 
a social-developmental dimension. The pattern, originally proposed by Scarr- 
Salapatek 1971, was clearly in evidence in Heath et al.'s (1985) study in Norway 
which found that heritability of educational attainment is relatively low (41 percent) 
for both males and females born before 1940, who would have been exposed 
to the more elitist traditional educational system in that country. For two cohorts 
of males born later heritability rises to 74 percent and 67 percent, reflecting 
greater openness of the system; for females, however, heritability remains lower 
for these two cohorts (45 percent and 38 percent) suggesting that achievement 
opportunity has increased for males but not for females. A similar scenario 
emerges from the study of IQ, educational attainment, and occupation in Norway 
by Tambs et al. (1989). Here heritability is higher for the younger than for the older 
group, reflecting again less social ascription affecting the younger group. The 
same inter-generational pattern obtains in the study of educational attainment in 
Australia by Baker et al. (1996), but the pattern is not as marked in the Norwegian 
data analyzed by Lichtenstein et al. (1992). Rowe, Vesterdal and Rodgers (1999), 
using U.S. data on young adults, find heritability to be higher for IQ and educational 
attainment (64 percent and 68 percent, respectively) than for hourly wages (42 
percent) (see also Rowe, Jacobson and Van den Oord 1999). Guo and Stearns 
(2002) show that heritability of verbal IO for adolescents in AddHealth is lower 
when a parent is unemployed than when no parent is unemployed (42 percent 
vs. 54 percent), and for black as compared to white adolescents (58 percent vs. 
72 percent); lower heritability is thus again associated with fewer opportunities in 
disadvantaged social environments. Finally Turkheimer et al. (2003) find higher h? 
and lower c? for IQ in high-SES environments, compared to low-SES ones. 

However cursory, this partial survey of the literature is suggestive of some of 
the contextual variables on which genetically-informed comparative stratification 
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research might focus: historical period and country — as these, factors proxy 
for the nature of the mobility regime; relative socio-economic advantage, race, 
ethnicity and sex — as these characteristics define groups facing more or less 
opportunity; and age - both as it defines cohorts subjected to different mobility 
regimes and as it may otherwise affect model parameters.'* 


Conclusion 


The status attainment model has been used to evaluate strength of ascription 
vs. achievement opportunity in systems of stratification, but this use of the 
model is problematic. Estimating ascription from the size of family background 
effects and opportunity from the size of cognitive ability and education effects 
is misleading because attribution of the effects of variables to ascription or 
opportunity is arbitrary, because the model is vulnerable to misspecification of 
family background, and because effects of background and achievement variables 
are confounded with genetic influences. Behavior genetic models estimated 
from twins and other family data can disentangle ascription from achievement 
by controlling for genetic influences and by providing a black box estimate of 
shared environmental influences that does not require explicit measurement of 
the variables involved. The impact of the shared environment (environmentality) 
may be interpreted as a measure of ascription, and heritability as a measure 
of opportunity for achievement. A multivariate behavior genetic model of verbal 
IQ, grade point average and college plans estimated from data on adolescent 
sibling pairs who were in grades 7 through 12 of U.S. schools in 1994-95 shows 
that variation in all three measures of educational attainment has a large genetic 
component and a relatively smaller shared environmental component. These 
findings suggest high levels of educational opportunity for adolescents in U.S. 
schools at the end of the 20th century. 

A cursory overview of published estimates of the parameters of behavior 
genetic models of attainmentrelated variables illustrates the possibilities 
of a comparative sociology of stratification systems using heritability and 
environmentality parameters as fundamental measures of ascription and 
opportunity for achievement in a system of stratification. In this view heritability 
and environmentality are not fixed properties of a trait; they are properties of the 
stratification system that are expected to vary across societies, historical periods 
and social contexts. Parameters of the genetic model are estimated, not as 
permanent properties of a trait such as intelligence, but as a descriptive feature 
of the social context that may tell us something valuable, say, about the way men 
and women fared differentially in the educational system in Norway after World 
War Il, or how members of disadvantaged social or racial groups are less able to 
realize their native potential in the United States at the end of the 20th century. 

Models estimated in this paper are based on the assumption of polygenic 
inheritance, i.e., that the phenotype under study is affected by a number of 
genes, each of which has a relatively small effect on the phenotype (Fisher 1918). 
Another rapidly developing category of behavior genetic models uses genetic 
markers to locate on the genome genes with large effects. These methods have 
produced major findings, such as genes associated with major mental disorders. 
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Success of the gene-finding methods depends on the existence of genes with 
large effects on a trait; only then can the effect of a specific gene be detected 
against the background of other genetic and environmental influences. The 
possibility of identifying genes with large effects is methodologically attractive 
because part of the genetic contribution is then associated with an explicitly 
measured variable (i.e., presence or absence of a given allele or marker); effects 
of other genes are treated as a residual, latent genetic factor. There is a great deal 
of finality in the identification of a gene with a major effect on a trait. When the 
trait in question is a physical or mental disorder, identification of the responsible 
locus may facilitate development of a treatment. "7 

Traits of interest in social mobility research, such as cognitive ability, dimensions 
of personality and measures of socio-economic achievement may - or may not 
— be affected by genes of major effect that stand out against the polygenic noise 
of other genes and the additional din of environmental effects. When found, rare 
genes of large effect may not explain a large proportion of the genetic variance of 
a trait in the population. Gene-finding methodologies may one day identify genes 
responsible for educational and socio-economic achievement, thus resolving 
the latent genetic component into observed DNA measures. However to the 
extent that genetic influences on these outcomes are truly polygenic, structural 
equations models based on the assumption of polygenic inheritance like those 
estimated in this paper may prove useful for a long time to come (Neale and 
Maes forthcoming). 


Notes 


1. There is a widespread but incorrect perception among sociologists that 
Jencks' work had effectively ruled out any significant role of genes in the 
attainment process. In contrast Jencks (1992:13) recalls concluding in earlier 
Work that genes exert "a considerable influence" on cognitive test scores. 


2. Many behavior genetic models also include a genetic dominance component 
capturing non-linear effects of allele combinations when a trait is affected by 
genes with dominant vs. recessive alleles. 


3. Alternatively, genes can be viewed as an instrument (in the statistical sense) 
for native potential. 


4. The model described later does not allow disaggregating the shared 
environment into measured background characteristics, although other 
behavior genetic models do (Behrman, Pollack and Taubman 1995; Loehlin, 
Horn and Willerman 1997; Taubman 1995b; Waldman 1997). In any case, the 
shared environment component represents an upper bound for the effect of 
any specific dimension of the rearing environment on the outcome. 


5. There is an alternative regression-based behavior genetic methodology 
appropriate for univariate analysis called DF analysis (DeFries and Fulker 
1985, Kohler and Rodgers 2001). 
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13. 
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For CPL the DZ correlation (.264) is less than half the correlation for MZ twins 
(.663/2 — .332), a pattern suggesting a genetic dominance effect. However 
further data analysis does not support that hunch (Nielsen 2006). 


BACE produces the same estimates and the same fit as the ACE model 
(see line four of Table 2) for the behavior genetic part, but the estimated 
coefficients of B shift with changes in the initial values of the coefficients, 
indicating underidentification (Neale and Maes forthcoming). 


Akaike's Information Criterion (AIC) is a measure of model fit adjusted for 
degrees of freedom such that a smaller (more negative) value indicates a 
better fit. 


Onecanalso estimatethe proportion ofthe covariance between two measures 
explained by a set of factors. Associations among these three cognitive and 
educational measures seem largely due to genetic factors (Nielsen 2006). It is 
also possible to estimate the correlations among the genetic components of 
the variances in the three outcome variables (Neale and Maes forthcoming). 
These calculations suggest that the genes responsible for the heritability 
of the three measures constitute partially non-overlapping sets, a pattern 
inconsistent with the view that school success reflects a single "academic 
ability” factor. Rather, different measures of school success might be 
associated with different sets of psychological traits, each with a partially 
independent genetic etiology. 


ML-based confidence intervals are considered superior to those based 
on estimated standard errors (Neale et al. 2003, Neale and Miller 1997, 
Turkheimer et al. 2003). 


Nielsen (2006) contains supplementary analyses and discussions on the 
following issues: (1) similarity of shared environments of DZ twins compared 
to full siblings and MZ twins compared to DZ twins, (2) alternative model 
estimates using MZ and DZ twins only, (3) effects of allowing assortative 
mating on parameter estimates, and (4) testing genetic dominance in the 
model for CPL. 


Although it stands to reason that environmental interventions outside the 
normal environmental range are more likely to be expensive, impractical or 
unethical. 


A major discovery of behavior genetics in the past two decades is that 
heritability of cognitive ability increases, and environmentality declines, from 
childhood to adulthood (Plomin and Petrill 1997). 


The same logic would apply to identification of a specific property of the 
environment (shared or unshared) with a large effect on the trait. 
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Culture in Animals: 
The Case of a Non-human Primate Culture of Low 
Aggression and High Affiliation 


Robert M. Sapolsky, Stanford University and the Institute of Primate Research, 
National Museums of Kenya 


Introduction 


Philosophers often consider what it is that makes us human. For biologists 
considering the same, the answer is often framed in the context of what are the 
key differences between humans and other animals. 

Numerous such differences have been suggested, but over recent decades, 
there has been a consistent shrinking of that list. Thus, humans are no longer 
thought of as being unique in the ability to construct and use tools (with these 
skills now demonstrated in both primate and non-primate species [McGrew 1992]). 
Other species have a sense of self (as shown with the famed "mirror tests" [Gallup 
1970]), as well as a "Theory of Mind" that encompasses the recognition that other 
individuals have different information than oneself (Tomasello and Call 1997). Other 
species are now known to communicate symbolically (e.g., vervet monkeys have 
vocalizations where the relationship between signal and meaning is arbitrary, and 
where some vocalizations serve as synonyms). A strong argument can be made for 
the idea that other species can be so grieved as to fall into the affective state that 
we term depression. And we can no longer claim to be the only species that kills 
its own, or that does so in organized groups (Goodall 1986). 

One vestige of human uniqueness still often cited by anthropologists is 
culture. However, this notion has been challenged in recent years with numerous 
demonstrations of “culture” in other species, particularly primates. In this review, 
| will first consider what the term means to animal behaviorists, and theories 
about how cultural transmission can occur in other species. | will then review 
some of the most striking examples of non-human culture. Most of the examples 
to be discussed are narrow in focus, concerning a single feature of tool use, 
food acquisition or consumption, or communication. | will then expand the focus, 
considering cases of non-human “social culture.” By this term, | mean where a 
particular style of sociality permeates an array of behaviors, with this assemblage 
of traits fulfilling the criteria for culture. | will examine in some detail a recent 
study presenting the most detailed case of transmission of a social culture in a 
non-human species, one involving a group of primates with a multi-generational 
culture of low rates of aggression and high rates of affiliative behaviors (such 
as social grooming, play or sitting in contact). Finally, | will tentatively suggest 
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that this “peaceable” non-human primate culture may be relevant to the human 
condition. 

As a simplification, | will use “animal behavior” to refer to more specialized 
terms (e.g., the study of animals in natural settings [ethology], or the study of 
only certain taxonomic classes of animals [e.g., primatology, ornithology]). 
Throughout, | will use the term “animal” to mean “non-human animal species.” 


What does “culture” mean to an animal behaviorist? 


Early studies concerning animal “culture” often used the less provocative term 
“social traditions” (often at the behest of reviewers). At present, while there is 
still frequent use of the term “tradition,” “culture” has become commonplace 
among animal behaviorists (McGrew 1998, de Waal 2001, Whiten 2005). 

Broadly, the term is used in a way that is derivative of definitions of human 
culture (Kroeber and Kluckhohn 1966, Cavalli-Sforza 2000, de Waal 2001), and 
concerns both behavior and the material manifestations of behavior. Amid the 
various definitions offered by animal behaviorists, all require that the behavior 
occur in only a subset of a species. This can include, "two or more individuals in 
a social unit," in populations but not an entire species, or if in an entire species, 
that it show regional variability (McGrew 1992, Fragaszy and Perry 2003, Whiten 
2005). Moreover, such variability cannot arise from genetic factors. Thus, a - 
distinctive behavior shared by family members because they share a particular 
mutation would not constitute a culture. In addition, such variability cannot arise 
from ecological factors. Thus, if two populations of a species differed as to the 
month that they foraged for some fruit, it would not be considered culture if this 
difference arose simply because the fruit ripened at different times in the two 
habitats. 

Another prerequisite of animal culture is that the behavior persists past its 
originators; transmission can be intra- or inter-generational and spreads as a 
function of kinship or proximity (Perry and Manson 2003). Moreover, the spread 
requires observation or experience of the behavior (Fragaszy and Perry 2003). 

What constitutes evidence for culture among animal behaviorists? Some is 
of a style that would dissatisfy a molecular biologist, but will seem familiar to a 
sociologist. This is because much of animal behavior is observational in nature, 
a feature that is considered to be at the ethical core of most field studies of 
wild populations. With those constraints, progress in the field is derived from 
detailed case reports (e.g., the multiple decades of data concerning the behavior 
of one chimpanzee population studied by Jane Goodall), or discerning patterns of 
behavior across populations of animals. Those circumstances typically preclude 
the independent replication that is the gold standard of laboratory science. 
However, active experiments are occasionally carried out, even in wild animal 
populations; such studies can reveal whether there is causality among some 
behavioral correlates, something often unavailable to the sociologist. 

Given these constraints, evidence often takes the form of observing a 
distinctive behavior(s) in a group or population, detecting signs of its spread to 
new individuals, and a plausible argument about why this pattern does not reflect 
genetic or ecological factors. The most convincing way to rule out ecology as a 
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factor is to identify a nearby population (i.e., one considered to be in the same 
micro-ecosystem) in which the behavior does not occur; the most convincing 
way to rule out genetics as a factor is to demonstrate the spread of the behavior 
to non-relatives (e.g., émigrés into the group). 

Even stronger evidence comes when the above is seen and, in addition, 
the origins of the distinctive behavior are identified. As reviewed below, one of 
the most famous cases of animal culture is renowned not just for its careful 
documentation over decades, but also because its "inventor" was identified. In 
such cases, there is still the possibility that ecological or genetic factors, while 
not causing the spread of the behavior beyond its originators, may nonetheless 
facilitate its spread. As such, the gold standard for culture is one that can only 
occur in an experimental setting. This is when an individual(s) is removed from 
a group, is taught a new behavior that is genetically and ecologically arbitrary, is 
returned to the group, and where that novel behavior is then shown to spread. As 
will be seen, this has only been demonstrated a few times. 


How might cultural transmission occur among animals? 


Models have been proposed to explain transmission of social traditions (Whiten 
et al. 1999, de Waal 2001, Galef 1990). As a specific example to focus on when 
presenting such models, | will consider tool use among chimpanzee populations. 
Specifically, | examine the transmission of the behavior of pounding hammers 
against anvils, made of wood or rock, in order to break open nuts (Boesch and 
Boesch 1983). 

Instructional models involve teaching of hammer use, where a skilled individual 
shapes the movements of the neophyte. The use of the term “instruction” has 
generated some controversy, with some animal behaviorists preferring "active 
behavioral modification" to bring about the change. While there have been 
suggestions of teaching (Boesch 1991), most in the field regard this as not yet 
shown (de Waal 2001). 

In observational models, young learn nut cracking by observing and copying 
adults, a phenomenon akin to the idea of "diffusion" in the social sciences. 
Evidence for transmission through a route such as this has been reported for 
chimpanzees (Whiten 1998). 

In facilitation models, proximity to adults and their hammers increases the likelihood 
of the young experimenting with hammers and independently deriving the skill. 

Having now touched on how animal behaviorists think about the subject of 
culture, | will review some examples of culture in animals. 


Some Examples of Animal Culture 
Material Culture 


Evidence for material culture (i.e., construction of tools or the use of pre-existing 
objects as tools) has been reported frequently. An iconic image in anthropology 
is that of wild chimpanzees constructing and using tools. As originally reported 
by Goodali (1986), chimpanzees construct “termite sticks,” by stripping straight 
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sticks of their leaves and branches. These are then inserted into termite or ant 
mounds and withdrawn; insects clinging to the stick are eaten. Other examples of 
chimpanzee tools include the hammers discussed above, the molding of clumps 
of leaves for sponging up water or cleaning oneself, and the use of branches 
of leaves as a fly whisk; there is even paleontological evidence of century-old 
chimpanzee tools (Mercader et al. 2002). There are striking examples of cultural 
differences in such tool use. In one celebrated report that considered nine well- 
studied chimpanzee populations across Africa, numerous aspects of tool use and 
construction were found to constitute culture (Whiten et al. 1999). As examples, 
nut-hammering is exclusive to one population of West African animals, whereas 
tool use with leaves is restricted to East African populations. 

Among the other apes, insect sticks are also constructed by orangutans, with 
the style differing between populations in Sumatra and Borneo. No evidence of 
material culture has been seen among gorillas or bonobos, or among monkeys. 
A recent study reports the first evidence of material culture in a marine mammal 
(Krutzen et al. 2005); in careful work that ruled out the possibility of genetic or 
ecological factors in the transmission, a population of bottle-nose dolphins was 
shown to use sponges, embedded on their snouts, as a means to better forage 
on the ocean floor. 

A rich literature exists concerning tool use among birds, most often among 
corvids (crows, jay, ravens and jackdaws). New Caledonian crows, for example, * 
tear leaves off of sticks in two distinctive ways to gain access to food, with 
regional variation that constitutes culture. Moreover, wild-born crows of this 
species in captivity use novel material made available to them and modify the 
construction steps to generate sticks (Emery and Clayton 2004). 


Food Acquisition, Preparation and Consumption 


Most cases of tool use and construction among animals is for food acquisition. 
A few cases of culture concern food preferences, where two populations have 
equal access to the same food sources, but differ as to their preferences. This 
is seen among gorillas (Tutin and Fernandez 1992), and orca whales (Randell and 
Whitehead 2001). 

Examples exist of cultural aspects of food preparation. Two famous cases 
concern Japanese macaques living on beaches of an island. In 1953, a female 
invented the technique of washing dirty food (potatoes and tubers) in water. 
Remarkably, three years later, she invented the technique of separating grains 
of wheat from beach sand by throwing mixed handfuls onto water, and then 
collecting the wheat, floating on the surface (Kawai 1965, de Waal 2001). 

These two inventions were studied intensively. Both were transmitted to other 
group members, intra- and then inter-generationally, such that within decades, 
the behavior was ubiquitous. Variations emerged, including washing food in sea 
water, rather than fresh water, dipping clean foods into sea water (plausibly cases 
of salting the food), and subordinate animals protecting their food from dominant 
individuals by using (and even digging) small separate pools of water. These 
behaviors were first observed among macaques given food by humans (raising 
the possibility that this represented some sort of observational learning from 
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humans rather than legitimate invention); however, it has also been observed in 
unprovisioned groups (Nakamichi et al. 1998). 


Communication 


Rich examples of culture concern animal communication. For example, among 
chimpanzees, some groups have a distinctive gesture of knuckle knocking on the 
ground to gain attention, whereas other groups slap at branches to accomplish 
the same. Japanese macaques show cultural variation in the type of vocalization 
made when grooming (Sakura 1989). Examples of communicative culture are 
also found among cetaceans, famed for their vocalizations. Stylistic differences 
in whale song occur among different orca populations (Randell and Whitehead 
2001). As another example, a population of humpbacks living west of Australia 
migrated into an eastern population. Within a relatively short amount of time, 
residents of the eastern population adopted the song style of these newcomers 
(Noad et al. 2000). 

Cultural differences in birdsong have been intensively studied. Songbird 
species are highly variable in singing behavior. For example some species 
have an innate propensity towards learning their species' song (i.e., if a young 
bird raised in isolation is presented with the song of its own and of a related 
species with equal frequency and volume, there is preferential acquisition of its 
species' song), whereas other species have equal potential for song acquisition. 
In some species, a song is learned irreversibly during a narrow developmental 
time window (i.e., "imprinting"), whereas other species acquire new songs 
seasonally (and, remarkably, acquire new neurons in brain regions relevant to 
song at such times). Moreover, there are region-specific "dialects" of song within 
the same species (Marler and Slabbekoon 2002). In one case of acquisition of a 
regional dialect, differential shaping of behavior occurs. Specifically, young male 
cowbirds initially produce an undifferentiated repertoire of songs, then learn to 
produce their local song; females reward production of the appropriate dialect 
with copulation solicitation displays (Smith et al., 2000). 


Other Examples of Animal Culture 


Additional and intriguing examples of culture occur among chimpanzees. One 
concerns a "social custom" related to social grooming. In one Tanzanian region, 
there is the unique occurrence of "hand-clasp" grooming (in which each animal 
raises an arm, grasping hands with the other). Different groups in this population 
clasp hands differently, and when animals transfer between groups, they are 
both influenced by and, in turn, influence the style of hand clasping in their new 
group (Nakamura and Uehara 2004). 

Another social custom is the "rain dance" first reported by Goodall in 1986. 
With the onset of a heavy rain, animals initiate an agitated and dramatic series 
of displays involving chest thumping, hooting, tossing of branches and charging 
through an open field. This behavior occurs among all East African populations, 
but is virtually non-existent in West Africa. 

This overview raises some important points. First, the cultural complexity 
revealed in some of these cases transcends mere population differences in 
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the performance of a single behavior. The meta-analysis of the nine different 
chimpanzee populations revealed 39 behaviors having cultural variants (Whiten 
et al. 1999). Importantly, each chimpanzee population displayed a unique profile 
as to which variant they performed for each of the 39 different behaviors. 

Second, there is some insight as to the mechanisms of transmission within 
group. Amid one report of instruction of tool construction among chimpanzees 
(Boesch 1991), most evidence points to observation and facilitation. The food 
preparation among Japanese macaques is a good example of this. An additional 
example concerns groups of captive chimpanzees (Whiten et al. 2005); in each 
group, the matriarch was removed and trained in one of a number of ways to 
open a food container. Following the return of matriarchs to their groups, the 
technique was shown to spread without explicit instruction. In two striking 
examples of observational learning, there are gender differences in both tool 
use among chimpanzees (Lonsdorf et al. 2004) and dolphins (Kreutzen et al. 
2005); in both cases daughters more readily master the process than do sons. 
"Facilitative" transmission was shown in a study of captive-born crows with no 
prior experience in tool construction. Half the birds gained access to material 
after watching a human generate the sticks, whereas the other half had no 
opportunity to observe. The two groups did not differ in their latency to construct 
tools (Kenward et al. 2005). 

Some of these studies also afford insights into how cultural transmission 
may occur between groups. Such spread can take the form of a variant being 
invented in a single population and then spreading to neighboring groups, with or 
without incremental modification, or can take the form of independent invention 
at multiple sites followed by local spread. The existence of variants only found 
in West or East African chimpanzee populations, with some gradients of local 
modification, supports the single-location model. In other cases, such as knuckle 
knocking, there seems to have been independent invention at multiple sites 
across the continent. 

Another aspect of cultural transmission is the likely role of a large brain. This 
is emphasized in studies of culture in primate and cetaceans with high brain- 
to-body ratios. Less appreciated is the fact that among birds, corvids not only 
present the most striking examples of avian culture but, along with parrots, have 
the largest brain-to-body ratios among birds, similar to that of chimpanzees. 

Finally, transmission is obviously intertwined with sociality. This is particularly 
clear among orangutans, who are the most solitary of apes. Demonstrating 
the importance of sociality, the most socially interactive orangutan populations 
display the most tool use (Van Schaik et al. 2003). 


Social Culture among Animals 


| now consider the rare examples of social culture among primates. As discussed 
earlier, by "social culture" | mean a style of sociality that permeates an array of 
behaviors, with this assemblage of traits fulfilling the criteria for culture. 

The existence of even rudiments of social culture among non-human primates 
runs counter to one branch of primatology while supporting another. The former 
branch is anchored in evolutionary biology, with the view that the social structure 
of a species is as much sculpted by natural selection as is, for example, the length 
of the neck or the efficiency with which the kidneys filter blood in that species 
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(Wilson 1975). This tradition has emphasized how a surprisingly large number 
of the roughly 150 primate species are either "pair-bonding" or "tournament" 
species (Wilson 2000). Pair-bonding species are characterized by the following 
traits. (1) Females and males mate for life, both parent and, plausibly related to 
the presence of two contributing parents, tend towards twinning; (2) both sexes 
are of approximately the same body size and lifespan, with minimal secondary 
sexual characteristics; (3) females choose mates on the basis of their parenting 
skills, males are relatively selective about whom they mate with, and levels of 
male-male aggression are low; and (4) monogamous pairs are often territorial 
and live in environments with evenly distributed resources, such as a rainforest. 
The marmoset monkeys of South America and the gibbons of Southeast Asia are 
classic pair-bonding primates. 

In contrast, among tournament species, (1) males are polygamous with a 
few males accounting for most matings, and where there is minimal paternal 
care; (2) males have larger bodies and shorter lifespans than females, and have 
conspicuous secondary sexual characteristics; (3) females tend to choose mates 
based on their competitive prowess, males are indiscriminate as to who they 
mate with, and levels of male-male aggression are high; (4) social groups are 
rarely territorial, and resources are unevenly distributed, such as in savannas. 
Savanna baboons of East Africa and rhesus monkeys of India are classic 
tournament species. 

Intrinsic in this approach is a deterministic view of social structure - in effect, 
discover a new primate species, note nothing more than larger canines in males 
than females (a marker of tournament species), and one can reliably predict the 
species' social structure. Thus, this tradition in primatology is resistant to the idea 
that the social structure of a particular primate group can differ markedly from 
that species' "norm," and that this social milieu can be culturally transmitted. 

In contrast, primatology traditions more rooted in ethology and comparative 
psychology emphasize individual differences and the cognitive complexity of 
primates. In that view, if primates are smart enough to have self-awareness 
(Gallup 1970), theory of mind (Tomasello and Call 1997), or a concept of algebraic 
transitivity (Cheney and Seyfarth 1990), and to have divergent temperaments and 
personalities (Clark and Boinski 1995), it is implausible to think that the social 
milieu of a primate species inevitable, with the intersection of evolutionary 
pressures, ecology and physiology as social destiny. 

Itis this latter tradition that has produced evidence for social culture in primates. 
The evidence has taken two forms, with both quite sparse until recently. Specifically, 
that there is variability in the social milieu of different groups of animals of the 
same primate species and that these different milieus constitute cultures. 


Variability in Social Milieu within Species 


Just as with chimpanzees, there have now been long-term studies of numerous 
macaque and baboon populations (both feral and captive). These indicate that 
within a single primate species, there can be striking variability in social milieu. 
For example, amid all the populations demonstrating the basic social structure 
typical of some species, there is variation between groups in levels of aggression, 
rates at which aggression is followed by reconciliation between antagonists, 
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rates of grooming, of coalitional behavior, of hunting, and so en (de Waal and 
van Roosmalen 1979, Sapolsky 1983, Gust et al. 1993, Wallner 1996). Some of 
the variability seems to be related to group size and/or ecological variables (e.g., 
amounts of rainfall predicting rates of hunting), but numerous cases seem to 
reflect idiosyncratic group differences in social milieu. 


Cultural Transmissibility of Social Milieu 


With the exception of a case discussed below, none of these group differences 
in social milieu have resulted in spontaneous multi-generational cultural 
transmission. Instead, two notable examples of transmission have arisen from 
artificial experiments. 

The first was striking because of the natural setting in which it occurred. 
Both anubis and hamadryas baboons are found in Ethiopia. These species differ 
dramatically in social structure. Anubis baboons are found in fluid polygamous 
multi-male, multi-female groups. Hamadryas baboons are found in more arid desert 
regions and have a fission-fusion social system. The fusion state involves hundreds 
of baboons sharing a rare resource (a watering hole, cliffs used for sleeping), while 
the fission state consists of small dispersed groups in which a male herds young 
females into his group, forcibly retaining them in a stable “harem.” 

In one experiment, a young female from each species was transplanted into. 
a group of the other species. Remarkably, the females adopted the novel social 
system within hours. In the case of the anubis female transplanted into the 
hamadryas group, assimilation was readily explained, in that she was quickly 
subject to the forceful herding behavior of a (far larger) hamadryas male. In the 
reverse situation, after initially attempting to attach herself to an anubis male, the 
hamadryas female shifted to the more open anubis system (Kummer 1971). 

An even more detailed example of transmission of a social milieu was carried 
out with captive rhesus macaques and stumptail macaques, closely related 
species with very different social styles (de Waal and Johanowicz 1993). Rhesus 
macaques are characterized by rigid “despotic” dominance hierarchies in both 
genders, high rates of escalated aggression and low rates of reconciliation after 
fighting. Stumptail macaques, in contrast, have “egalitarian” hierarchies with 
frequent reversals of dominance, low rates of escalated aggression and high 
rates of reconciliation. 

In this study, separate groups of rhesus and of stumptail juveniles of both sexes 
were merged for five months. Over the course of that time, rhesus juveniles took 
on the behavioral style of stumptails, ultimately developing equal reconciliation 
rates; moreover, this acquired social style persisted after the rhesus were returned 
to larger, all-rhesus groups. As a number of features of this fascinating study: (1) 
rhesus, rather than stumptails, assimilated a new social style despite the mixed 
groups (balanced for sex ratio) predominately containing rhesus; (2) the new 
social style among rhesus was not readily ascribed to their directly experiencing 
the more “relaxed” style of stumptails, as most social interactions were within, 
rather than between species; (3) rhesus did not merely assimilate the concrete 
features of stumptail behavior, as the two species have different gestures and 
vocalizations for reconciliation post-fights, and the increased reconciliation rate 
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among rhesus involved increased rates of gestures typical of their species; (4) 
the more relaxed and affiliative behavior among rhesus was not likely to represent 
"solidarity" in the face of another species, as the rhesus were not preferentially 
interactive with each other when returned to the all-rhesus groups. Instead, the 
authors suggested that the novel rhesus social style was an emergent byproduct 
of living in the more relaxed, less tense and aggressive social milieu of stumptails 
(de Waal and Johanowicz 1993). 


A Culture of Low Aggression and High Affiliation among Baboon 
Circumstances Giving Rise to this Culture, and its Behavioral Features 


Now, consider a particularly rich example of a unique social milieu established 
among wild primates and transmitted as a social culture. This case involved a 
troop of anubis baboons on the Serengeti of East Africa, studied continuously 
since the 1970s. "Forest Troop" was typical of this tournament species, with: 
(1) despotic male hierarchy, high rates of male-male aggression and aggressive 
displacement onto innocent bystanders, male rank fluctuating, predominately with 
the outcomes of fights; (2) a static and hereditary female dominance system; (3) 
life-long female residency in their natal troop and male transfer between troops at 
puberty; (4) high rates of female-female affiliative behaviors, low rates of female- 
male affiliative behaviors and non-existent male-male affiliation; (5) foraging for 
food in the open savanna for much of the day. 

In the early 1980s, Forest Troop slept in trees 1 kilometer from a tourist lodge. 
During that period, an open garbage pit was expanded at the lodge. This attracted 
an adjacent baboon troop (the “Garbage Dump" troop), which slept near the pit 
and foraged almost exclusively there. 

During that time, many Forest Troop males went to the garbage pit at dawn for 
food. While not necessarily high-ranking males in Forest Troop, they were among 
the most aggressive, most likely to be involved in dominance interactions, and 
the least likely to be involved in female-male affiliative interactions. These traits 
were a prerequisite for this garbage “raiding,” because it required them to forgo 
early-morning socializing in their troop (the time of day with, for example, the 
highest rates of female-male grooming), and to be able to compete with the 
larger number of Garbage Dump males for access to refuse. 

In 1983, a tuberculosis outbreak occurred, originating from infected meat in 
the dump. From 1983-1986, most Garbage Dump animals died as did all refuse- 
eating Forest Troop males (46 percent of males); no other Forest Troop animals 
died (Tarara et al. 1985, Sapolsky and Else 1987). 

These deaths in Forest Troop changed the usual 1:1 female-male ratio to 2:1. 
Because of the selectivity of the deaths, by 1986, troop behavior had changed 
markedly, as only less aggressive, more socially affiliated males survived. The new 
social milieu involved a “relaxed” dominance style (less displacement aggression 
directed at females and subordinate males by dominant males), more tolerance 
by dominant males of occasional dominance reversals by subordinates, more 
inter-sexual grooming and closer proximity among animals. 
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Most remarkably, by 1993, no adult males remained from: 1986; all adult 
males had joined the troop following that time. Thus, the distinctive milieu that 
emerged during the mid-1980s because of the large number of male deaths was 
adapted by the next cohort of males who had transferred into the troop. This 
case of transmission of a social culture has been maintained by new generations 
of transfer males up to the present. 


Physiological Correlates of this Unique Culture 


The long-standing goal of the study of the Forest Troop had been in the realm of 
stress physiology. This work has examined what the social rank, patterns of social 
affiliation and personality among baboons, have to do with patterns of stress- 
related disease (e.g., levels of stress hormones and cholesterol, blood pressure, 
immune function) (Sapolsky 1982, 1993; Sapolsky and Share 1994; Sapolsky 
and Spencer 1997). (These studies required the anesthetization of animals for 
physical examinations.) A distinctive profile of the physiological correlates of rank 
had emerged from these studies, and the Forest Troop was striking in how it was 
an exception to these patterns. 

One example concerned levels of steroid hormones secreted by the adrenals 
in response to stress, called glucocorticoids. (The primate version of a GC is 
cortisol, also known as hydrocortisone.) GCs typify the two-edge nature of the. 
stress-response. When secreted in response to a classic stressor (e.g., a rapid 
sprint to avoid a predator), they are highly adaptive; they help divert energy from 
fat to exercising muscle, increase blood pressure and inhibit long-term building 
processes (such as growth, tissue repair or reproduction) that can be deferred. 
However, if secreted chronically, such as with the sustained psychosocial stress of 
social subordination, GCs are central to the emergence of stress-related disease, 
increasing the risks of adult-onset diabetes and hypertension, and impairing 
growth, tissue repair and fertility (Sapolsky et al. 2000). Thus, ideally, basal levels 
of GCs should be minimal, and elevated basal levels reflect chronic stress. 

In the context of that profile, individual differences in GC levels among 
primates are readily understood. (1) Across a wide number of primate species, 
the rank in the dominance hierarchy associated with the greatest amounts of 
physical and psychosocial stress and with the fewest social outlets for coping 
(€.g., grooming) has the highest basal levels of GCs (Abbott et al. 2003); among 
anubis baboon males in stable dominance hierarchies, these are subordinate 
individuals (Sapolsky 1990); (2) independent of rank, anubis males with the least 
socially “intelligent” competitive styles (i.e., worst at discriminating between 
neutral and threatening interactions with rivals) display elevated basal GC levels 
(Sapolsky and Ray 1989, Ray and Sapolsky 1992): (3) independent of rank and 
competitive style, anubis males with the lowest levels of social affiliation have 
the highest basal GC levels (Sapolsky et al. 1997). 

In contrast to this endocrine picture of males in typical anubis troops, elevated 
basal GC levels were not observed in subordinate males in Forest Troop. An 
additional physiological trait unique to Forest Troop involved “benzodiazepines,” 
a class of anxiety-reducing molecules. A natural BDZ occurs in the brain (but its 
chemical structure has yet to be characterized), while synthetic versions include 
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Valium and Librium (Ninan et al. 1982). In a typical baboon troop, subordinate 
males had far higher levels of BDZ signaling in their brains; this was interpreted 
as reflecting a neurochemical compensation for the high rate of anxiety-provoking 
social circumstances to which they are exposed. In contrast, subordinate males 
in Forest Troop had the same low level of BDZ signaling as do dominant males 
(Sapolsky and Share 2004). 


Mechanisms of Cultural Transmission in Forest Troop 


Animal behaviorists have considered various models of cultural transmission, 
and there are some insights available as to which apply to the Forest Troop case. 
Potentially, transmission could involve instruction, in which resident animals treat 
transfer males contingently. Specifically, affiliative behavior on the part of the 
adolescent would be reinforced (with grooming, for example), while aggressive 
behavior would be punished. However, as with virtually all examples of primate 
culture, there was little evidence for instructional models. 

There was also little evidence for a "self-selective" model. Adolescent baboons, 
during the process of transferring, may briefly join different troops before settling 
into one long-term (Pusey and Packer 1988). This raises the possibility that it is 
less aggressive and more affiliative adolescents who join the troop. However, 
this was not the case, in that patterns of aggressive behavior of recent transfers 
into Forest Troop did not differ from those of recent transfers into other troops. 

Resident males in Forest Troop were less aggressive with each other than in 
other troops, but were just as aggressive to recent transfers as were resident 
males in other troops. Thus, if transfer males adopted the troop culture by 
observation, it involved observation of interactions of resident males with each 
other or with females, rather than of interactions between resident males and 
themselves. 

In contrast to the treatment of transfer males by resident Forest Troop males, 
resident females treated transfers differently than in other troops. Forest Troop 
females were more affiliative; there was a shorter latency in this troop than 
in others for resident females to first groom transfer males, and to first give 
a species-typical greeting. Moreover, when compared to other females, Forest 
Troop females groomed recent transfers at a higher rate, and sat in closer 
proximity to them. 

Thus, the treatment by Forest Troop females of males during the first six 
months post-transfer — approximately the time needed for transfer males to adopt 
the Forest Troop behavioral style — was the most distinctive feature of the cultural 
transmission. It should be recalled that, because of the tuberculosis deaths, 
the troop's female-to-male ratio doubled, a skew still maintained, for unknown 
reasons. This raises the possibility that the distinctive treatment of transfer 
males by Forest Troop females was a mere consequence of the preponderance 
of females. However, similar ratios occur in other baboon troops that did not 
resemble Forest Troop behaviorally (Seyfarth 1976, 1978; Noe 1994; Strum 1982; 
Bercovitch 1985). Instead, the key appears to be the preponderance of females 
coupled with the particular behavioral style of the resident males. 
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These findings suggest a three-step scenario for the transmission of the Forest 
Troop culture is an emergent phenomenon. First, the preponderance of adult 
females and the distinctive behavioral style of resident males result in females 
being spared much of the stress (arising from, for example, displacement 
aggression) that characterizes their lives in other troops. Second, as a result of 
this lower level of stress and tension, resident females are more likely to “take a 
chance” and to be spontaneously affiliative towards new transfer males. Finally, 
amid the atypically high rates of social affiliation proffered by resident females, 
the behavior of transfer males, by default, comes to resemble that of resident 
males. Current work with this troop examines the validity of this model. 


Conclusion: Implications for Human Primates 


Research in animal behavior that has revealed the complexities of the social 
structures, communication, cognition and affect of animals forces us to recognize 
realms of continuity between humans and other species. The occurrence of animal 
culture demands the same. The term, as applied to animals, is not remotely as 
rich as for humans. We are unique in the magnitude of, and interrelations among 
facets of culture, in the fact that our cultural innovations can build upon prior 
innovation, and in the fact that the cultural features in a society can, collectively, 
constitute a core idea of symbolic importance (Tomasello 2000; Whiten et al. * 
2003). Nonetheless, animal culture, no matter how relatively impoverished, must 
challenge our human sense of uniqueness. 

This is particularly the case for a demonstration of transmission of an animal 
social culture, in terms of the amount of behavioral plasticity implied. But these 
findings have even more important implications for humans. Rhesus monkeys 
have long been recognized to have highly despotic dominance hierarchies. The 
harem as the building block of hamadryas society is considered to have evolved 
under the fierce selective pressure of desert life. And savanna baboons have 
been, for decades, the textbook example of a highly stratified and aggressive 
primate social system. And yet a hamadryas female can adapt to a novel social 
system within hours; rhesus can become as reconciliative as stumptails within 
months and maintain those behaviors afterward; savanna baboons can, over 
the course of a few years, develop a novel social milieu and transmit it multi- 
generationally. 

Currently, the life sciences are dominated by a highly reductive model — the 
gene and genome as the ultimate levels of explanation of life. This view has 
prompted the largest and most costly project in the history of human biology 
- sequencing of the human genome - and in the history of the life sciences 
— the subsequent sequencing of the genomes of many other species. The 
genetic paradigm heavily influences how biologists now think about behavior, 
including human behavior. Enormous attention is focused on comparison of 
mono- and dizygotic twins, and of twins reared together and apart. Psychiatry 
journals are replete with papers examining major genetic contributions to clinical 
depression, antisocial behavior and risk-taking behavior (cf, Kendler 2005). And 
in considering the relatively miniscule number of genes that differ between 
humans and chimpanzees (perhaps 2 percent), few scientists are interested in 


Culture in Animals * 229 


the genetics of why humans can consume far more meat than can chimpanzees 
without succumbing to cardiovascular disease (a different gene for a cholesterol 
transport protein) or why humans write symphonies and chimpanzees don't. 

For scientists steeped in that reductive mindset, particularly the most 
unsophisticated versions of it, there is a constant pull to frame primate, including 
human, social behavior in the context of something akin to genetic determinacy. 
As a fairly uninformed outsider to sociology, | sense that there are schools within 
that discipline that would also view many aspects of human social systems 
as unchangeable (albeit for different reasons than behavioral geneticists). 
Nevertheless, the extraordinary social plasticity of these non-human primates 
demonstrates that we humans, whether viewing ourselves from the perspective 
of a geneticist, a sociologist or anything in between, have no reason to claim that 
our social behaviors, including our less palatable ones, are set in stone. 
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Desperate Measures: 
Strikes and Wages in Post-Accord America 


Jake Rosenfeld, Princeton University 


Abstract 

Using previously unreleased data on nearly every authorized work stoppage that occurred 
between 1984 and 2002, this paper tests whether the positive wage-strike relationship held 
following the breakdown of the post-war labor-capital accord. Unlike in decades past, 
these findings indicate a complete decoupling of the wage-strike relationship. Even in those 
industries and regions where unions remain relatively institutionalized, strikes no longer 
increase aggregate worker pay. Strike activity also fails to narrow worker wage dispersion at 
the industry-region level. The findings highlight the need for rethinking existing theoretical 
models on strike activity and wages in this era of capital dominance. 


Introduction 


On Oct. 11, 2003, 21,000 grocery workers walked off the job following a prolonged 
contract dispute with their southern California employers. Soon 49,000 additional 
area grocery employees joined them when rival grocery chains locked them out in 
a telling display of capital solidarity. Five months later, management approached 
the union with a contract proposal that would freeze wages at previous levels, 
ratchet up employee contributions to health coverage, and institute a two-tier 
compensation and benefit package giving new hires much lower wages and 
leaner benefits than existing employees. Union strike funds had run dry, whereas 
management - with a combined net revenue topping $100 billion — showed no 
signs of further compromise, and workers were eager to return to their jobs. The 
union accepted management's offer and nearly 90 percent of union members 
voted to ratify the deal. The long-idled grocery workers returned under the new 
contract in early March 2004. 

Is this the new face of strikes? The strike was once thought of as labor's 
most potent weapon in resolving disputes over compensation matters. Research 
indicates that during periods when the labor movement is highly institutionalized, 
strikes (or the threat of strikes) often work to boost average employee wages, 
especially in the highly unionized industrial core (Kalleberg, Wallace and 
Raffalovich 1984; Rubin 1986; Rubin 1988; Wallace, Leicht and Grant 1993). In 
the United States, the institutionalization of unions within an overall framework of 


A previous version of this paper was presented in the session The Changing Workplace and 
Worker Response at the annual meetings of the Eastern Sociological Society, Washington, 
D.C., March 17-20, 2004, and at the FIEF Trade Union Institute for Economic Research 
seminar series, Stockholm, Sweden, June 1, 2005. The author thanks the Federal Media- 
tion Conciliation Service for data as well as FIEF seminar participants for helpful advice 
on an earlier draft of the paper. Two anonymous Social Forces reviewers also provided use- 
ful guidance on the manuscript. Direct correspondence to Jake Rosenfeld, Office of Popula- 
tion Research, 284 Wallace Hall, Princeton University, Princeton, NJ 08544-2091, (314) 
954-6738. E-mail: jhrosenf@princeton.edu. 


O The University of North Carolina Press Social Forces, Volume 85, Number 1, September 2006 


236 * Social Forces Volume 85, Number 1 * September 2006 


collective bargaining began with the state's passage of the Wagner Act in 1935 
(see McCammon 1993), and solidified during the post-World War Il years, the era 
of the labor-capital accord. While restricted from challenging fundamental power 
relations within the workplace, labor retained the right to strike over economic 
issues throughout the early postwar years (Wallace, Rubin and Smith 1988). 
The growth of the union movement following the war provided labor with the 
organizational base from which workers could successfully strike against their 
employers for higher wages. 

By the mid-1980s, the labor-capital accord had completely broken down 
Reagan's hard-line anti-union agenda, including the replacement of the striking 
PATCO workers in 1981, signaled a new political stance toward organized 
labor (Wallace, Leicht and Raffalovich 1999; but see Farber and Western 2002). 
Capital's increasing mobility further eroded a labor movement predicated on 
firm-specific bargaining. Unionization levels fell to post-war lows (Clawson and 
Clawson 1999). Faced with a crumbling organizational base, the strike became an 
increasingly rare weapon used by unions to settle contract disputes over wages. 
Figure 1 plots union densities and strike frequencies between 1984 and 2002. 
While the overall unionization rate among private sector, full-time employees fell 
by 40 percent in a little less than two decades, the drop in strike frequency was 
even more precipitous, declining by more than two-thirds in 19 years. 

In this paper | investigate whether the positive relationship between strikes ` 
and worker wages that existed during the immediate postwar decades persisted 
throughout the final years of the 20th century. Utilizing new data on nearly every 
authorized strike that occurred between 1984 and 2002, the analysis tests whether 
. Strike frequency works to boost median pay for non-professional workers, net of 
a variety of other factors including union membership levels. The detailed strike 
data allow for an investigation into the relationship between strikes and worker 
wages at the disaggregated industry-region level.*2 Moreover, the data allow 
for specific examinations into the impact of strikes in those industries where 
existing research and theory suggest they should have the largest effect. Finally, 
| test whether strike activity in highly unionized industries and locales operates 
to narrow workplace wage dispersion net of its affect on average worker pay. 
The model specifications utilized here provide the most stringent test of the 
relationship between strikes and wages to date. 

If the recent California grocery strike is any indication, strikes no longer have 
a beneficial influence on worker wages. However, given that the grocery strike 
occurred in an industry lacking an established union presence, such a dispute 
may not be representative of walkouts within the (relatively) highly unionized 
industrial core. Rubin's pioneering work on the topic revealed that strikes serve 
to boost worker pay only within sectors with a well-established, organized 
worker presence (Rubin 1986). The data used in her analysis end in 1976, prior 
to the complete dismantling of the labor-capital accord. Data limitations have 
previously prevented researchers from adequately investigating whether the 
strike remains a potent weapon in highly unionized sectors (or any sector, for 
that matter) during the post-accord period. It may be that strikes remain powerful 
weapons when confronting capital over-compensation issues, perhaps due to 
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their growing infrequency. Or it could be that the strike remains'a beneficial tool 
for workers fighting wage disputes only within a narrow confine of industries. Or, 
the California grocery strike could well prove representative of the new face of 
strikes: desperate, defensive measures deployed mainly to retain the status quo. 
This paper seeks to answer those questions. 


Unions, Strikes and Worker Pay 


Beginning in the mid-1930s, the institutionalization of the collective bargaining 
arrangement granted unions a hard-fought legitimacy in lobbying for workers' 
rights and more equitable pay practices. Since that period, labor market 
researchers have consistently demonstrated that unionization increases average 
wages among otherwise comparable workers (see Kuhn 1998 for an overview of 
the recent literature). Cross-sectional analyses reveal that among observationally 
equivalent workers, unionization boosts wages around 15 percent (Robinson 
1989). Various analyses using panel data have largely corroborated this finding 
(Freeman and Medoff 1984, Card 1996, Kuhn and Sweetman 1998). More recent 
research has shown that at the industry-region level, unions continue to boost 
median worker pay, despite the rapid loss of union members during the past few 
decades (Rosenfeld forthcoming). 

If anything, the 15 percent wage boost attributable to union membership ` 
understates the role unions play in raising worker pay. Numerous analyses have 
demonstrated the existence of union threat effects operating in certain locations 
and industries (Leicht 1989, Corneo and Lucifora 1997). That is, industries worried 
about the threat of unionization (often in highly organized locales and industries) 
may attempt to preempt the threat by raising wages to union levels. 

During the period of the labor-capital accord, strikes - representing labor's most 
potentially disruptive weapon in its fight over compensation issues - operated in a 
similar manner. While research has consistently shown that union members earn, 
on average, higher wages than their non-union counterparts, labor analysts have 
also demonstrated that union members who have participated in a strike earn, on 
average, more pay than their non-striking, unionized counterparts. For example, 
using a 1979 employee survey, Wallace, Leicht and Grant (1993) find in some of 
their models that workers who had directly participated in a strike had higher 
wages than comparable non-strikers. In a unique study focusing on the printing 
and publishing industry, Kalleberg, Wallace and Raffalovich (1984) argue that 
strike frequency sometimes served to increase labor's share, especially during 
the first few decades following World War Il. Ashenfelter, Johnson and Pencavel 
(1972) find evidence of a strike-union-wage link in the U.S. manufacturing industry 
between 1914 and 1963, with the greatest effect coming in the Depression 
years. The wage-strike link has been corroborated in cross-national research as 
well: Paldam (1983) provides evidence of a positive and reciprocal wage-strike 
relationship between 1948 and 1975 in 17 OECD nations. 

Strikes may boost non-striking workers' pay as well, much like union threat 
effects. A firm, anxious about an increasingly agitated shop floor, may cave to 
wage demands by union members hinting at a walkout. Or an employer may think 
twice about resisting its union's call for increased pay after witnessing a prolonged 
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strike drain a neighboring firm's revenue. Indeed, the positive relationship between 
strikes and worker compensation that Rubin (1986) finds must in part be due to 
these threat effects. Rubin's analysis operates at a broad sector level: Following 
Lewis (1963), she divides the economy into union and non-union sectors and 
investigates the effect of annual strike rates on these sector-specific wages. 
Controlling for macroeconomic indicators (unemployment, inflation, recession 
year, etc.) and union membership rates, Rubin finds that during the period of the 
labor-capital accord (1949-1976) strike activity boosts aggregate worker wages 
within the union sector, where the institutionalized presence of labor can weather 
the direct impact of a dispute on workers' well-being. However, even within the 
high union sector, strikes remain a sufficiently rare event that in any given year 
the number of non-striking workers dwarfs the number of strikers. Because we 
know strikes do not perform miracles on the wages of workers who walk the 
picket line (in fact, they shrink the wages of the striking worker while he walks the 
picket line), the positive, significant influence of strikes on compensation must in 
part be due to threat effects. 

If indeed strikes no longer work to boost worker wages, then we must expand 
our understanding of the role of union decline on worker wage stagnation 
during the recent past. Most of the relevant research has focused on declining 
membership levels and their impact on wages; results from these analyses may 
suggest another pathway through which the diminished fortune of the American 
labor movement has exacerbated wage inequality. 


Measuring Strikes in the Post-Accord Period 


Reagan's hostile policies toward labor extended all the way down to government 
spending on labor research. Starting in 1982, budget cuts prevented the Bureau 
of Labor Statistics (BLS) from collecting data on strikes involving less than 1,000 
workers.? Past work on the topic indicates that such large labor disputes - strikes 
involving 1,000 or more workers comprise less than 10 percent of all strikes — 
are non-representative of more typical work stoppages along a number of key 
dimensions (Skeels, McGrath and Arshanapalli 1988). According to this research, 
no size-selected sample of strikes in the postwar period results in a representative 
sample of all work stoppages. The authors conclude with the admonition: "Be 
aware that sampling by strike size may be hazardous to your intellectual health." 
(Skeels, McGrath and Arshanapalli 1988:589) 

To a great degree labor researchers have heeded their warning and quantitative 
work on strikes in the post-accord period has all but disappeared (for exceptions, 
see Morris 2003, Dixon and Roscigno 2003). In the only cross-industry, time- 
series analysis of the impact of strikes on wages during the 1980s, Wallace, 
Leicht and Raffalovich (1999:see especially p. 284) conclude that strikes have 
become virtually irrelevant for the economic standing of workers. Circumventing 
the biased BLS strike data, the authors utilize monthly Current Population Survey 
(CPS) data on the number of employees involved in work stoppages and calculate 
a measure capturing the percentage of working time lost due to strikes on a 
quarterly basis. While their analysis is suggestive of a decoupling of the wage- 
strike relationship in the post-accord period, and represents an admirable attempt 
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to avoid the problems with recent BLS work stoppage data, the reliance on CPS 
strike data presents two main shortcomings. 

First, data on the percent time lost due to strikes in a given period conflates 
two somewhat separate issues: strike frequency and strike size. While the two 
are obviously correlated, the influence of an atypically large strike might bias 
strike activity estimates.^ Since most of the research on the topic — including 
Rubin's seminal piece (1986) — make use of strike activity measures, a direct 
comparison with the prior findings in the literature requires a work stoppage 
frequency variable. Most importantly, Wallace et al. (1999) are unable to analyze 
their data by region or industry, and their analysis operates at the level of the 
entire private sector. Because strikes occur so infrequently nowadays, it would be 
quite surprising if they were to find a significant effect of strikes (either positive 
or negative) on all private sector wages. 

To avoid the data problems inherent in either the BLS or CPS estimates of strike 
activity, this paper utilizes data provided by the Federal Mediation Conciliation 
Service (FMCS) on nearly every authorized labor dispute occurring in the United 
States between 1984 and 2002. The data contain information on strike location, 
strike size, strike duration and strike date. In addition, the FMCS provided 
employer and product information that enable me to link each dispute with its 
appropriate three-digit Standard Industrial Classification (SIC) code.® Such a rich 
dataset allows for an analysis of the effect of strikes on worker wages at the ` 
detailed industry-region level. 

Moreover, the data allow for varying definitions of the strike. Given firm and 
intra-firm bargaining practices in the United States, sometimes two or more 
unions will represent different sectors of the same firm. If the unions decide 
to strike, both sets of workers may walk off simultaneously, but technically the 
action still represents two distinct work stoppages. Or, two sets of unionized 
workers at two different companies in the same town may coordinate and stage 
a walkout. Again, given firm- and union-specific bargaining practices, such a 
walkout would technically constitute two different strikes. The detailed work 
stoppage data utilized in the following analyses allow for more flexible definitions 
of a strike. As a preliminary test, | created a strike measure that collapses intra- 
firm work stoppages, and another that collapses intra-industry work stoppages 
that occur in the same vicinity. The following analyses rely on a firm-specific 
strike measure - that is, | count as one strike when two or more sets of unionized 
workers stage a simultaneous walkout from the same firm. However, all models 
were run according to varying strike definitions with similar substantive results. 

Regardless of the definition, the frequency of work stoppages declined 
dramatically during the past few decades. During this time of labor retreat, wages 
for non-professional, non-managerial workers remained rather stagnant. Figure 2 
plots annual strike frequencies (introduced earlier in Figure 1) and median full-time 
worker compensation between 1984 and 2002. The late 1990s' uptick in worker 
pay still left the typical worker little better off than he was in the mid-1980s. 

Research has shown that despite the steep drop in overall unionization levels, 
unions continue to boost worker pay in sectors where their presence remains 
substantial (Rosenfeld forthcoming; see Card 1998 for union effects on wages 
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in the public sector). What has yet to be sufficiently demonstrated is whether 
strikes fit the same pattern. The results of the analysis could confirm Wallace 
et al.’s contention that strikes have no impact on worker wages during the post- 
accord period. The typical strike during the 1980s and 1990s could simply be 
labor's last-ditch attempt to restore the status quo in the face of wage and benefit 
erosion. Or, strikes could still positively affect the wages of those unionized 
workers directly involved in a dispute, yet represent such a rare occurrence that 
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non-striking firms no longer fear the threat of a stoppage. Either scenario could 
result in a non-significant relationship between strikes and wages at the industry- 
region level. On the other hand, strikes could still exert a significant and positive 
effect on worker pay, especially — as Rubin's work suggests (1986) - within those 
industries where labor retains a strong (if crumbling) institutional base. Given the 
paucity of these industries in the post-accord period, such a finding is likely to be 
detected only in data disaggregated by industry and region. 

The paper also investigates the role of strikes in reducing intra-industry pay 
differentials among workers. While work stoppages may no longer increase 
average worker pay, they may serve to narrow pay gaps within particular regions 
and industries. Swenson (1989), in his discussion of the Swedish and German 
trade labor movements, emphasizes how distributional issues, and not simply 
demands to increase worker's share, often trigger strike activity. It is plausible 
that in this era of labor retreat, strikes operate to narrow worker pay dispersion 
without affecting labor's average wages. 

Finally, it may be that the labor-capital imbalance has grown to such an extent 
that strikes now operate to push worker wages downward. Cramton and Tracy 
(1998) show how, following on the heels of the PATCO dispute, employers' use of 
replacement hiring during work stoppages grew substantially. In another article, 
Cramton and Tracy (1992) provide data on labor's increasing use of the holdout in 
contract disputes, perhaps fearing the growing consequences of engaging in an ` 
actual strike. The permanent firing of striking workers, coupled with the hiring of 
replacements at lower wage rates, could dampen overall industry pay. Similarly, 
the threat of permanent replacement may force striking employees back to work 
for lower wages, providing another mechanism through which strikes could 
operate to lower worker pay. Ideally, one could test for the effect of replacement 
hiring and holdouts on strike outcomes. However, no existing dataset contains 
comprehensive information on either tactic, but a negative strike-wage finding 
could hint at their growing use in labor disputes. 

Past research has established that the relationships between strikes and 
wages are historically and spatially contingent. What no research has adequately 
done is establish how these relationships changed following the breakdown of 
the labor-capital accord. 


Data 


With the exception of the airline industry (which falls under the jurisdiction of the 
National Mediation Board), parties unable to resolve contract differences must 
file with the FMCS prior to any work stoppage, as specified in section 8d of the 
National Labor Relations Act (NLRA). The strike data provided by the FMCS contain 
detailed measures on the number of workers involved in a dispute, duration, 
location, date, employer name, union name and company product. One possible 
research strategy would be to link these firm-level strike data to firm-level data on 
pay rates and unionization levels. Unfortunately, no such comprehensive dataset 
exists for the striking FMCS firms. Moreover, a firm-level analysis would only 
capture the costs or benefits of striking for the striking workers. This analysis 
uses detailed industry-level workforce information, available in the CPS. The 
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aggregated nature of the data captures both the direct effects of a strike on 
wages, as well as threat effects operating at the industry-regional level. 

What these FMCS data exclude are so-called wildcat strikes, or strikes 
unauthorized by the union, and strikes over union recognition itself. Evidence 
suggests that the frequency of these types of strikes has been on the decline for 
decades; in 1980, they made up only 13 percent of all strikes, down from nearly 
two-thirds of all work stoppages in the late 1930s (McCammon 1990, Snyder 
1977). Whether or not the proportion of wildcat strikes or stoppages over union 
recognition has increased during the recent period of labor's retreat is beyond 
the scope of this paper. (See Byrne and King 1986 for a discussion of wildcat 
strikes in the manufacturing industry.) 

In order to categorize the data, | used a four-step process that made use of 
company information, product information and the union involved, assigning 
each strike a 3-digit SIC code. The first step in the categorization process involved 
assigning SIC codes to those strikes whose company names appear in Dun & 
Bradstreet's Million Dollar Directory. The directory categorizes major companies 
in the United States by primary four-digit SIC code, along with any secondary 
codes that may apply. However, many large companies span several industries, 
and many of the firms in the strike database had either changed names, dissolved 
or otherwise do not appear in the D&B listing. Step two in the process involved 
linking specific product searches to SIC codes. If the product listing was missing 
or ambiguous and the firm did not appear in the D&B database, | then would 
search for any company information on the internet, making use of the detailed 
addresses of the companies contained in the FMCS data. Finally, for the handful 
of strikes that eluded classification, union information was used to assign a three- 
digit SIC code.? Around 10,000 strikes had to be categorized in all. 

The assigning of three-digit SIC codes helped prevent coding errors. Given 
that the models presented below only make use of one- and two- digit SIC codes, 
any minor coding errors at this stage would be eliminated after aggregation. For 
example, if a striking firm is misclassified as occurring within the metalworking 
machine industry (three-digit SIC code 354) when it properly belonged to the 
general industrial machine industry (three-digit SIC code 356), it will not matter 
in an analysis where all firms producing machinery are grouped together. Initially 
assigning firms to one- or two-digit industry codes would have increased the 
likelihood that a mistake at the stage of SIC designation would ultimately affect 
the findings of the analysis. 

Due to changes in employment levels within and across industries and regions 
over time, a raw measure of total strikes by industry would present a misleading 
measure of changes in strike activity. To control for employment levels and 
shifts, this analysis uses CPS Merged Outgoing Rotation Group (MORG) files 
to construct industry and region-specific measures of employment. Regional 
groupings disaggregated by state are presented in Appendix 1. MORG files 
were preferred over the March CPS files due to the greater sample size of the 
MORG datasets. With a few exceptions, employment numbers are restricted to 
non-professional, non-managerial workers — ¡.e., those workers at risk of a work 
stoppage. However, included in the dataset are a few private-sector professional 
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Table 1: Strike Rate and Union Representation for Selected Industries, All Years 


Mining .015 19.1 
Construction .010 235 
Manufacturing .020 24.7 
Transportation .006 35.7 
Wholesale trade .009 8.1 
Retail trade .001 6.4 
Finance, insurance, real estate .001 4.1 
Services .003 8.0 
Highly Unionized Sample 
Industry Strike Rate % Unionized 
Mining .015 19.1 
Construction .010 23.5 
Lumber/wood/furniture .029 18.7 
Metals .050 34.3 
Machinery .020 18.4 
Transportation equipment .012 47.3 
Toys/miscellaneous manufacturing 012 12.8 
Food/tobacco .019 30.3 
Textiles/apparel .005 14.6 
Paper publishing .011 22.9 
Chemicals .020 21.2 
Transportation .006 32.3 
Communications .004 40.6 
Utilities/sanitary services .009 40.8 


Notes: Strike rate represents total number of strikes per thousand non-professional, non-mana- 
gerial workers. Means are weighted by cell counts. 


workers that regularly strike. | then constructed a specific strike rate for each 
industry-region cell by merging the CPS employment figures with the strike data. 
The strike rate represents the total number of strikes occurring in a particular 
industry and region in a particular year divided by the total number of non- 
professional, non-managerial employees in that industry-region cell for that year. 
| also constructed various measures of average strike severity, such as average 
strike size and duration, in order to test whether the effect of strike frequency on 
wages differed after controlling for average size and length in each industry-region 
grouping. Such measures also allowed for specific examinations of whether the 
various measures of strike activity commonly found in the literature (such as total 
number of person days lost or a raw strike frequency measure) affected wages 
differently during the post-accord period. 

Table 1 presents strike rates and unionization levels averaged across years and 
regions for the major industries used in the analysis. Unionization data also come 
from the CPS's MORG files. Since my primary analysis is at the industry-region 
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level, | aggregate each year of MORG individual-level data into a final dataset with 
cell entries for each industry-region grouping. 

As shown in Table 1, the main analysis divides industries into their one-digit SIC 
code designation. For the highly unionized sample, the data are disaggregated 
down to a combination of two-digit SIC codes and CPS codes (see bottom half 
of Table 1 for industry listing). 

Besides strike rates and unionization levels, other covariates that could affect 
changes in non-professional worker pay over time include a range of demographic 
characteristics such as the racial makeup of the workforce, gender composition, 
potential experience (age) and educational levels. Human capital theory posits 
that increasing education levels should translate to wage gains for both workers 
and managers. Controlling for levels of education, the relegation to lower-paying 
tasks within occupations should depress median pay in high-minority or high- 
female cells. To control for these effects and others, the final dataset includes 
items capturing the percentage of minority workers within each industry-region 
grouping, the percentage female, educational attainment scores (separate items 
for high school dropouts, high school graduates, and those with any college 
experience), and age measures. This information also comes from the MORG 
files. Table 2 presents descriptive measures of the control variables used in the 
analysis at three points in time in order to provide a general picture of recent 
workforce trends. This table is averaged across all SIC-1 digit industry divisions. 


Table 2: Weighted Means of Workforce Demographic and Wage Measures, Full Sample 


1984 1993 2002 
Non-professional, Non-managerial Workers 
Median weekly wage* $500.16 $480.46 $510.00 
Percent unionized 20.4 15.0 11.9 
Percent female 42.4 43.8 43.1 
Percent minority** 17.3 20.7 26.5 
Percent — HS 19.5 14.6 14.2 
Percent HS 69.6 43.7 39.9 
Age 30.7. 341 38.6 
Managers and Executives 
Median weekly wage $894.53 $871.66 $952.04 


* All wages are expressed in constant 2002 dollars. 


As displayed above, female representation among the non-professional, full- 
time workers held steady during the period analyzed in this paper. The percentage 
of minority workers grew substantially, largely mirroring immigration trends. 
Average educational levels of non-professional workers also grew: By 2002, the 
vast majority of non-professional, non-managerial workers had completed high 
school, and a large segment had some college experience (calculated by adding 
the high school dropout and graduate categories and subtracting from 100). Table 
3 below presents similar measures for the restricted, high union sample. Wages 
(expressed in all tables in constant 2002 dollars) in these industries remain higher 
than in the economy as a whole, but have yet to recover to their mid-1980s levels. 
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Table 3: Weighted Means of Workforce Demographic and Wage Measures, High Union Sample 
O TEE 


1984 1993 2002 

Non-professional, Non-managerial Workers 

Median weekly wage* $597.60 $561.47 $586.45 

Percent unionized 32.9 26.0 21.4 

Percent female 28.5 27.7 23.4 

Percent minority** 17.4 20.7 26.3 

Percent < HS 23.6 17.9 17A 

Percent HS 68.6 48.6 45.3 

Age 36.9 38.2 39.7 
Managers and Executives 

Median weekly wage $1081.45 $1030.66 $1097.57 


* All wages are expressed in constant 2002 dollars. 
** Minority category restricted to African-Americans and Hispanics. 


The high union sample is somewhat older, and heavily male in comparison to the 
full sample dataset. 

In addition to the control variables presented above, some of the models also 
include a region-specific unemployment measure, included to capture the general - 
effect of business cycle trends on median worker pay. Regional unemployment 
numbers come from the Bureau of Labor Statistics (BLS). Finally, given that 
historically unions concentrate in monopoly sectors where employers can 
more easily pass on the union wage premium to consumers, | include a median 
managerial pay measure for each cell. The measure should partially control for 
the effect of higher average firm rents on wages, which, if systematically related 
to strike frequency, could bias the strike rate effect. 

The dependent variable used in all models (also created by aggregating 
individual-level MORG data) is the median weekly wage for non-professional, 
non-managerial, full-time workers in each industry-region cell. Wage measures 
are expressed in constant 2002 dollars. As Table 2 reveals, the educational gains 
of workers did not translate into much wage growth during the 1980s and 1990s. 
Even the late-1990s economic boom barely raised non-professional worker pay 
above its mid-1980s level. 

Relying on the CPS for wage data in an investigation of strike effects may 
be problematic because of the possibility that the striking workers — and their 
wages - captured in the CPS are non-representative of all workers involved in 
work stoppages. However, the CPS remains the national source for wage data, 
and there exists no real alternatives for a study of this sort. The use of the CPS's 
MORG file does help alleviate some of the concern raised above. Unlike relying 
on a single month's data, the MORG file contains a quarter of each monthly CPS 
sample. This means that (1) no one month is oversampled in the database, which 
could bias the findings if that particular month contained an abnormally large or 
small number of striking workers and, (2) the total sample size is three times as 
large as a typical CPS monthly survey. 
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Each cell in the data represents one industry-region grouping for a particular 
year (for example, wholesale trade workers in the South in 1992). For the main 
models using the data organized by SIC-1 digit codes (results presented in Tables 
4 and 5), the final sample size is 576 (eight industries by four regions by 18 years). 
For the restricted, high-union dataset, the final sample size is 1,008 (14 industries 
by four regions by 18 years). Sample sizes do vary across models due to differing 
model restrictions, such as whether or not the data are limited to cells with union 
representation rates above 25 percent. 


Method 


To best estimate the impact of strikes on weekly worker pay, l fit a fixed-effects 
model that includes a separate effect for each industry-region and a time effect 
for each year (in this case, year ranging between 1984 and 2002). For industry / 
at time f, 


A , 
Yi =a phy, +X D Te, 


where a, is the fixed industry-region effect, v, is the fixed year effect, x’, 
represents the exogenous variables in industry-region / at year t, and e, is the 
error term. The fixed effects control for unmeasured differences between 
industry-regions that remain constant over time, and control for forces that affect 
the various industry-regions equally, but vary over time. For example, the fixed 
industry-region effects control for unmeasured industrial characteristics such as 
average industrial concentration or the lower wage scales attributable to regional 
differences, whereas the year fixed-effects capture the influence of a macro-level 
economic event like the recession of the early 1990s. 

Thus, in its most stringent specification, the model used in this analysis 
only allows for variation across time within specific industry-regions. Such a 
specification effectively controls for many of the dominant factors that influence 
wages. Unlike past analyses, the data and model utilized in this paper directly 
target the effect — if any — of strike rates on worker wages within particular 
industries and regions. Also, unlike most research on the topic, the strike rate 
measure used in the models presented below is lagged one year. Lagging strike 
activity accomplishes both statistical and theoretical goals. For starters, using a 
lagged measure partially controls for potential endogeniety issues.’ Aside from 
this, specifying a lagged strike rate makes practical sense. A strike always lowers 
the wages of workers involved during the strike itself, and a strike typically occurs 
following contract expiration (see McCammon 1994). The net gain of strikes on 
wages - if any - will not be felt until the strike has concluded and the employees 
begin working under a new contract. An unlagged strike activity measure 
captures both the wage loss due to the strike itself, along with any long-term 
work stoppage effects (whether gains, boosts or no effect). Consequently, the 
use of an unlagged strike measure confuses short- and long-term effects of labor 
disputes. Because the FMCS strike data begin in 1984, in order to allow for the 
lagged strike rate item, all other covariates in the model begin in 1985. 
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Models 1 through 3 in Table 4 present results based on SIC one-digit divisions 
(see Table 1 for industry breakdowns). The sample analyzed here includes both 
highly unionized industries, such as mining, as well as those industries historically 
lacking an institutionalized labor presence, such as retail trade. | estimate these 
models in order to provide a baseline comparison with Rubin's (1 986) and others’ 
work on this subject. Like nearly all the macro-level work on this topic to date, 
Models 1 and 2 in Table 4 lack industry effects. Instead, the data cover all major 
industries in the private sector. 

Model 1 in Table 4 above includes a linear year term, a percent unionized 
variable, demographic controls, managerial pay and a regional unemployment 
measure. The large negative coefficient (significant at the .001 level) on the 
female representation variable indicates that a percent increase in the number 
of females is associated with a $3.82 drop in average weekly compensation. 
Higher age and education levels work, as expected, to boost median weekly 
compensation. Also, in Model 1 a percent increase in union representation 
increases median worker pay by about $1.63 per week. The union coefficient is 
significant at the .001 level, revealing that, according to this model, during the 
1980s and 1990s unions continued to increase average worker compensation 
despite labor's eroding institutional base. : 

Somewhat surprisingly, in Model 1 the strike rate coefficient — representing 
the effect of the main variable of interest in the analysis — is significant and 
negatively associated with median worker pay. There is reason to distrust this 
finding, however. The linear year term does not adequately account for non-linear 
time trends in worker pay between 1985 and 2002, nor does the model control 
for industry effects. In Model 2, | introduce a separate period effect, which 
controls for all unmeasured time-varying non-linear effects on wages. Following 
this further specification, the regional unemployment measure is insignificant, as 
the introduction of a year fixed-effect adequately controls for annual changes in 
unemployment rates.? After the inclusion of the year fixed-effects, the strike rate 
coefficient remains significant and negative. Without industry effects, however, 
it is hard to discern whether work stoppages actually operate to bring wages 
down or whether strike activity remains concentrated in those industries which 
experienced the greatest wage losses over the past few decades. 

Model 3 adds an industry-region effect, which renders the strike rate coefficient 
non-significant. The inclusion of industry-region fixed effects boosts the R2 and 
straightens out some of the puzzling coefficients found in Models 1 and 2. Now, 
the percent minority variable negatively affects worker wages, lowering them 
by $3.83 per week for every percent increase in African-American or Hispanic 
representation. High school dropouts earn significantly less than individuals 
with some college experience, while high school graduates are only marginally 
worse off than the college educated. The lack of a strong college vs. high school 
graduate wage premium is due in part to the occupational composition of the 
sample, which is limited to non-professional, non-managerial workers. Other 
covariates trend in the expected directions, and the positive and significant union 
effect remains despite this further specification. 


Table 4: Baseline Models Predicting Median Weekly Pay 
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Model 1 Model 2 Model 3 
Lagged Strike Rate -525.31* -1002.77*** 162.17 
(242.31) (194,24) (145.33) 
Year -6,82*** _ - 
(.67) - - 
Regional Unemployment -5.29** 1.65 .69 
(1.59) (2.09) (1.08) 
Managerial Pay as: AE 05 
(.02) (.01) (.01) 
Worker Variables: 
Proportion Unionized 162.76*** 241.93*** 609.17*** 
(25.24) (20.27) (39.01) 
% Female -381.94*** -405.97*** -232.10*** 
(14.94) (11.84) (50.96) 
% Minority 182.06*** 30.19 -383.20*** 
(30,46) (26.11) (48.75) 
Less than HS -744.11*** -743.62*** -440.31*** 
(44.77) (35.31) (52.66) 
HS Degree -27.17 -392.94*** -38.47° 
(19.90) (28.00) (21.95) 
Age 140.5325 102.88*** 41.38** 
(22.57) (18.23) (13.83) 
Age' 21.2078 Age -.47** 
(.31) (.25) (.18) 
Year fixed-effects No Yes Yes 
Industry-region fixed-effects No No Yes 
Constant 11971.29*** -1348.21*** -321.15 
(1507.05) (337.65) (261.40) 
N 576 576 576 
Number of parameters 12 29 60 
R? .88 .93 .98 


Note: Standard errors are in parentheses. Dependent variable in all models is the weekly 
median wage for non-professional, non-managerial workers. Wages in constant 2002 dollars. 


Models weighted by cell counts for each industry-region. 
p«.10 *p«.05 **p«.01 ***p«.001 


Following Rubin (1986), Models 1 and 2 of Table 5 divide the economy into 
union and non-union sectors. As Rubin (1986) discusses, a large union presence 
in an industry can better insulate its workers from the effects of a strike, and 
strikes in historically high-union industries tend to be more costly to employers. 
The FMCS data indicate that from the mid-1980s to 2002, more than three- 
quarters of all strikes occurred within the highly unionized sector. (See Rubin 
1986 and Lewis 1963 for a discussion of the union and non-union sectors and the 
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industries that comprise them.) Restricting the data to high-union industry-region 
groupings provides an initial test of whether, following Rubin's Work (1986), the 
positive relationship between strikes and worker wages exists only in the highly 
unionized sector of the economy. 


Table 5: Fixed-Effects Models Predicting Weekly Worker Pay, Full Sample 
EE E i 
Model 1 Model 2 


A A O E, p ss 
Lagged Strike Rate 96.69 -22.78 
(275.06) (177.61) 
Median Managerial Pay .04* .02 
(.02) (.02) 


Worker Variables 
Proportion Unionized 282.76* 431.99*** 
(113.99) (66.37) 
% Female -92.89 -381.08*** 
(64.58) (84.08) 
% Minority -406.71** — -199.71** 
(60.69) (69.80) 
Less thanHS -42442**  -475.11** 
(68.20) (75.93) 
HS Degree  -112.69* MO" 
(36.74) (33.96) 
Age 58.05; -7.97 
(16.06) (27.33) 


Age’ -.67** 15 
(.21) (.36) 
Year fixed-effects Yes Yes 
Industry/region fixed-effects Yes Yes 
Restricted to low union industries Yes No 
Restricted to high union industries No Yes 
Constant -623.78' 671.96 
(324.29) (495.17) 
N 288 288 
Number of parameters 42 42 
R .98 DI 


Note: Standard errors are in parentheses. Dependent variable in all models is weekly median 
wage for non-professional, non-managerial workers. Wages in constant 2002 dollars. Models 
weighted by cell counts for each industry/region grouping. 

p<.10 *p<.05 *p<.01 ***p<.001 


Both models in Table 5 above include industry-region and year fixed-effects. 
Again, the data are divided into one-digit SIC code industry divisions. Model 1 of 
Table 5 corroborates Rubin's claims about the low union sector, as no significant 
relationship exists between strikes and wages. Union presence still operates to 
boost median weekly worker pay in these low union industries, although the effect 
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is rather modest. Model 2 of Table 5 provides the first detailed, updated test of 
whether strike activity continues to influence wages positively in the unionized 
sector in the post-accord period. As shown, after controlling for industry and 
region effects, the model reveals a complete de-coupling of the strike/wage 
relationship. The strike rate coefficient is negative and not significant. However, 
it could be that the broad one-digit industry groupings obscure strike effects only 
detectable at a more disaggregated level. 

For the models presented in Table 6 below, the data are restricted to 14 
industries within the highly unionized sector (see Table 1 for industry breakdowns). 
Model 2 in Table 6 further restricts the sample to those industries with union 
representation rates above 25 percent, and Model 3 includes only those industry- 
region cells with unionization rates above 25 percent that registered a strike.? 
That is, Model 3 restricts the data to those industry-regions with a large union 
presence that actively engages in work stoppages (roughly 17 percent of the 
high union cells did not register a strike in a particular year). The results of these 
models should reveal if any positive relationship still exists between strike activity 
and worker wages, given that the data are restricted to those industries where 
the labor movement remains relatively institutionalized and active. 

Model 1 of Table 6 provides no evidence for the existence of a wage-strike 
relationship during the post-accord period. The strike rate coefficient is negative 
and not significant. Model 2 restricts the sample to those industry-region 
cells with union representation rates above 25 percent. Again, the strike rate 
coefficient fails to reach statistical significance, and remains negatively signed. 
Other covariates trend in the expected directions, although the education and 
age items are no longer significantly related to worker pay. | tested various 
measures of strike activity (person days lost, total number of workers involved, 
raw strike counts, etc.) for their effects on weekly worker wages. None of these 
variables — either inserted into the model alone or in combination with other 
strike measures — had a significant influence on worker pay (results not shown 
but available upon request). 

If any positive wage-strike relationship exists in post-accord America, Model 3 
of Table 6 should capture it. The sample for Model 3 only includes those industry- 
region cells with a well-institutionalized union presence that regularly strikes. Still, 
despite this final specification, the strike rate coefficient remains negative and fails 
to achieve significance. All other covariates in the model operate as expected. 

Finally, it may be that while strike activity fails to increase average pay, 
it reduces intra-industry and region wage differentials. As other research has 
argued, in certain institutional settings workers will strike not to increase labor's 
share, but to reduce wage dispersion within the working class (Swenson 1989). 
Given restrictive top coding and sample size issues for the workforce measures, 
the data used in this analysis are not ideal for testing wage inequality effects. 
For these reasons, the models in Table 7 below provide rough estimates of strike 
activity's impact on the 80/20 weekly pay ratio for workers, as opposed to other 
measures of inequality, such as the more commonly used gini coefficient or 
90/10 pay ratio. 

All models in Table 7 are restricted to those industries within the highly 
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Table 6: Fixed-Effects Models Predicting Weekly Worker Pay, High Union Sample 


Model 1 Model 2 Model 3 
Lagged Strike Rate -47.55 -144.08 -149.82 
(78.32) (106.78) (111.74) 
Median Managerial Pay 1022 .03* .02 
(.01) (.01) (.01) 
Worker Variables 
Proportion Unionized 258.02*** 280.99*** 340.05*** 
(30.27) (42.93) (46.87) 
% Female -293.13*** -287 45*** -264.58*** 
(32.38) (55.16) (63.16) 
% Minority -126.32** -161.30** -139.97* 
(31.15) (56.77) -75.21) 
Less than HS -236.69*** -81.77 -15.59 
(40.30) (70.74) (74.78) 
HS Degree 25.86 -1.86 28.17 
(17.57) (26.26) (30.73) 
Age -5.29 33.53 20519 
(13.95) (21.89) (22.33) 
Age’ Apa -.39 -.28 
(.18) (.28) . (.28) 
Year fixed-effects Yes Yes Yes 
Industry/region fixed-effects Yes Yes Yes 
Restricted to highly unionized industries No Yes Yes 
Restricted to cells where strike rate > 0 No No Yes 
Constant 641.36* -116.22 -54.25 
(264.52) (420.86) (429.00) 
N 1008 494 426 
Number of parameters 82 65 65 
R | .96 .94 .95 





Note: Standard errors are in parentheses. Dependent variable in all models is weekly 
median wage for non-professional, non-managerial workers. Wages in constant 2002 
dollars. Models weighted by cell counts for each industry/region. 

*p<.05 *p<.01 **p <.001 


unionized core with union representation rates higher than 25 percent. Somewhat 
surprisingly, the first model of Table 7 reveals a positive association between strike 
activity and worker pay inequality, although the effect is only marginally significant. 
Greater unionization, as well as higher median worker pay, works to reduce within 
industry-region wage inequality. The next two models tease out exactly how strike 
levels might increase the 80/20 weekly pay ratio. The second model indicates that 
strike activity has no effect on the upper distribution of worker wages, meaning 
that work stoppages must lower the wage rate of those in the bottom portion 
of the pay distribution. Indeed, the final model in Table 7 shows a significant, 
negative effect of strikes on the 20^ percentile of worker wages. 

Caution should be taken from reading too much into the results from Table 7, 
however, given the instability of the 80^ and 20^ percentile measures, as well as 
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Table 7: Fixed-Effects Models Predicting Worker Pay Inequality, High Union Sample 


80th 20th 
80/20 Percentile Percentile 
Lagged Strike Rate .88° 66.28 -153.43* 
(.49) (122.05) (73.44) 
Median Managerial Pay x 100 -.00 .87 .83 
(.00) (1.29) (.78) 
Worker Variables 
Median Worker Pay x 100 -.07** 64.20*** 42.24** 
(.02) (5.51) (.03) 
Proportion Unionized -.03** -126.73* 92.98** 
(.21) (51.36) (30.90) 
% Female -.18 -78.35 -30.28 
(.26) (64.87) (39.04) 
% Minority -.06 -106.17 -27.48 
(.26) (65.36) (39.33) 
Less than HS -.78* -329.13*** 17.40 
(.33) (80.81) (48.62) 
HS Degree -.47*** -144.18*** 42.64* 
(.12) (29.95) (18.02) 
Age -.33"* -118.23*** 25.06* 
(.10) (25.04) (15.07) 
Age'x 10 .04** 1510 -3.17 
(.01) (3.22) (1.94) 
Year fixed-effects Yes Yes Yes 
Industry/region fixed-effects Yes Yes Yes 
Restricted to highly unionized industries Yes Yes Yes 
Constant 9.54*** 2974.69*** -361.46 
(1.93) (480.07) (288.87) 
N 494 494 494 
Number of parameters 66 66 66 
R: Re .96 .93 


Note: Standard errors are in parentheses. Dependent variable in first model is the 80/20 weekly 
worker pay ratio. Dependent variable in second model is the worker 80th pay percentile, and 
dependent variable in third model is the worker 20th pay percentile. Wages expressed in con- 
stant 2002 dollars. Models weighted by cell counts for each industry/region grouping. 

Ips eelere 


the rather modest strike effect. Predictions indicate that if no strikes had occurred 
in the 1984-2002 period, workers in the 20^ percentile of the wage distribution 
would have enjoyed a 1 percent boost in their compensation. The results from 
Table 7 do indicate no positive role of strikes in reducing wage inequality in post- 
accord America. While unionization still seems to boost lower-paid worker wages 
(a percentage point increase in the unionization rate boosts low-paid workers’ 
wages by about a $1.00 per week), strikes do not seem to be an effective weapon 
in labor’s arsenal in reducing inequality. Combined with the results from Tables 
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4 through 6, these findings suggest that at the industry-region level, strikes have 
had no significant effect on worker pay in the post-accord period. 

To ensure all findings are robust, Appendices 2 and 3 explore variations of the 
main model specifications. It is well established that, net of other contributing 
factors, larger firms offer their employees higher wages, although the effect 
seems to be diminishing over time (Hollister 2004). If average firm size within 
particular industry-region groupings is systematically related to strike activity, the 
exclusion of a firm size measure could bias the effect of strike activity on wages. 
The CPS MORG data do not contain firm size information; however, beginning 
in 1989, the March CPS survey includes an item asking about employer size. As 
a check of robustness, all models presented in the analysis were rerun on the 
truncated (1989-2002) time series with the firm size item: results do not differ 
and the main models are presented in Appendix 2. 

Nonstationarity is often a problem in time-series analyses. Given that the 
predictor of interest — lagged strike rates — is strongly trended in many of the 
panels, | quantitatively explore nonstationarity in four ways, as displayed in 
Appendix 3. For ease of comparison, Model 1 of Appendix 3 replicates the main 
model for the high union sample (Model 1 from Table 6). Next, | reran the main 
model of interest with a lagged dependent variable on the right-hand side of the 
equation, effectively changing the dependent variable from a wage level to a wage- 
change rate. In Model 3, | specify a model with an AR-1 autocorrelation structure 
with panel-specific errors. In Model 4, | drop those panels that fail the augmented 
Dickey-Fuller (ADF) test for the presence of a unit root (Model 4 in Appendix 3). 
Finally, Model 5 in Appendix 3 displays the results of a first-differenced model, a 
particularly conservative test. In none of these models is the strike rate coefficient 
significantly related to worker pay; in all of the models unionization significantly 
and positively drives median worker compensation upward. 

As a final check of robustness, union*strike rate interaction terms were 
included in models using the full sample and the highly unionized sample. In 
neither of the models was the interaction term significant. 


Discussion 


The analyses presented use information on nearly every authorized strike in 
America since 1984 to test whether the previously established positive link 
between strike activity and worker compensation held following the breakdown 
of the postwar labor-capital accord. The results reveal that the positive influence 
of strike activity on wages - whether restricted to the entire workforce, the 
highly unionized sector or to those industries within the highly unionized sector 
that maintain the strongest union presence - has disappeared. Indeed, in sharp 
contrast with previous research (see, for example, Rubin 1986), strikes no longer 
affect workers' wages even in those industries with a well established union 
presence. Strike activity also fails to narrow worker pay dispersion. 

This paper expands upon the existing literature in numerous ways. Reagan- 
era BLS budget cuts severely restricted strike data availability, and consequently, 
research on the topic has nearly disappeared. Since the cuts, the little research 
done on strikes has been limited either to individual case studies (see Dixon 
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and Roscigno 2003 for an example) or restricted to those conflicts involving 
1,000 or more workers. (See Skeels et al. 1988 for a discussion of why these 
disproportionately large stoppages are non-representative of typical labor 
disputes.) Wallace, Leicht and Raffalovich's (1999) research represents the only 
investigation of the strike-wage relationship in the post-accord era. However, the 
authors' reliance on a person days lost strike measure as well as their data's high 
level of aggregation render their strike related findings preliminary. 

Given the drastic decline in strike activity over the past 20 years, any strike effect 
will probably be felt at a highly disaggregated level. The analyses in this paper 
take advantage of previously unreleased data on nearly every work stoppage in 
the United States between 1984 and 2002. By linking this detailed strike data to 
workforce information at the industry-region level, these analyses represent the 
most comprehensive and rigorous tests of the strike-wage relationship during 
the 1980s and 1990s. Indeed, given the level of disaggregation, these analyses 
represent the most comprehensive test of strike-wage effects found in the 
literature for any period. 

Past research has shown that the effectiveness of strikes depends on labor's 
ability to protect its employees from the adverse consequences of a work 
stoppage (Rubin 1986). In order to emerge successfully from a strike, workers 
require adequate strike funds and the institutional muscle to prevent capital from 
hiring replacement employees, coupled with a supportive public willing to hold the 
strike line (if necessary). The analyses presented in this paper hint at a breakdown 
of these factors - even in those models restricted to the most highly unionized 
industries. The findings also reveal another pathway through which declining 
union power has exacerbated wage inequality in the recent past, with strikes no 
longer working to raise average worker pay at the industry-region level. 

However, given the level of data aggregation, these tests cannot detect the 
typical outcome of an individual strike.'' It may be that in this era unions call for 
strikes only when faced with the most desperate circumstances, such as a plant 
closing, rapid downsizing or severe pay cuts. Even a “successful” strike under 
such adverse circumstances is only likely to restore the status quo and unlikely 
to translate into real wage growth. 

Or, strikes may now be such a rare event that even a strike settlement which 
leads to wage increases for the workers involved fails to reverberate across 
similar firms, leading to a minimal overall effect on wages. That is, given their 
relative infrequency, employers may no longer feel threatened by strikes. Finally, 
the evidence presented in the paper may indicate an increased use of lockouts by 
employers. No work stoppage data distinguishes between strikes and lockouts, 
so the lack of a positive strike effect may reflect the increased willingness of 
employers to lock out their workers in order to secure wage cuts and other 
concessions (Cramton and Tracy 1998). 

It may also be that the dominant type of strikes in the period of the labor-capital 
accord — strikes over economic issues - have fallen in frequency compared with 
other types of walkouts. Making use of earlier BLS strike data, Wallace (1989) 
calculates that for the entire period of the labor-capital accord, strikes over wages 
rarely comprised less than half of all walkouts, rising to over two-thirds by the 
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early 1980s. Strikes about job security issues (layoffs, tenure arrangements 
and the like) remained relatively infrequent from the 1950s onward. However, 
since the complete break-up of the accord, strikes over job security issues 
may now loom larger as workers scramble to retain their positions or seniority 
arrangements in the workplace. No data on strikes during the past few decades 
distinguishes strike type, and a lack of a wage effect may be partly driven by a 
growth in disputes over job security. (No theoretical reason exists to suggest a 
recent growth in other union-authorized strike types, such as union organization 
walkouts.) 

In reality, a combination of factors probably contributed to the erosion of the 
strike-wage relationship in the 1980s and 1990s. While this paper establishes 
that at the industry-region level — even industries and regions once firmly within 
the union sector - the positive strike-wage link has been severed, future research 
should focus on a representative sample of individual firms in order to reveal 
exactly what occurs following a work stoppage. Only time-series firm-level data 
can bring to light the precise effects an individual work stoppage has on wages 
for the employees of the striking firm. 

One implication from this paper is the need to rethink our understanding of 
strike predictors in the United States. That is, given the seeming lack of a wage 
benefit attributable to strikes, why do they still happen? It may be that the strike 
predictors of old (including business cycle conditions such as the unemployment . 
rate, inflation rate, as well as organizational factors, such as levels of unionization) 
may no longer explain strike activity all that well. One factor that seems increasingly 
important is the role of health care coverage and costs. Indeed, the California 
grocery strike of 2002 erupted in part over planned employer cutbacks in health 
care coverage. Future research on these issues should pay particular attention to 
changing strike predictors and specifically the role rising health care costs play in 
triggering worker walkouts. 

The current analysis omits a formal analysis of strike effects on fringe benefits 
for three main reasons. One, data on health insurance coverage (a major 
component of the non-wage benefit package) has been issued only intermittently 
since the early 1980s and only for a fraction of the total CPS sample, posing large 
sample size problems for an aggregate analysis. Second, this paper is situated 
within a body of literature that either excludes fringe benefits altogether or folds 
them into an umbrella compensation measure. Third, the available anecdotal 
evidence suggests that cutbacks in fringe benefits have been an employer-driven 
phenomenon, rendering an analysis of their relationship to strikes more suited to 
an investigation of strike predictors, rather than strike outcomes. 

The results of this analysis also point to the need for future theoretical 
work on macro-level trends in strike activity within industrial nations. Previous 
research posits an inverted U-shaped curve where strike activity increases during 
the early stages of working class organization, levels off as labor becomes 
firmly institutionalized within the polity, and then declines as the working class 
shifts its focus to the political arena (Korpi and Shalev 1979, 1980). Other work 
suggests that strike activity will decrease with the expansion of the welfare state 
and welfare benefits. If the market regains its place as the site of distributional 
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struggles following welfare benefit cutbacks, strike activity should increase 
(Hibbs 1978, see especially p. 175). While perhaps illustrative of tendencies 
in certain European nations, neither scenario effectively illuminates trends in 
strike activity in post-accord America. In contrast to Hibb's contention (1978), 
if anything, welfare entitlements were scaled back in the 1980s and 1990s, and 
wages were increasingly left prey to purely market forces, yet strikes continued 
to decline. Korpi and Shalev (1979, pp. 179-80) argue that declining strike rates 
depend on a highly organized working class with a well-established, secure 
place in the polity. Strikes are an increasingly rare event, but not because of a 
robust labor movement content to battle capital in the political realm. The current 
American labor movement is — historically speaking - neither well-organized nor 
a particularly effective political actor. To adequately capture trends in America, 
future theoretical work must contend with a withering labor movement, a 
retreating welfare state and the decline of the strike as a common weapon used 
in industrial disputes. 

Future macro-level quantitative work should pay close attention to the 
ongoing rumblings within the American labor movement itself. Just as the 
political and economic transformations of the early 1980s triggered dramatic 
changes in the relationships between the union movement, strikes, and wages, 
current re-alignments may lead to new directions in union focus and effects. For 
example, new research suggests that certain unions actively avoid NLRB election 
campaigns whenever possible, viewing them as cumbersome, time-consuming 
affairs that grant management the upper hand (Sharpe 2004). Innovative unions 
may increasingly seek ways to avoid the confines of federal regulations when 
considering sending their members out on strike, or the strike itself may be 
replaced with new and less easily defeated bargaining strategies. 


Conclusion 


Representing labor's most potent weapon in its arsenal in decades past, the 
typical strike was called during periods of contract re-negotiation to disrupt the 
productive process at the site of production itself. Substantial strike funds coupled 
with the damage prolonged work stoppages caused employers — especially in 
capital-intensive industries — enabled a well-established labor movement to use 
the strike to achieve wage gains for its members. Research detected these gains 
even at extremely high levels of data aggregation. 

Labor's steep slide during the past few decades has resulted in post-war lows 
in union representation. Nonetheless, union presence within an industry still 
translates into higher wages compared to industries and regions lacking labor 
representation. Strike activity, on the other hand, no longer positively influences 
worker pay at the industry-region level. Strike activity also fails to translate into a 
narrower wage distribution for workers within particular industries and regions. 
Unions may call fora strike on such rare occasions that whatever effect the dispute 
has on the workers directly involved fails to influence industry and regional pay 
scales. Employers — emboldened by Reagan’s firing of the air traffic controllers in 
the PATCO strike — may increasingly use lockouts or other strategies to squeeze 
wage concessions from their workers. Union leaders, sensing the tides shifting 
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toward capital in their long-running campaign to secure strong benefit packages 
and wage rates for their members, may simply refrain from striking except In 
the most desperate situations. Whatever the specific circumstance, the general 
effect of the once-powerful strike has withered away, rendering an already uneven 
battle that much more lopsided. 


Notes 


d 


What date constitutes the turning point in labor-capital relations is an em- 
pirical matter that depends on the particular issue under investigation (union 
membership, strike frequency, labor's representation within the political are- 
na, etc.). However, by the mid-1980s — the starting point for this investigation 
— labor is in full retreat regardless of what measure of worker strength one is 
interested in investigating. 


Like other data on strikes, these data do not distinguish between union-di- 
rected walkouts and employer lockouts. Implications of this data limitation 
are discussed. 


Prior to the cuts, the BLS kept detailed records of every work stoppage in- 
volving at least six employees. 


A helpful Socia/ Forces reviewer indicates that the bivariate correlation be- 
tween strike frequency and working time lost due to strikes is .44. 


SIC codes were preferred over the NAICS for two main reasons. First, given 
that the NAICS only began in 1997, the older Dun & Bradstreet guides utilized 
for strike categorization only classify companies by SICs. Because more re- 
cent guides classify companies by both NAICS and SIC codes, | decided that 
unnecessary error could be introduced into the strike coding by converting 
all the pre-1997 SIC codes into NAICS, instead of simply relying on the con- 
tinuous SIC codes. Second, as indicated in the text, step three of the strike 
categorization process involved internet searches for company information. 
Many of the older strikes involve companies that no longer exist, and often 
the information on these defunct companies only includes their SIC code. 


The author is aware of the danger of this practice, what with, for example, 
the UAW now organizing graduate students. The practice was only used for 
a few strike classifications, with knowledge of the common cross-industry 
unionization practices. 


Unless the strike and wage data are collinear over time. Given the extremely 
low bivariate correlations between wages and the various strike measures, 
this was not a major concern. 


While the coefficient seems large, the actual magnitude of the effect on 
wages - given the strike rate variable's distribution — is actually quite small. 
A -1000 coefficient, for example, translates into about a $1.00 decrease in 
weekly wage compensation. 
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9. The inclusion of year effects in the subsequent models always renders the 
regional unemployment rate non-significant. It is therefore dropped from the 
models presented in Tables 5, 6 and 7. 


10. The six industries with union membership rates above 25 percent averaged 
across all years include metals, transportation equipment, food/tobacco, 
transportation, communications, and utilities/sanitary services. While the 25 
percent cutoff is admittedly arbitrary, | also ran models including a variety 
of other cutoff points (20 percent, 30 percent, 33 percent, etc.). In none of 
these models was the strike rate coefficient significant. 


11. Nor could the type of analysis pursued in this paper be expanded backwards. 
Gaps within the strike series itself — in particular between 1977 and 1983 
— prevent one from constructing a seamless series. Moreover, even if | want- 
ed to replicate my analysis only using pre-1978 strike data, CPS-union data 
only extends back to 1977; and only beginning in 1983 has the CPS greatly 
expanded its union coverage to include industry and geographic tags (see 
www.unionstats.com). If the literature on strike effects was contradictory, a 
replication of historical data using a version of the methodology employed in 
this study might be warranted. However, prior research uncovering a positive 
strike-relationship has been sufficiently varied in level of analysis and time 
period covered that | remain confident an extension backwards of my own 
methods would corroborate the general pattern previously discovered. 
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Appendix 1: Regional Groupings by State 


1. Northeast/ mid-Atlantic Region 


Maine 

New Hampshire 
Vermont 
Massachusetts 
Rhode Island 
Connecticut 
New York 

New Jersey 
Pennsylvania 
Delaware 
Maryland 


Washington D.C. 


2. South 
Virginia 
West Virginia 
North Carolina 
South Carolina 
Georgia 
Florida 
Kentucky 
Tennessee 
Alabama 
Mississippi 
Arkansas 
Louisiana 


3. Midwest 
Ohio 
Indiana 
Illinois 
lowa 
Michigan 
Missouri 
Wisconsin 
Minnesota 
North Dakota 
South Dakota 
Nebraska 
Kansas 
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4. West 


Oklahoma 
Texas 

New Mexico 
Arizona 
Montana 
Idaho 
Nevada 
Wyoming 
Colorado 
Washington 
Oregon 
California 
Alaska 
Hawaii 

Utah 
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Appendix 2: Firm Size Effects on Weekly Worker Pay 


Full Sample High Union Sample 
Lagged Strike Rate -137.19 -47.14 
(182.03) (100.39) 
Average Firm Size 29.29** 5.90 
(8.38) (6.40) 
Regional Unemployment -2.62 -3.11 
(1.60) (2.05) 
Median Managerial Pay 03" .02* 
(.01) (.01) 
Worker Variables 
Proportion Unionized 398.03*** 176.07*** 
(47.09) (34.54) 
% Female -193,45*** -250.42*** 
(54.81) (35.51) 
% Minority -278.20*** -84.02* 
(52.24) (33.55) 
Less than HS -440.14*** -282.48*** 
(57.21) (44.10) 
HS Degree -50.49* -43.19* 
(23.42) (20.16) 
Age 48.31** -3.56 
(16.47) (18.14) 
Age' -.56^* 12 
(.21) (.23) 
Year fixed-effects Yes Yes 
Industry/region fixed-effects Yes Yes 
Constant -439.95 650.30* 
(311.79) (348.00) 
N 448 784 
Number of parameters 57 81 
R .99 .96 


Note: Standard errors are in parentheses. Dependent variable in all models is weekly median 
wage for non-professional, non-managerial workers. Wages in constant 2002 dollars. Models 
weighted by cell counts for each industry/region grouping. 

'p«.10 *p«.05 **p<.01 **p<.001 
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Appendix 3: Autocorrelation Tests for High Union Sample 





Lagged 

Strike Proportion 

Model N Rate Unionized 
1. Main model (Model 1 from Table 6) 1008 -47.55 258.02*** 
2. Main model with lagged dependent variable 1008 -53.45 1190.031 
3. PCSE with panel specific AR-1 autocorrelation structure 1008 -39.89  229.41*** 
4. Main model minus panels exhibiting autocorrelation 558 (Ie ON S Pate 
5. First differenced model: 95229 3145.43 0115/53358 


*p«.05 *p<.01 **p<.001 
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Do High Technology Policies Work? High Technology Industry 
Employment Growth in U.S. Metropolitan Areas, 1988-1998 


J. Craig Jenkins, Ohio State University 
Kevin T. Leicht, University of lowa 
Arthur Jaynes, Ohio State University 


Abstract 

Since the 1970s, federal, state and local governments have launched an array of new 
high technology development programs. Researchers and policy-makers disagree about 
the relative merits of these policies. We address the effects of seven of these policies on 
high tech industry employment growth in metropolitan statistical areas in the United 
States between 1988 and 1998. A conditional change score design shows that technology 
grant/loan programs and technology research parks have direct effects net of location 
and agglomeration factors. Five of seven programs positively interact with existing 
agglomeration advantages to create growth in high technology industry employment. 
Technology development programs compensate for deficits in agglomeration resources. Our 
results suggest that high-technology development can be planned by designing programs 
that magnify existing local growth advantages. 


Since the early 1970s, state and local governments have launched a wide array 
of new economic development programs to promote high tech development. 
Popularly called "third wave," "new industrial" and "entrepreneurial" policies, 
these initiatives entail direct state intervention in the creation of new enterprises, 
products, markets and technologies. By helping to identify market opportunities, 
fostering local innovation capacities, and making public investments in new 
technology and private enterprises, these governmental programs have 
attempted to promote "risky but potentially productive undertaking(s) that would 
not have gone forward without governmental support." (Eisinger 1988:230) 
These initiatives involve direct governmental intervention in the creation of 
new technology, products, markets and enterprises. We focus on seven major 
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programs: public venture capital programs, Small Business Innovation Research 
(or SBIR) programs, grant and loan programs to finance the development of new 
technology, university-affiliated technology development centers, technology 
deployment/transfer programs, technology business incubators, and technology 
research parks. 

Our research addresses two questions about these programs. First, how 
effective are they at promoting high technology industry employment growth 
net of existing agglomeration and location factors? Second, do these programs 
magnify or compensate for agglomeration and location factors, including existing 
high technology industry? 

We take a regional perspective by examining the growth of high technology 
industry employment in metropolitan areas (MSAs) in the United States between 
1988 and 1998. Several large MSAs such as Chicago, Washington, D.C., Boston, 
and San Jose, started this decade with major high techn concentrations that 
continued to grow. Others such as Los Angeles, Detroit and Seattle, had large 
high technology sectors that declined. Other smaller MSAs such as Wichita, 
Kansas, Austin, Texas, Dayton-Springfield, Ohio, and Raleigh-Durham, N.C., began 
with small high tech sectors but grew rapidly. Our major concern is whether the 
high tech development policies adopted by state and local governments helped 
create this high tech industry employment growth. 

Policymakers and the general public treat employment as the central yardstick 
for evaluating these programs. The primary rationale for these programs is that 
they create quality jobs. High tech also has significant spillover effects on the 
technology and growth of other industries (Hadlock, Hecker and Gannon 1991; 
Hecker 1999; Zachariadis 2002), and is critical to the economic competitiveness 
of specific regions and nations in an increasingly global economy (Atkinson and 
Gottlieb 2001; Devol, Koepp and Fogelbach 2002; Fujita, Krugman and Venables 
1999; Harchaoui et al. 2002; National Science Board 1998:Ch. 6). As we show 
below, wages in high technology industries are typically 30 to 40 percent higher 
than in other industries. Labor productivity in high technology manufacturing 
increased between 1987 and 1999 by an average of 9.5 percent per year 
compared to 3.2 in the manufacturing sector as a whole (Kask and Sieber 2002). 
High tech employment was second only to human capital as a predictor of per 
capita income growth in U.S. states between 1995 and 2000 (De Vol et al. 2002). 
High tech development is critical to the prosperity and economic competitiveness 
of the states and communities that have invested in these new programs. But do 
these high tech programs work? 


Differing Views on Location-specific Technology Policies 


Some analysts argue that high tech development is a path-dependent process 
that cannot be influenced by public policies (Kenney and von Burg 2000). Others 
contend that conventional location factors, such as labor and housing costs, low 
taxes, regulatory environments and access to transportation and markets are of 
limited importance (Florida 2002). The major input factor in high tech production 
is scientific intelligence harnessed to technical problem-solving. In principle, high 
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tech industry is location-free and cannot be influenced by traditional industrial 
recruitment incentives. 

However, high tech industry is locality bound as it depends heavily on 
interpersonal networks and the social reinforcement of entrepreneurial activities 
(Florida 2002, Florida and Kenney 1990, Kenney and Von Burg 2000, Lee et al. 2000, 
Kolko 2002, Saxenian 1994, Thornton 1999). High tech entrepreneurs and workers 
need access to tacit and technical knowledge as well as social reinforcement for 
entrepreneurial activities that are important to industrial growth in general (e.g., 
Sorensen and Audia 2000) and especially to growth in high tech industries (Koepp 
2002, Lee et al. 2000). High tech firms tend to cluster geographically because of the 
need for information exchange, the interpersonal transmission of tacit knowledge 
about business formation and product development, localized concentrations of 
skilled labor, lifestyle amenities, and research facilities associated with research 
universities, large corporations and federal research labs (Branscomb and Florida 
1998; Cohen, Nelson and Walsh 1996; Goldstein and Renault 2004; Kenney and 
von Burg 2000; Kolko 2002; Luger and Goldstein 1997). Most new high tech firms 
are spin-offs of other high tech firms, typically located within the same immediate 
area as the parent enterprise. Markusen, Hall and Glasmeier (1986:154-56) found 
that agglomeration effects from Fortune 500 headquarters, business services and 
military RED affected high technology manufacturing growth in MSAs between 
1973 and 1977. Access to venture capital is critical for converting new ideas into 
commercial products (Florida 2002, Kenney and Florida 2000, Thornton 1999). State 
and local high tech programs may substitute by providing start-up capital (public 
venture capital, SBIR funding, and technology grants and loans), technical and 
management advice (business incubators, research parks), information networks 
(business incubators and research parks, networking programs) and, perhaps more 
importantly, operate as catalysts for magnifying the impact of existing location and 
agglomeration advantages. 

Proponents of high tech policy have traditionally debated two general 
approaches: (1) a centrally-directed infrastructure strategy of investing in public 
research and specialized infrastructure to attract existing high technology 
industries to specific locations or “technopoles” (Castells and Hall 1994), and 
(2) a more decentralized entrepreneurial strategy of reinforcing local innovation 
capacities by investing in new enterprises and products, and promoting the 
development of local networks and partnerships (Clarke and Gaile 1989, 1998; 
Eisinger 1988; Osborne 1988). Federal, state and local governments have 
used both strategies, and the mix of these programs suggests that they are 
complementary parts of a general high-tech strategy. We want to know which of 
these policies promote tech employment growth. 

We focus on decade-long change in high tech industry employment in MSAs 
between 1988 and 1998. MSAs constitute an ideal ecological unit for evaluating 
these policies as well as underlying location and agglomeration advantages. 
In 1990 an MSA was defined as an integrated labor market within a one-hour 
commute of a central city of 50,000 or more population. By mapping the location 
of federal, state and local high technology programs to these MSAs, we evaluate 
whether these programs had an impact net of existing location and agglomeration 
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advantages and whether these programs magnify or compensate for these local 
growth factors. Several location and agglomeration factors, such as major air 
hubs and federal R&D expenditures, are likely to have benefits that operate on 
a larger ecological scale than cities or counties, pointing to the advantage of a 
focus on MSAs. Product development typically requires at least 5 to 10 years to 
move from an initial business proposal to actual production on 8 scale that would 
measurably affect employment. 

We begin with a discussion of the nature of high tech industry and its 
distribution across MSAs. 


High Tech Employment in Metropolitan Areas, 1988-1998 


High tech industry is generally defined in terms of "the design, development, and 
introduction of new products and innovative manufacturing processes, or both, 
through the systematic application of scientific and technical knowledge." (U.S. 
Congress, Office of Technology Assessment, 1984:8-9) Such enterprises produce 
sophisticated products, use advanced or state-of-the-art techniques, have high 
expenditures on research and development, and employ a disproportionately 
large share of scientific, technical and engineering personnel. 


What is High Tech Industry? 


Analysts have taken two general approaches to measuring the growth of high 
tech industry. An output approach focuses on the technical sophistication of 
an industry's product or the extent to which products have undergone rapid 
change. The Bureau of the Census (National Science Board 1998:6.12-6.13), for 
example, used the expert judgment of industry analysts to identify leading-edge 
technologies in 10 product areas. Most popular accounts of high technology 
industry (e.g., Kotkin 2001) rely on similar criteria. However, there is little 
agreement on what constitutes a sophisticated product, and there is no clear 
way to link such products to employment change. 

A second approach is to focus on /nputs to industries by examining the 
proportion of workers in technology-oriented occupations or the business costs 
devoted to research and development. This approach provides objective criteria 
and has a direct link to employment data. Markusen et al. (1986) identify high tech 
manufacturing as industries that exceed the manufacturing mean in the percent 
of scientific, technical and engineering personnel in three-digit SIC industries. 
Hadlock et al. (1991) improve on Markusen et al. (1986) by using the Occupational 
Employment Statistics Survey (or OES) of private employers to identify the 
proportion of technology workers engaged in research and development. 

We use Hecker's (1999) refinement of Riche, Hecker and Burgan’s (1983) and 
Hadlock et al.'s (1991) approach that has three main advantages. First, it uses 
newer OES estimates of scientific and technical personnel. Second, it uses a 
more stringent input criterion of having at least twice the industrial mean in both 
employment in research and development and employment in all technology- 
oriented occupations. Third, it includes services as well as manufacturing, 
which is critical in view of the growing significance of high tech services. "High 
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technology" is thus defined as all private sector industries where employment 
in both research and development and in all technology-oriented occupations 
is at least twice the national industrial mean. We use Hecker's estimates, 
which rely on the 1987 OES. The OES covers all industries except agriculture 
(but includes agricultural services), forestry, fishing, private households and the 
federal government. Hecker's definition provides a conservative objective basis 
for gauging industrial employment trends. At the MSA level, it also overlaps 
closely with the industry lists used earlier by Riche, Hecker and Hadlock (1983), 
Markusen et al. (1986) and Hadlock et al. (1991).' 

We examine the first difference change in private sector employment atthe MSA 
level in the 31 three-digit SIC industries identified by Hecker (1999). To measure 
this, we use the Current Employment Statistics Survey (or CES), compiled by the 
Bureau of Labor Statistics from Social Security establishment unit reports (also 
known as the ES-202 program). Four of our high tech industries are in services 
and 27 are in manufacturing. In 1988, there were 6.6 million high tech industry 
jobs inside of MSAs, representing 74.6 percent of the national total of 8.8 million 
high tech industry jobs. By 1998, high tech industry employment inside of MSAs 
grew to more than 6.9 million jobs, but reflecting the region diffusion of the high 
tech industry, this constituted only 67.6 percent of 9.8 million national high tech 
industry jobs. High tech's share of jobs in MSAs declined from 9.2 to 7.9 percent 
of total private sector MSA employment, reflecting more rapid employment 
growth in other industries. Nonetheless, high tech industry continued to be a 
source of "good jobs." In 1988 the mean wage for high tech industry jobs was 
$29,046 (in 1988 current U.S. dollars), which was 48 percent greater than the 
mean private sector wage of $19,628. In 1998, this high tech wage premium had 
declined slightly to be 36.3 percent greater than the mean private sector wage 
($42,892 in current 1998 U.S. dollars for the high tech industry vs. $27,329 for all 
private sector jobs). 

We analyze the 291 MSAs for which CEW employment data are available 
for both 1988 and 1998. Our 291 MSAs contain 97.04 percent of the 1990 U.S. 
Census estimate of the national metropolitan population and provide the most 
complete set of reliable estimates of change in MSA employment available. 

Table 1 identifies the top 20 MSAs in terms of the growth of additional high 
tech industry employment between 1988 and 1998 and the top 20 MSAs in terms 
of the loss of high tech industry employment. In contrast with the conventional 
wisdom that advantaged high tech areas experienced greater growth in new jobs, 
the percent of employment linked to the high tech industry in 1988 is negatively 
correlated with the change in high tech industry employment between 1988 and 
1998 (r = -.144, p < .014). This indicates the trend toward the dispersion of 
high tech industry employment and convergence among MSAs in terms of the 
presence of high tech industry (see also Kolko 2002). As noted earlier, some MSAs 
such as San Jose, California, and Washington, D.C., are high tech "meccas" that 
had major concentrations of high tech industry in 1988 and experienced further 
high tech industry employment growth during the 1990s. Others, including 
Los Angeles, Detroit and Seattle, had large high tech sectors but experienced 
significant loss in high tech industry jobs. Why? 
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Table 1: High Tech Industry Employment Change, 1988-1998 


e 


— EE $ 
Top 20 High Tech Industry Job Gainers 
# High Tech High Tech Change in 


Jobs 1988 Jobs 1998 Jobs 1988-98 


MSA Name 

1. Chicago, IL 73:115 302,108 128,993 
2. Washington, DC-MD-VA 191,979 288,469 96,490 
3. Boston-Lawrence-Salem, MA 290,423 350,583 60,160 
4. Houston, TX 83,822 139,978 56,156 
5. Rochester, NY 19,976 75,063 55,087 
6. Atlanta, GA 93,750 144,485 50,735 
T. Portland, OR 40,295 83,865 43,570 
8. Dallas, TX 151,510 192,682 41,172 
9. New Haven-Waterbury-Meriden, CT 34,226 74,514 40,288 
10. San Francisco, CA 56,315 95,674 39,359 
11. San Jose, CA 267,542 302,176 34,634 
12. Minneapolis-St. Paul, MN-WI 130,077 160,211 30,134 
13. Oakland, CA 67,746 95,993 28,247 
14. Newark, NJ 57,338 85,200 27,862 
15. Austin, TX i 35,568 62,679 2111 
16. Dayton-Springfield, OH 28,846 52,912 24,066 
17. Raleigh-Durham, NC 36,191 59,426 23,235 
18. Salt Lake City-Ogden, UT 29,535 52,679 23,144 
19. Wichita, KS 6,399 28,422 22,023 
20. Cleveland, OH 70,744 91,438 20,694 


Top 20 High Techindustry Job Losers 
High Tech High Tech Change in 
Jobs 1988 Jobs 1998 Jobs 1988-98 


MSA Name 

1. Los Angeles-Long Beach, CA 499,945 351,000 -148,945 
2. Detroit, MI 258,553 121,456 -137,097 
3. Seattle, WA 148,797 97,474 -51,323 
4. Hartford-New Britain-Middletown, CT 79,946 39,580 -40,366 
5. Nassau-Suffolk, NY 119,528 80,212 -39,316 
6. Bergen-Passaic, NJ 82,559 55.008 -27,551 
7. Baltimore, MD 66,927 46,312 -20,615 
8. Flint, MI 34,953 15,879 -19,074 
9. Fort Worth-Arlington, TX 65,762 47,703 -18,059 
10. Lansing-East Lansing, MI 23,905 7,523 -16,382 
11. St. Louis, MO-IL 112,196 95,883 -16,313 
12. Allentown-Bethlehem-Easton, PA-NJ 25,464 9,819 -15,645 
13. Saginaw-Bay City-Midland, MI 18,500 5,041 -13,459 
14. Peoria, IL 16,364 3,106 -13,258 
15. Buffalo-Niagara Falls, NY 35,614 23,262 -12,352 
16. Binghamton, NY 15,501 4,178 -11,323 
17. Ann Arbor, MI 28,096 17,289 -10,807 
18. Syracuse, NY 23,618 13,430 -10,188 
19. Richland-Kennewick-Pasco, WA 12,420 2,864 -9,556 


20. Melbourne-Titusville-Palm Bay, FL 39,269 29,762 -9,507 
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Location Theory 


While our major interest is the effect of high tech development programs, 
other explanations need to be included. Analysts of regional development have 
traditionally focused on /ocation advantages and agglomeration effects. Location 
explanations suggest that lower input factor costs (such as wages or taxes) 
and central location advantages, such as access to transportation and markets, 
facilitate economic development. Irwin and Kasarda (1991) show that centrality 
in the airline industry hub system contributed to economic growth and, given the 
importance of rapid transit to high tech industry, we would expect airline access 
to facilitate high tech development. Similarly, central access to markets as gauged 
by greater population density and interstate highway access should boost high 
tech development. Markusen et al. (1986:155) found that a favorable climate, 
airline access and educational options in terms of the number of four-year higher 
education institutions contributed to high tech industry employment growth 
between 1973 and 1977. They also found that a greater percent black population 
discouraged high tech industry employment growth, which they interpreted as 
reflecting racial prejudice and human capital deficits. Other location factors, such 
as higher mean wages in high tech industry and recreational and arts amenities 
were not statistically significant. 


Agglomeration Theory 


Many argue that location factors are of little importance for high tech industry 
and that agglomeration processes are more important. Due to the labor-intensive 
nature of high tech production and the importance of tacit knowledge (Saxenian 
1994), these firms are unlikely to achieve internal economies of scale, but may 
achieve external economies of scale by building business partnerships and 
extensive interpersonal networks among distinct firms to exchange technical 
and market information (Kenney and Von Burg 2000). Numerous case studies 
of particular high tech industries and regional high tech clusters indicate the 
importance of entrepreneurial resources (such as venture capital) and a pattern 
of corporate spin-offs, inter-firm personnel transfers and collaborative business 
partnerships (Cortright and Mayer 2002, Kenney and Florida 2000, Lee et al. 2000, 
Rogers and Larsen 1984, Saxenian 1994, Thornton 1999). Markusen et al. (1986) 
found that military R&D facilities contributed to high tech industry job growth 
along with concentrations of business service firms, such as accounting, legal and 
public relations firms. Although many note that high tech development has been 
greater outside the older more dense cities that were the urban centers for earlier 
industrial development, human ecologists argue that greater population density 
sustains high rates of social interaction and specialization, thereby promoting 
social and economic innovation (Hawley 1981). This innovation capacity should 
support high tech development. We therefore include population density as an 
agglomeration factor as well as a location advantage. 
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Federal, state and local entrepreneurial programs are designed to compensate 
for or to reinforce agglomeration processes. Several studies have examined the 
effects of particular high tech policies but typically without comparing units with 
and without these policies. Luger and Goldstein (1991) found that more than half 
of the high tech parks founded between 1950 and 1989 failed in the early stages 
and, of those that survived, less than half generated significant total employment 
growth in their immediate region (mostly older parks affiliated with major 
research universities that had more comprehensive service programs). But they 
did not compare counties without parks to those with parks, leaving unclear the 
distinctive contribution of parks to employment growth. A more relevant outcome 
measure would have been high tech industry employment. Yet this study did 
point to the possible importance of proximity and ties to research universities 
as magnifying the effects of these parks. In a study of biotechnology patents, 
Cortright and Mayer (2002) found that university and federally funded biomedical 
RED contributes to commercialized knowledge in MSAs that also have significant 
concentrations of private venture capital. In other words, it is the combination 
of biomedical R&D with private venture capital that leads to biomedical patents. 
Most university and federal research does not lead to patents but the existence 
of favorable university policies and local venture capital appear to be important 
contextual factors. Audretsch, Weigand and Weigand (2002) show that SBIR 
programs in Indiana provided small enterprise financing for a significant number 
of new high tech enterprises that otherwise were unlikely to have been created. 
All of these studies have the major limitation that they did not compare ecological 
units with and without programs, leaving unclear the distinctive impact of high 
tech policies. We examine seven programs that have been initiated by federal, 
state and local governments over the past three decades to promote regional 
high tech development. 


1. Public venture capital programs provide start-up, 
intermediate and commercialization financing for new 
products and firms. While the specific financing terms 
vary widely, these programs put state government in the 
position of taking a royalty or ownership position in new 
products and enterprises. Some investments create equity 
shares in a private stock company or royalty claims against 
the sales of particular products. Others are convertible into 
long-term bonds. In the case of a business failure, some 
investments are converted into grants with no repayment 
obligation while others remain liabilities resolved in 
bankruptcy proceedings. 


2. Small Business Innovation Research programs (or 
SBIRs) require federal agencies that make grants for 
technology innovation to set aside a specified portion for 
small business. Local SBIR programs chartered by state 
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governments administer these grant programs and, over 
the past two decades, have become a significant source of 
federal funding for new enterprise development (Audretsch 
et al. 2002, Wallsten 1998). 


3. Technology grant and loan programs provide financing 
for the development of new products, typically through 
a competitive application process where  would- 
be entrepreneurs provide business plans for the 
commercialization of specific products. We include in 
this category tax subsidized private venture capital firms, 
Business Investment Development Companies (BIDCOs) 
and commercialization programs that rely exclusively on 
grants and loans. 


4. Technology development programs, typically associated 
with research universities and government-industry 
consortia (e.g. Sematech), focus on basic and applied 
research to develop new products and technology. Some 
are funded directly to operate research programs while 
others are funded by federal and state grants and contracts 
to develop new technologies and products. 


5. Technology deployment and transfer programs focus 
on the utilization of “state of the art” technology. Taking 
land grant university rural extension programs as their 
historical model, these programs focus on the transfer 
and deployment of existing “state of the art” technology 
through consulting, customized labor training, technical 
reports, conferences and symposia. Some programs are 
administered as state agencies while others are organized 
as nonprofit corporations. Technology deployment and 
transfer programs typically deliver services through 
contracts and direct delivery. 


6. High technology business incubators provide subsidized 
space for research and development and, in varying degrees, 
technical and business advice, including assistance in 
securing public and private financing. Some incubators are 
state or local governmental agencies; some are affiliated 
with research universities, two-year colleges and research 
parks, while others are independent nonprofit and for-profit 
corporations. Incubators often have multiple sponsors, 
including federal agencies and private corporations as well 
as state and local governments. We include only business 
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incubators with state and local government sponsors, that 
have a declared high tech focus. 


7. Technology research parks provide subsidized long-term 
space for high technology businesses along with varying 
degrees of business financing and managerial/technical 
assistance. Most are affiliated with research universities and 
typically operate as state-chartered non-profit corporations. 
Both business incubators and research parks typically 
specialize in particular technologies, attempting to foster 
local expertise, partnerships and networks. 


While we view the distinction between infrastructure and entrepreneurial 
strategy as a continuum, business incubators, research parks and technology 
development programs are infrastructural because they subsidize research and 
development costs in specific locations. Public venture capital, SBIRs, technology 
grant/loans and technology deployment/transfer programs are entrepreneurial 
because they attempt to strengthen existing innovation capacities. 


Capturing State and Local Technology Programs 


Table 2 summarizes the counts of these state and local technology programs 
that were operating in metropolitan and non-metropolitan areas in 1990. We 
derived information on these programs from the comprehensive listing of state 
government technology programs compiled by the Carnegie Commission on 
Science, Technology and Government (Coburn and Berglund 1995), Luger and 
Goldstein's (1991) list of technology research parks in 1990, Eisinger's (1991) list 
of public venture capital programs in 1989-90, the 1990 membership list of the 
National Association of Business Incubators (1990), Clarke's (1986) inventory of 
state technology programs for the National Governor's Association, the inventory 
construction by the Minnesota Department of Trade and Economic Development 
(1988), the Directory of Incentives for Business Investment and Development in 
the United States published by the National Association of State Development 
Agencies (1986, 1991 and 1994), and a comprehensive review of the websites of 
all state technology development agencies (for an on-line directory, see: http:// 
www.ncscienceandtechnology.com/ External Programs.htm). In 1990, business 
incubators were the most numerous with 137 locatéd inside of MSAs, followed 
by 99 research parks, 94 technology development programs and 77 technology 
deployment/ transfer programs. Roughly two-thirds of these programs were 
located inside metropolitan areas with the remainder in non-metropolitan areas. 
There were significantly fewer public venture capital, SBIR and technology grant 
and loan programs, reflecting the fact that state governments typically establish 
only one headquarters office in or near the state capital. 

To gauge the effects of these programs, we use the sum of the total years' 
duration of each type of program for each MSA. As Wolman and Spitzley 
(1996) argue, it is important to move beyond measuring the simple presence 
of programs to capture policy scale. To gauge the scale of these programs, we 
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Table 2: High Tech Programs Inside and Outside of MSAs, 1990 


Programs In MSAs Outside MSAs Total 
Public Venture Capital 20 2 22 
SBIRs 17 9 26 
Technology Grants & Loans 16 D 21 
Technology Development 94 15 109 
Technology Deployment/Transfer 77 23 100 
Technology Incubators 137 41 178 
Research Parks 99 18 147 
Total 460 114 573 


summed the total years of existence of each program type within each MSA as 
of 1990. If an MSA had multiple, e.g., technology incubators, then we summed 
the total years of existence for all such programs as of 1990. While it would be 
ideal to have measures of the cumulative funding for these programs and other 
program details, such data are not available and would be near impossible to 
collect reliably. A poorly-funded but long-term program might have little impact 
but, in general, programs with longer duration have likely succeeded in mobilizing 
more resources. 

Table 3 identifies the top 10 MSAs in terms of their cumulative years of 
experience with each of these seven high technology programs as of 1990. Some 
MSAs; such as Boston and Albany, NY have longstanding public venture capital 
programs. Minneapolis leads in SBIR experience, Little Rock, AR and Boston in 
technology grant and loans, Raleigh-Durham, NC in technology development, 
Augusta, GA in technology deployment/transfer, and Philadelphia in technology 
business incubators and research parks. 

Table 4 lists the top 20 MSAs in terms of the sum of all program years of 
experience. Raleigh-Durham, NC is the leader, with more than 152 cumulative 
years of high tech program experience, followed by Philadelphia (125 years), 
Augusta, GA (116 years) and Pittsburgh (81 years). The mix of these programs 
varies significantly, with Raleigh-Durham having more technology development 
and deployment/transfer programs and Philadelphia having more incubator and 
research park experience. More than 54 percent of the 291 MSAs have at least 
one year of program experience and 27 MSAs had five or more high technology 
programs operating in 1990. We now turn to our method for analyzing the effects 
of these programs and location/agglomeration advantages on the growth of high 


tech industry employment. 


Methodology 
Model 


The main question facing policy makers is how to generate high tech industry 
employment net of existing employment levels. To address this, we use a 
conditional change design (Finkel 1995:6-9) where the dependent variable 
is the first difference change in the number of high tech industry jobs, which 
is regressed on the lagged endogenous variable (i.e., high tech industry 
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Table 3: Top 10 MSAs in Total Years of Experience with High Tech Programs, 1990 


Public Venture Capital Programs 

Boston-Lawrence-Salem-Lowell, MA 

Albany-SchenectadyTroy, NY 

Hartford-New Britain-Middletown, CT 

Madison, WI 

Indianapolis, IN 

Lansing-East Lansing, MI 

Philadelphia, PA 

Allentown-Bethlehem-Easton, PA-NJ 

Portland, OR 

Little Rock, AR 

SBIR Programs 

Minneapolis-St. Paul, MN 

Salt Lake City-Ogden, UT 

Boise City, ID 

Houston, TX 

Columbus, OH 

Charleston, WV 

Billings, MT 

Hartford-New Britain-Middletown, CT 

Little Rock, AR 

Baton Rouge, LA 

Technology Grant & Loan Programs 

Little Rock, AR 14 

Boston-Lawrence-Salem-Lowell, MA 14 

Indianapolis, IN 9 

Topeka, KS 9 

Salt Lake City-Ogden, UT f 

Montgomery, AL 6 

Washington, DC-MD-VA 5 
4 
4 
1 
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Anchorage, AK 
Austin, TX 
Oklahoma City, OK 
Technology Development Programs 
Raleigh-Durham, NC 48 
Great Falls, MT 29 
Detroit, MI 23 
Lincoln, NE 21 
Middlesex-Somerset, NJ 20 
Ann Arbor, MI 19 
Akron, OH 18 
Salt Lake City-Ogden, UT 16 
Orlando, FL 13 
Dayton-Springfield, OH 12 
Technology Deployment/Transfer Programs 
Augusta, GA-SC 116 
Raleigh-Durham, NC 69 
Macon-Warner Robins, GA 58 
Charlotte-Gastonia-Rock Hill, NC-SC 43 
Atlanta, GA 29 
Columbus, GA-AL 29 
Albany, GA 29 
Athens, GA 29 
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Savannah, GA 29 
Philadelphia, PA 27 
High Tech Business Incubators 

Philadelphia, PA 43 
Pittsburgh, PA 41 
Buffalo-Niagara Falls, NY 25 
Chicago, IL 22 
Minneapolis-St. Paul, MN-WI 16 
Boston-Lawrence-Salem-Lowell, MA 15 
Syracuse, NY 15 
Toledo, OH 12 
Albany-Schenectady-Troy, NY 10 
Dayton-Springfield, OH 10 
High Tech Research Parks 

Philadelphia, PA 43 
Oklahoma City, OK 40 
San Jose, CA 39 
Richland-Kennewick-Pasco, WA 36 
Raleigh-Durham, NC 31 
Washington, DC-MD-VA 30 
Lafayette-West Lafayette, IN 29 
Huntsville, AL 28 
Champaign-Urbana, IL 27 


Tampa-St. Petersburg-Ciearwater, FL 26 


employment in 1988) and additional independent variables. There are theoretical 
reasons for this design. The initial starting point of high tech industry employment 
is likely to affect subsequent change, either facilitating growth due to the location 
and agglomeration advantages associated with existing high tech industry or, 
alternatively, by capturing factors (diffusion, overbid wages) that geographically 
disperse high tech employment over time. Insofar as measurement errors are 
consistent across time, this design reduces problems with serially correlated 
measurement errors. We measure independent variables at the starting point or 
as near as possible with one exception — contemporaneous population change 
— which is introduced as an exogenous control variable. 
Our model is as follows: 


(MAS DD OO VG Ub (ZZ ye 


where Yf2 - Yt1 represents the first difference change in high tech employment, 
Yt1 is high tech employment in 1988, X+1 independent variables measured at 
or near 1988, Zt - Zt1 independent variables that change concurrently over the 
period in question (1988-1998), and e is an error term. To capture the contextual 
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effects of policies in favorable agglomeration contexts, we also add to this 
equation interactions between policy measures and agglomeration variables net 
of the respective main effects of said policies and agglomeration variables. 

We first control for location and agglomeration factors, then add the 
cumulative years for each type of high tech program. High tech programs need 
time to generate significant employment. We therefore use the total years of 
program experience for each policy type through 1990. We address two key 
questions about these policies: (1) Do high tech policies have a direct impact 
net of location and agglomeration factors? (2) Do these policies magnify local 
agglomeration processes? These programs often are presented as substituting 
for agglomeration factors by growing new industries and employment in areas 
that previously lacked them. This should be indicated by significant positive 
effects of policy scale net of controls for location and agglomeration factors. 
Programs are also seen as magnifying local innovation capacities. This should 
be evident from positive interactions between agglomeration contexts (including 
existing high tech industry employment) and policies. 


Measurement 


Our dependent measure is the first difference change score in the number 
of private sector high tech industry jobs between 1988 and 1998. The mean 
change in high tech industry jobs is 2,277.8. In our regression analysis, we ` 
control for the number of high tech industry jobs in 1988 (mean = 20,107). Both 
measures are moderately skewed, leading us to also test the natural log for all 
independent measures with a skewness of 3.0 or more. This identified the same 
set of statistically significant factors as the unlogged measures. We present the 
unlogged results because they provide interpretable partial slopes indicating the 
number of jobs created by specific independent variables as well as interpretable 
adjusted R2 statistics. 

To capture location factors that may be attracting high tech industries, we use 
the following measures with means in parentheses: (1) the 7988 mean wages 
of high tech industry jobs ($28,856); (2) 7988 median housing costs ($86,400); 
(3) climate quality based on the number of sunny above freezing days per year 
(175.9); (4) recreational amenities based on the number of major recreational 
organizations (61.1 with a scale of 1-100); (5) arts amenities based on the number 
of art museums and attendance (65.5 with a scale of 1-100); (6) airport access 
based on the number of daily air flights(91.3); (7) freeway access based on the 
number of federal interstates highways in the metropolitan area (1.25); (8) market 
centrality based on population density in 1990 (404.6 persons per square mile); (9) 
higher education options based on the number of 4-year colleges and universities 
(2.77); and (10) the percentage black population (10.5 percent). Mean high tech 
industry wages were derived from the CES data, and population density and the 
percent black from the U.S. Bureau of the Census (1991). All other measures 
come from Boyer and Savageau (1989). 

Agglomeration advantages are measured by: (1) population density in 1990; 
(2) the number of Fortune 500 manufacturing corporate headquarters (Fortune 
1989 [1.45]); (3) the number of private venture capital corporations in 1987-88 
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(Morris and Isenstein 1989 [8.368]); (4) university RED expenditures for 1990 
based on the earliest available data on the top 100 universities in terms of R8D 
expenditure in the National Science Foundation's (2002) WebCASPAR database: 
www.nsf.gov/sbs/srs ($50,503,000); and (5) federal and military research facilities 
as gauged by Hook's (2003) estimate of the mean of the 1980 and 1990 square 
meters of research space operated by the U.S. Defense Department, NASA, NIH 
and the nuclear arms labs (49.9 square meters). As noted earlier, we include 
population density in both location and agglomeration equations since it can be 
argued to tap both theories. To capture exogenous uncontrolled growth factors, 
we also introduce percent population change between 1990 and 1997 drawn 
from U.S. Bureau of the Census (2000) (8.1 percent). We use two-tailed tests 
of significance and inspected all equations for multicollinearity using variance 
inflation and tolerance statistics, finding no evidence of problems. We also 
tested for influential outliers (Bollen and Jackman 1985), and we removed Detroit 
from the regression results below as a result, making our regression sample 290 
MSAs.2 How do these factors affect high tech employment change? 


Results 


Our main regression results are presented in Table 5. The lagged endogenous 
control is not significant (Model 1), indicating little effect of the start point 
number of high tech industry jobs. The adjusted R? of .001 provides a baseline ` 
for comparing subsequent models. 

In Model 2, the lagged endogenous term, mean high tech wages, a favorable 
climate and the number of daily air flights are significant predictors in the 
predicted direction. This suggests that wages encourage the dispersion of high 
tech industry. The number of four-year colleges and universities is significantly 
negative, the opposite found by Markusen et al. (1986:155). We suspect this 
reflects the fact that the majority of these schools are liberal arts colleges 
with little direct impact on high tech development. It may also reflect the post- 
1960s growth in higher education where four-year colleges are no longer critical 
amenities, especially relative to research universities. Later we introduce the 
more meaningful university R&D measure. Population density is not significant. 
Percent black is also not significant, suggesting that the industry avoidance of 
larger minority populations earlier found by Markusen et al. (1986:155) may 
have dissipated. Location factors contribute 18.7 percent to explained variance 
(adjusted R? = .188) over the baseline model. 

Model 3 controls for agglomeration factors, showing that, in addition to a 
negative effect of the endogenous lag term, Fortune 500 headquarters, university 
R&D and military R&D positively affect change in high technology industry 
employment. These effects point to the importance of entrepreneurial resources 
and public sector R&D investments and resemble Markusen et al.'s (1986: 154-56) 
findings that large manufacturing corporate headquarters and military spending 
contribute to high tech employment growth. Population density and private 
venture firms are not significant. In Model 4 we add population change, which is 
positive and significant but does not alter any of the other factors. This suggests 
that other factors associated with population growth do not alter the effects in 
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Model 3. The agglomeration factors in Model 3 add 17.9 percent in explained 
variance over the baseline model. 

Model 5 examines the effects of our policy measures. In addition to the 
negative effect of the endogenous lag term, technology grants and loans, business 
incubators and research parks contribute to high tech industry job growth. Public 
venture capital, SBIRs, technology development and technology deployment/ 
transfer programs are not statistically significant. Programs associated with 
both the infrastructure (business incubators, research parks) and entrepreneurial 
strategies (technology grants and loans) are effective at promoting high tech 
employment growth. At the same time, these results raise questions about the 
efficacy of the other four programs. Compared to the baseline model, high tech 
policies added 10 percent to explained variance. 

Model 6 combines all the significant location and agglomeration variables with 
the seven policy measures to see whether high tech development policies affect 
high tech employment growth net of location and agglomeration factors. In 
addition to the location and agglomeration factors, technology grants and loans 
and research parks have positive independent effects on high tech industry job 
change. The partial slopes in this equation indicate that each additional grant 
and loan program year creates 2,666 high tech jobs and each additional year of 
research parks creates an additional 364 high tech jobs. Business incubators 
lose significance due to a moderate correlation with Fortune 500 headquarters 
(r = .39). When Fortune 500 headquarters are removed from this equation, the 
incubators effect is positive and significant (b = 680.8; t — 2.688), suggesting that 
their direct benefits are associated with the presence of Fortune headquarters. 

In addition to the lagged endogenous terms, a favorable climate, air flights, 
Fortune 500 headquarters, university R&D and military R&D still contribute to high 
tech industry job growth. The strongest standardized coefficients in this equation 
are the endogenous lag term (-.76), Fortune 500 headquarters (.49), and air flights 
(.35) followed by technology grant and loan programs (.24), military R&D and 
climate (both .16), and research parks (.15), indicating that agglomeration factors 
are the dominant forces behind high tech growth. These factors add 36.3 percent 
in explained variance to the base model. 

Do these programs magnify or compensate for the agglomeration factors? To 
examine this, we test interaction terms between our seven policy measures and 
our agglomeration factors. We also include interactions with daily air flights (for 
a facilitator of agglomeration, see Irwin and Kasarda [1991]), population density, 
and 1988 high technology employment along with the major agglomeration 
measures. Because population change is an exogenous control, it is not combined 
in interactions but is retained as a control in the models. If the interaction term 
is positive, policies are magnifying the agglomeration effect by creating greater 
positive change in high tech industry job growth. If the interaction is negative, 
policies are compensating for lower levels of agglomeration by generating high 
tech employment growth. In addition to significant interaction terms, the main 
effects may remain significant, indicating that a policy or agglomeration factor has 
a direct effect in addition to its contribution in a significant interaction term (see 
Jaccard, Turrisi and Wan 1990; Aiken and West 1991). If only the interaction term 
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and neither of the main effects are significant, the policy benefits are limited to 
specific agglomeration contexts. We add each interaction term separately to the 
variables in Model 6 of Table 5 with the other policy measures removed and use 
centered measures to multicolinearity (see also Aiken and West 1991 :32-35). 

Model 7 in Table 5 shows the first of these 42 equations. To save space, 
Table 6 shows only the statistically significant interaction coefficients along with 
standard errors and t-statistics plus the adjusted R? (full results are available 
from the authors). 

With the exception of SBIRs, which do not produce any statistically significant 
interactions, all of the high tech policies have contextual effects in at least two 
or more contexts. In several of these equations, the main effect for policy is not 
significant, indicating that the agglomeration context specifies the setting where 
these policies are more effective. In others, both the main effect and the interaction 
term is positive, indicating that the policy and the agglomeration context continue 
to have effects in addition to the combination of policy with the context. 

Public venture capital programs had no statistically significant effects in our 
earlier equations but these programs contribute to high tech industry employment 
growth in all of our agglomeration contexts, including more densely populated 
MSAs. In MSAs with significant military R&D, these programs help compensate 
for agglomeration deficits. Four of these contexts — air flights, Fortune 500 
headquarters, military R&D and existing high technology industry — are additively . 
significant alongside the interaction effect. 

High tech grant and loan programs interact positively with all the agglomeration 
contexts and, in five of these contexts, work additively in addition to interactively 
combining with agglomeration contexts to boost high tech industry employment. 
Fortune 500 headquarters and initial high tech industry employment also work 
additively in these equations. Overall, grant and loan programs appear to be one 
of the most consistently beneficial high tech policies. 

Technology development programs interact negatively with several 
agglomeration contexts, indicating contexts in which these programs compensate 
for agglomeration deficits. Negative interactions with air flights, population density, 
Fortune 500 headquarters and initial high tech industry employment indicate that 
technology development programs are compensating for initial agglomeration 
deficits in these contexts. The main effects for air flights, Fortune 500 headquarters, 
university RED and military R&D are positive in these equations, indicating that 
these factors still contribute to high tech industry employment growth. 

Technology deployment and transfer programs are not statistically significant 
in the earlier equations but they do increase high tech industry employment in 
conjunction with private venture capital and initial high tech industry employment. 
Because these programs do not generate new technology but rather attempt 
to ensure that “state of the art” technology is being utilized, this suggests that 
these contexts maximize the employment benefits of technology deployment 
and transfer programs by accelerating the adoption of current technology. In 
both contexts, the main effect of the agglomeration context is also statistically 
significant. 

The two strongest "technopole" factors - technology business incubators and 
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research parks — contribute to high tech job growth in all agglomeration contexts 
except for military R&D centers for incubators. In the incubator equations with air 
flights and Fortune 500 headquarters, the main effect of incubators is negative and 
significant, suggesting that incubators do not stimulate high tech employment 
growth outside of MSAs with air hubs and Fortune 500 headquarters. Both 
"technopole" policies, the main effects of air flights, Fortune 500 headquarters 
and initial high techn industry employment also retain their statistical significance 
in these interaction equations. ` 

Overall, these interactions indicate that both the infrastructural and 
entrepreneurial strategies are conditioned in their effects by the presence of 
agglomeration advantages. The critics of high tech policy are correct that high 
tech development is an organic process that cannot be planned from scratch. 
Existing agglomeration advantages are important to securing benefits from these 
policies. In a few contexts, policies compensate for agglomeration deficits. At 
the same time, the limits of these technology policies should not be overstated. 
Some of these policies — technology grants and loans, incubators, and research 
parks — have independent effects net of favorable agglomeration contexts 
and interactions among these. The best formula appears to be using both 
infrastructural and entrepreneurial policies by adapting them to complement 
existing local innovation capacities, thereby magnifying the effects of 
agglomeration advantages. Technology development programs are distinctive in 
compensating for agglomeration deficits. We find no evidence that the centralized 
or the decentralized approach is inherently superior. Multiple policies drawing on 
both approaches are effective, especially when they complement existing high 
tech resources or compensate for initial disadvantages. 


Conclusions and Implications for Policy 


Our results support the argument that federal, state and local high tech policies 
are contributing to high tech development and the creation of “good jobs.” States 
and localities in partnership with federal programs have initiated a range of new 
programs to enhance local innovation capacity and to create new "technopoles" 
in specific locales. 

Of these new high tech policies, technology grant and loan programs and 
research parks have been the most effective in promoting high tech industry 
employment growth. Technology grant and loan programs represent the core 
of a decentralized entrepreneurial approach, funding the development of new 
products and associated technologies proposed by existing enterprises and 
would-be entrepreneurs. Research parks represent the core of a centralized 
"technopole" approach that provides infrastructure for the creation of new 
high tech industry. By drawing on both approaches and recognizing the need 
for policy to complement existing agglomeration advantages, states and local 
governments have increased high tech industry employment. 

Our results support the idea that high tech development is an organic, path- 
dependent process while showing that governmental planning can accelerate this 
process. The major driving forces behind high tech development are underlying 
location and agglomeration factors, especially Fortune 500 headquarters, air hub 
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centrality and university/military R&D. At the same time, several of these policies 
work independently and interactively with existing agglomeration advantages. 
In this sense, high tech development can be planned. This planning process 
needs to take into account existing location and agglomeration strengths and 
deficits. While we have not examined the details of this planning process, it is 
clear that high tech programs need to be adapted to the existing mix of high 
tech industry and other location and agglomeration resources. Processes that 
facilitate this policy adjustment, such as industry-government advisory boards, 
the use of nonprofit corporations and other policy tools, may contribute to policy 
effectiveness. High tech industry cannot be created entirely from scratch, but it 
can be aided by effective policy. 

The conventional debate over infrastructural vs. entrepreneurial approaches 
to high tech development appears somewhat misguided. Some have argued for 
major infrastructural initiatives to create "technopoles" and new “Silicon Valleys.” 
Critics correctly point out that this strategy is unlikely to work by itself and that the 
informal networks, entrepreneurial traditions, skilled labor and tacit knowledge 
bases that are critical to high tech development cannot be planned. However, 
government policies can nurture these local contexts where agglomeration 
processes are already in place. Hence a "technopole" strategy has to be adapted 
to fit the local strengths and weaknesses of existing high tech resources. At the 
same time, this approach needs to be balanced by more decentralized policies, 
such as public venture capital and technology grant and loan programs that 
depend on the initiative of existing enterprises and would-be entrepreneurs. 

We also find support for existing arguments about the importance of location 
and agglomeration advantages. Numerous studies have discussed the importance 
of the R&D spillovers from large Fortune 500 companies, research universities 
and military research labs (Goldstein and Renault 2004, Hooks 2003, Kenney 
2000, Lee et al. 2000, Markusen et al. 1986, Saxenian 1994). Others have pointed 
to the advantages of a good climate (Markusen et al. 1986) and centrality within 
the air transport system (Irwin and Kasarda 1991). Much of this literature has 
relied on case studies to demonstrate the plausible link between existing public 
and private sector R&D and the creation of new high tech firms, patents and 
regional industrial clusters. Little of this work has used systematic comparison 
and multivariate controls. Our contribution has been to use these techniques 
to gauge the relative contribution of these factors on high tech employment 
growth. Local agglomeration networks and institutions create local capacities 
for innovation that are central to high tech development. In this sense, high tech 
industry is locality based. It requires a local innovation capacity that sustains 
the continuous creation of new enterprises and technologies, including business 
restarts and the recycling of technology. At the same time, policies adapted to 
complement existing high technology strengths may magnify the benefits of 
these advantages and, for technology development policy, compensate for initial 
agglomeration deficits. " 

Do these policies break with traditional industrial recruitment policies? Many 
analysts have promoted the new technology policies as alternatives to traditional 
industrial recruitment, arguing that they allow states and localities to "grow their 
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own" new industry (Osborne 1988; Eisinger 1988; Clarke and Gaile 1989, 1998). 
High tech programs differ from traditional industrial recruitment in requiring 
greater targeting and direct governmental participation in the development of new 
technology, products, markets and enterprises. Yet this distinction should not be 
overstated. The benefits of these policies depend on favorable agglomeration 
contexts. High tech policies help build on existing high tech industry and, for the 
most part, do not create new industries from scratch. In this sense, high tech 
policies may be seen as a more sophisticated form of industrial recruitment and 
retention, allowing areas with existing agglomeration and location advantages to 
capitalize on their assets. Future research should examine whether these policies 
reinforce or reduce existing regional disparities in high tech development. 


Notes 


1. We explored a variety of different measurement schemes, modelling 
strategies and variable measurement techniques. Details are available 
from the authors. 


2. Detroit is an influential outlier because it lost high technology employment 
despite two longstanding programs - the Michigan Energy Research and 
Resource Association and the Metropolitan Center for High Technology. Ap- 
parently the two programs failed to halt this decline. : 
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Economic Globalization, Industrialization and Deindustrialization 
in Affluent Democracies 


David Brady, Duke University 
Ryan Denniston, Duke University 


Abstract 

This study reexamines the relationship between economic globalization and manufacturing 
employment in affluent democracies. After reviewing past research, including the well- 
supported Rowthorn model, we propose a differentiation-saturation model that theorizes 
thatglobalization hasacurvilinearrelationship with manufacturingemployment. Usingtwo 
different techniques, we analyze the most comprehensive sample of 18 affluent democracies 
from 1960-2001. We examine 12 globalization measures and provide the first analysis of the 
curvilinear relationship between globalization and manufacturing employment. We find 
that some aspects of globalization have linear effects on manufacturing employment, most 
of which are positive. We find more evidence, however, that globalization has a curvilinear, 
inverted U-shaped relationship with manufacturing employment. The evidence for the 
Rowthorn model is mixed. GDP per capita and its square do not have robust effects, but 
agricultural employment is one of the most important causes. Including globalization in 
the model weakens the evidence for the Rowthorn model. There is some evidence that 
globalization has different effects across different varieties of capitalism, regions and 
historical periods. Ultimately, our analyses partially support both the Rowthorn model 
and our differentiation-saturation model. 


One of the most dramatic social changes of the past half-century has been the 
decline of manufacturing employment in affluent democracies. As Figure 1 shows, 
manufacturing employment as a share of the labor force declined substantially 
from 1960 to 2001 in the 18 affluent democracies in our study. In the early 1960s, 
manufacturing employment averaged nearly 40 percent across these countries. 
In addition, the maximum value countries (Germany and Switzerland) approached 
50 percent. Even Ireland, the least industrialized in 1960, had more than 22 percent 
employed in manufacturing. By 2001, massive deindustrialization had occurred 
across all 18 affluent democracies. The most industrialized country had fallen 
to less than 30 percent manufacturing employment, the average was below 25 
percent, and the least industrialized country was below 20 percent. 

Certainly, the widespread decline of manufacturing was not uniformly linear. 
From 1960 to 1975, the least industrialized countries experienced growth in 
manufacturing employment - the minimal value rose from 22.2 in 1960 to 28.7 
in 1973. During that period, heavily industrialized countries reached a plateau. 
The maximum value remained above 48 percent until 1971. Partly driven by 
manufacturing growth in the less industrialized among these countries, the mean 
peaked slightly above 38 percent in 1965 and remained above 36 percent through 
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Figure 1. Trends in Manufacturing Employment as a Percent of Labor Force in 18 Affluent 
Democracies, 1960-2001 
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1974. Nevertheless, manufacturing employment has experienced a secular 
decline at least since the mid-1970s. The average has declined almost every year, 
and by 2001, no country had even 30 percent manufacturing employment. 

What caused this dramatic social change? In debates on deindustrialization, 
one of the most controversial explanations has been the impact of economic 
globalization. While early researchers called attention to economic globalization 
and even suggested it was a central cause, subsequent empirical research was 
more skeptical. Following a substantial body of research, most analysts have 
converged on what can be called the Rowthorn model. This model emphasizes 
that deindustrialization is mainly caused by rising economic development and 
productivity. By contrast, economic globalization is not very influential and may 
actually augment manufacturing employment. Hence, the scholarly consensus 
has been that economic globalization has not had a very substantial influence on 
deindustrialization. 

In this study, we revisit this controversial debate, providing new 
evidence and scrutinizing the relationship between economic globalization 
and deindustrialization. While controlling for the Rowthorn model, we 
comprehensively test economic globalization's potential effects by considering 
12 indicators. We examine whether economic globalization has linear effects 
and provide the first test of whether economic globalization has curvilinear 
effects. We now review past research and then propose a theoretical model 
suggesting that economic globalization may have curvilinear effects on 
manufacturing employment. 


Past Research 


The concept globalization has been deployed for wide-ranging purposes (Guillén 
2001a; Sklair 1999, 2001). We make no claim to capture all dimensions of the 
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diverse globalization literature. It is beyond our study to examine cultural, political 
and legal globalization, the longer history of international economic integration that 
marks the rise of the modern capitalist system in the 16th century or developing 
countries. Rather, we strictly concentrate on globalization as it has been discussed 
in the literature on deindustrialization in the latter half of the 20th century in 
affluent democracies. Solely for the purposes of studying deindustrialization, we 
conceptually define economic globalization as international economic exchange 
and the flow of goods, services and capital across international boundaries. We 
operationalize economic globalization as international trade and investment. 
Henceforth, we simply refer to "globalization." 

Bluestone and Harrison (1982:6) coined the term "deindustrialization" and 
defined it as the "systematic disinvestment in a nation's core manufacturing 
industries." They were concerned with the wave of plant closings and the decline 
of manufacturing that was occurring in the United States in the late 1970s and early 
1980s (Wallace and Rothschild 1988). Though they offered a broad explanation 
for deindustrialization — including declining profits, anti-union management, the 
avoidance of taxation and welfare responsibilities, and shifting power relations 
between capital and labor (Grant and Wallace 1994, Schwartz and Zukin 1988) 
— Bluestone and Harrison argued that one cause was a “globalization gambit.” 
The globalization of production was undertaken partly to replace domestic 
manufacturing (Harrison and Bluestone 1988). Early research was supportive, 
illustrating how globalization undermined manufacturing in affluent democracies 
(Bluestone 1984). For example, Stopford and Turner (1985) showed that the 
outward FD! of British firms contributed to the loss of manufacturing jobs in the 
United Kingdom. 

Wood (1994) provides one of the more persuasive accounts of the 
globalization-deindustrialization link. Wood contends that trade with developing 
countries and the associated rise in demand for unskilled labor contributes to 
deindustrialization. Along with greater free trade, enhanced transportation and 
communication facilitate the movement of production to developing countries 
(Dicken 2003, Wallace and Brady 2001). Mixing new evidence with the argument 
that developing countries have capitalized on their comparative advantage in 
less-skilled, cheap labor, Wood concludes that trade with developing countries 
contributes to deindustrialization. In the most sophisticated test of these matters, 
Alderson (1999:718) concludes that, “Globalization [manufactured exports +, 
outward foreign direct investment -, and manufacturing imports from developing 
countries -] has played an important, independent role in the deindustrialization 
of advanced industrial countries.” (See also Alderson 1997.) 

Despite these contributions, several compelling critiques have emerged. 
First, globalization has probably been too small to be the main cause of 
deindustrialization. Though globalization has increased in all affluent democracies, 
globalization remains small relative to the size of the economy in countries like 
the United States (Bairoch 1996, Gordon 1994, Krugman 1996, Wade 1996). 
Second, the image of globalization that motivated these earlier studies was 
an unrepresentative portrait of international economic exchange. Imports from 
or investment fleeing to developing countries is a relatively small part of total 
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globalization, and most international economic exchange occurs among and 
between the developed democracies (Dicken 2003, Gilpin 2001; Guillén 2001a, 
Hirst and Thompson 1996). Third, studies emphasizing globalization's impact 
on deindustrialization often failed to acknowledge the role of rising economic 
development (except Alderson 1997, 1999). Without considering rising economic 
development, it is unclear whether the relationship between globalization and 
deindustrialization is spurious or causal. 

These critiques contributed to the formation of an alternative general 
explanation for the rise and decline of industrialization in affluent democracies 
— the Rowthorn model (Rowthorn and Wells 1987; Rowthorn and Ramaswamy 
1997, 1998). In this model, deindustrialization is principally caused by rising 
productivity as measured by an inverted U-shaped relationship between GDP 
per capita and manufacturing employment. As a country develops economically, 
agricultural employment is replaced by industrial employment (Bluestone and 
Harrison 2000:72, 227). This occurs until a country reaches a point of industrial 
maturity, after which service jobs replace manufacturing jobs and service sector 
growth outpaces that of the industrial sector. Wolf (2004:178) explains: “If 
growth of labour productivity is higher than the growth of output, employment 
must shrink. If growth of productivity is sufficiently higher in manufacturing than 
in the rest of the economy, the employment share of manufacturing will fall, 
even if the employment level does not.” Hence, deindustrialization mainly occurs 
because fewer manufacturing workers are needed to produce manufacturing - 
goods (Baumol 1989, Bluestone and Harrison 2000:227-29). Rowthorn and 
Ramaswamy (1997) explain, “Deindustrialization is not a negative phenomenon, 
but is the natural consequence of the industrial dynamism in an already developed 
economy.” This account offers a parsimonious and powerful explanation focusing 
on internal factors instead of globalization. 

Recent research supports the Rowthorn model (Dewatripont et al. 1999, 
Feinstein 1999, Rowthorn and Ramaswamy 1998). Rowthorn and Ramaswamy 
(1997) show that the declining share of manufacturing employment has mirrored 
the decline in the share of manufacturing value added in the economy. After 
controlling for development and rising productivity, globalization has, at most, 
very small effects on manufacturing (Dewatripont et al. 1999, Golub 1999, Wolf 
2004). Burtless and colleagues (1998) show that had the United States never 
experienced a trade deficit from 1964-1992, manufacturing trends would have 
been almost identical. Though they acknowledge that North-South trade may 
explain slightly less than one-fifth of deindustrialization in advanced economies, 
Rowthorn and Ramaswamy (1998) contend that internal factors like rising 
productivity are the paramount causes. While finding that globalization does 
matter, Alderson (1999) concludes that deindustrialization is mainly caused by 
the Rowthorn model and would have occurred without globalization. At this 
point, the Rowthorn model probably represents the consensus explanation in 
studies of deindustrialization. 


The Differentiation-Saturation Model 


Despite the value of the Rowthorn model, we suggest three unexplored 
limitations. First, we question the naturalistic imagery of rising worker productivity 
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mechanically leading to firms and countries evolving away from manufacturing 
jobs. While firms often respond to increasing productivity by adjusting work 
forces, we are skeptical that economic phenomena function so simply and 
mechanistically as to preclude other influences. Second, while the Rowthorn 
model concentrates on the internal dynamics of firms — e.g., technology and rising 
productivity — economic sociologists have demonstrated that external factors 
drive much firm behavior (Fligstein 2001). We advocate for a sharper focus on the 
relations among manufacturing firms, through competition and isomorphism, for 
example. Third, the Rowthorn model presumes a linear relationship between 
globalization and manufacturing across all countries and historical periods. To our 
knowledge, no study has tested a curvilinear relationship. In every.study we could 
locate, only the linear effect of globalization was considered. This is an important 
omission, and there are good reasons to expect a non-linear relationship, such as 
qualitative changes in how globalization operates as it grows. 

Theoretically, we propose that globalization should be modeled with a 


Figure 2. Conceptual Model of Curvilinear Relationship between Globalization and Manu- 
facturing Employment 
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curvilinear relationship. Initially, globalization will cause a growth of manufacturing 
employment's share of the labor force. The globalization of a domestic economy 
at low levels will foster a differentiation of manufacturing industries that actually 
cultivates an increasing specialization of manufacturing employment across 
different countries. Subsequently, greater levels of globalization will cause a 
decline of manufacturing employment. As economies move from moderately 
to highly globalized, saturation will undermine manufacturing employment. 
While rising productivity may contribute to deindustrialization, we argue that 
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globalization has played a role as well. We present a conceptual model of this 
causal relationship in Figure 2. 


Differentiation 


At lower or initial levels, globalization has the potential to cause increased 
manufacturing employment (Gordon et al. 1982). Obviously, this can occur if we 
are referring to globalization as exports and inward investment. For example, 
Alderson (1999) finds that net manufactured exports significantly increase 
manufacturing's share of the labor force. Rowthorn and Ramaswamy (1997) 
find that total exports and exports to developing countries boost manufacturing 
employment. If countries are selling their manufactured goods to other countries, 
it is clear that globalization can boost manufacturing employment (e.g., Japan, 
Gao 2001). 

Beyond this, globalization can boost manufacturing employment if 
globalization entails the total flows of goods, services and investment across 
national boundaries and enhances the capacity of domestic manufacturers to 
specialize in certain products. The idea of specialization has been longstanding in 
the economics of trade. The Heckscher-Ohlin (or H-O) model of trade postulates 
that countries specialize in different products based on their comparative costs 
and advantages relative to other countries (Gilpin 2001:206). Indeed, Rowthorn 
and Ramaswamy (1998:11) acknowledge, "The pattern of trade specialization - 
in manufacturing among the advanced countries is an important factor that 
accounts for the variation in the structure of employment from one advanced 
country to another." 

For example, the United States might be able to increase manufacturing 
employment in durable goods if manufacturing firms can acquire parts and 
components for cheaper prices from other countries. With the extra revenue 
and stable profits from such transactions, U.S. manufacturers may be able to 
retain current manufacturing employees and invest in personnel, equipment and 
facilities. In turn, manufacturing employment could remain stable or actually 
increase while worker productivity increases, albeit in a specialized area. 

The other countries supplying parts and components need not be developing 
countries. While much of the research on the H-O model presumed. a 
comparative advantage in manufacturing would result from a surplus of cheap 
labor, a comparative advantage can come from a variety of sources. Countries 
can have a comparative or competitive advantage in manufacturing because of 
state policies, public investments and institutionalized traditions (Porter 1990). 
Many countries maintain a specialization in certain manufactured products even 
after their labor has become more expensive and productive. In part, this reflects 
countries’ distinct, historical trajectories of economic development (Berger 
and Dore 1996). Guillén (2001a:246) explains, "German, French, Japanese, and 
American firms are competitive in the global economy, but rarely in the same 
industry and market segment." Guillén elaborates that Germany specializes 
in high-quality engineering intensive industries such as machinery, luxury 
automobiles and chemicals (Streeck 1991); France specializes in large-scale 
technical industries including high-speed trains, satellites, rockets and nuclear 
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power (Storper and Salais 1997); and Japan specializes in assembled goods, i.e., 
household appliances, electronics and automobiles (Gerlach 1992). Economic 
sociologists have demonstrated that a country's comparative advantage reflects 
institutional histories (Orru et al. 1997). Biggart and Guillén (1999) and Guillén 
(2001b) show that countries specialize in certain manufacturing industries as a 
result of matching their institutionalized patterns of authority and organization to 
global markets. Because of these sociological comparative advantages, countries 
at many levels of development can sustain a manufacturing presence through 
globalization (Guillén 2001b). 

As stated above, most of the goods and services that flow into the United 
States do not come from developing countries (Gilpin 2001:207). Rather, 
countries supplying parts to U.S. manufacturers simply need to be countries that 
can specialize in cheaply producing those parts because of a comparative or 
competitive advantage. Those advantages need not emerge solely from cheap 
labor, but can emerge from a host of alternative sources. As Gilpin (2001:208) 
explains, "Most trade has taken place, contrary to H-O theory, between countries 
with similar factor endowments; most exports of industrialized economies go to 
other industrialized countries.” 

Another reason specialization characterizes the relationship between 
globalization and manufacturing is because much of the international exchange 
we observe involves transfers within multinational corporations. As Gilpin 
(2001:210) explains, “A substantial proportion of world trade now takes place 
as intra-firm transfers at prices set by the firms and as part of global corporate 
strategies. . .In the late 1990s, over 50 percent of American and Japanese trade 
was intrafirm trade.” Gilpin (2001:297) even reports, “One-half of all imports and 
exports in the world economy are estimated to be transactions between parent 
corporations and subsidiaries.” Following the prior example, a multinational 
corporation might import components from Mexico into the United States, 
where U.S. manufacturing workers might perform the final assembly. If this firm 
is behaving efficiently, the cheaper components from the Mexican division might 
lead to manufacturing job retention in the United States.’ Regardless of the 
specific transactions, this process involves a division of labor between countries, 
as each uses comparative and competitive advantages to specialize in different 
manufacturing products. At the same time, globalization can enhance countries’ 
capacities to sell their products in other countries by using those advantages 
(Doner 1991).? If this differentiation characterizes the relationship between earlier 
or lower levels of globalization and manufacturing employment, we can expect a 
positive linear effect of globalization on manufacturing employment. 


Saturation 


As affluent democracies move to higher levels of globalization, we expect 
manufacturing employment will decline. At these subsequent, higher levels of 
globalization, saturation characterizes the relationship between globalization and 
manufacturing. Three complementary and interrelated processes are involved 
in saturation: competition, mimetic isomorphism and spatialization. These 
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processes only take hold at higher levels of globalization; competition, mimetic 
isomorphism and spatialization are less likely to occur at modest levels. 

First, when globalization exceeds a certain threshold, domestic manufacturers 
begin to face heightened international competition. In recent decades, more 
countries generated the institutional infrastructure to facilitate the development of 
manufacturing firms that can compete internationally (Dicken 2003:44, McMichael 
2000:164). As a result, domestic manufacturers faced a greater density of 
competitors and found it more difficult to maintain the same employment levels 
(Carroll and Hannan 2000:Ch.19).3 With higher levels of globalization, developing 
countries have further expanded their manufacturing workforces. These younger 
firms in industrializing countries can produce manufacturing goods more 
efficiently partly because of lower labor costs. Just as important, however, are 
modern facilities and technology. Rather than upgrading or refurbishing older 
facilities, industrializing countries can expand manufacturing with more recent 
investments in modern technology (e.g., South Korean steel). For example, in the 
boom of Japan's peak globalization, cheaper imports and parts from Southeast 
Asia — fueled by Japanese financial investment and speculation — undercut small- 
and medium-sized Japanese manufacturing firms (Gao 2001). Moreover, this 
competition from greater globalization facilitates exchange between developed 
countries, which can further undermine the manufacturing sector. 

Our view of this competition is consistent with earlier research that suggested 
that globalization reduces manufacturing's share of the labor force (Wood 1994). 
However, we expect that this competition only undermines manufacturing 
employment when globalization reaches higher levels (Rodrik 1997). Lower 
levels of globalization have little potential to reduce manufacturing employment 
substantially. Competition will be reflected in traditional measures of globalization's 
. "threat" to developed countries: imports and outward investment (direct and 
portfolio), for example. As imports and outward investment reach higher levels, 
domestic manufacturing firms will have more difficulty competing and will likely 
experience deindustrialization. 

Second, when globalization reaches higher levels, a process of mimetic 
isomorphism may contribute to a decline of manufacturing employment (DiMaggio 
and Powell 1983). Mimetic isomorphism occurs when firms in an industry copy 
each other's practices (Han 1994, Haveman 1993). Globalization creates an 
environment of greater uncertainty for manufacturing firms: the breakdown of 
traditional Fordist production regimes, volatile costs and revenue with currency 
fluctuation and speculative investment, higher rates of business failure, and an 
increased velocity of transactions. In this uncertain context, manufacturing firms 
may mimic the practices of other firms in their fields (Guillén 2001b, Guler et 
al. 2002, Martin et al. 1998).* For example, the adoption in the United States 
of Japanese justin-time production practices highlights the wider observable 
field a U.S. firm has in a highly globalized context (Womack et al. 1991). When 
a few leading manufacturing firms reduce employees or outsource production 
to other (not necessarily developing) countries, others may copy this strategy. 
When some manufacturers shift to more technologically intensive production 
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that utilizes fewer employees, other firms can pay close attention and at least 
consider doing the same. 

This mimetic isomorphism can occur with or without heightened competition 
for all manufacturing firms. As Haunschild and Miner (1997:474) explain, "It 
is precisely the larger number of other organizations enacting a practice that 
enhances legitimacy or endows a practice with a taken-for-granted status." In a 
highly globalized environment, if some manufacturers face greater competition 
or greater uncertainty and reduce their workforces, other firms, confronting that 
same uncertainty, may follow suit, and a broader process of deindustrialization 
can result (Haunschild and Miner 1997:491). Measures of trade (exports A 
imports) and investment openness (inward + outward) or what we term “total 
globalization” below should gauge this notion of a highly globalized environment. 
When these measures of economic openness reach a higher level, manufacturing 
firms will be operating in an environment where most firms are engaging in 
higher levels of international economic exchange. These manufacturing firms are 
likely to be exposed not just to competition, but also to a cross-fertilization of 
ideas, technology and management strategies. We suggest this highly globalized 
environment will result in the mimetic isomorphism of manufacturing decline.’ 

Third, higher levels of globalization may trigger what Wallace and Brady (2001) 
call spatialization. Spatialization is their term for the most recent institutional 
environment of capitalist development and management-labor relations — what 
Gordon etal. (1982) calla social structure of accumulation. Spatialization centers on 
employers’ quest for the optimal spatial or global arrangement of their operations 
and the use of relocation or threats of relocation to control workers and limit their 
demands (Grant and Wallace 1994). In a highly globalized economy, the use and 
threats of relocation enhance managerial power relative to that of labor (Brady 
and Wallace 2000, Harrison 1994). In such a globalized context, these threats of 
relocation and relocation by other manufacturing firms have greater credibility, 
and in turn, influence (Wallace and Brady 2001). As a result, management can 
force labor (and especially organized labor) to accept downsizings and flexible 
work arrangements, limit workforce expansions, and compel the fewer remaining 
workers to increase hours and productivity (Sallaz 2004). 

The core manifestation of spatialization is outward investment (direct and 
portfolio) (Grant and Wallace 1994). Brady and Wallace (2000) also find that 
inward investment undermines labor, and this might leverage managers’ power 
in reducing manufacturing jobs. As firms close facilities in a home country and 
invest in new facilities in a host country, manufacturing obviously declines in 
the home country. The sheer economic openness also involves spatialization as 
it reflects the optimal efficient strategy of managers and owners to maximize 
production while cutting back employment. Hence, as trade and investment 
openness increase, deindustrialization is likely to occur because of the weaker 
position of labor in protecting manufacturing jobs. Ultimately, a highly globalized 
economy allows managers and capitalists to force through contractions in 
manufacturing employment. Along with competition and mimetic isomorphism, 
spatialization should lead to deindustrialization. 
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Methods 


` 


Our analysis includes 18 affluent democracies from 1960-2001 (Australia, Austria, 
Belgium, Canada, Denmark, Finland, France, Germany, Ireland, Italy, Japan, 
Netherlands, Norway, New Zealand, Sweden, Switzerland, United Kingdom 
and United States). The unit of analysis is the country-year. Because of missing 
data, the models with several of the globalization variables include less than the 
maximum number of cases. Thus, we note the number of cases for every model 
below. As a more temporally balanced sensitivity test, we re-estimated all models 
with the years 1975-2001 only, and the results and conclusions were consistent. 
Our study has the largest, longest and most up-to-date sample for any study of 
this kind.* 

OLS regression is inappropriate for this data because of unmeasured time- 
invariant heteroscedasticity across countries and serial correlation between years. 
Directly following similar studies (Alderson 1999; Rowthorn and Ramaswamy 
1997, 1998), we estimate fixed effects models with a first-order autocorrelation 
correction (FE-AR1). In addition, we present a second technique. Since most 
of the time series of the variables are non-stationary, we detrend all variables 
by taking the first differences (Xt-Xt1). After first differencing, we use Beck and 
Katz's (1995, also Beck 2001) technique of OLS with panel corrected standard 
errors (OLS-PCSE). After first differencing, these models do not contain serial 
autocorrelation, so we do not need to correct for it in the OLS-PCSE models. 
OLS-PCSE is increasingly popular in political science and sociology, even though, 
to our knowledge, it has not been used to estimate models of manufacturing 
employment.’ All analyses were conducted in Stata. 

The descriptive statistics, number of cases and sources for all variables 
are presented in the Appendix. Because our time-period and number of cases 
sometimes varies across models, we present the descriptive statistics for all 
possible cases 1960-2001 and for the 1975-2001 sub-sample. The proximate 
source for many of our variables was Huber et al. (2004). 

The dependent variable is manufacturing employment as a percent of the tota/ 
labor force. This is the same dependent variable from the same source as other 
recent studies (Alderson 1999). This consistently collected and standardized 
variable provides cross-nationally and historically comparable data on the share 
of the labor force employed in industry. 

Our independent variables begin with a basic set of controls reflecting the 
Rowthorn model. All independent variables are measured in the current year. 
Our first three variables replicate Alderson's (1999) model. First, we include gross 
domestic product (GDP) per capita and GDP per capita squared. Measured in 
real purchasing power parity dollars, this variable assess the level of economic 
development and productivity. According to past research, the main GDP per 
capita term should have a positive effect, while its square should have a negative 
effect. Second, we include unemployment measured as the percent of the labor 
force unemployed. Unemployment should have a negative effect. 

We also innovate beyond past studies by supplementing the Rowthorn 
model with two variables. As discussed above, these variables were discussed 
within the Rowthorn model, but past analyses have not included them. Our 
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third control variable is agricultural employment as a percent of the total labor 
force. Manufacturing grew as a percent of the labor force as a function of the 
decline of agriculture’s share.® In turn, we expect agricultural employment to be 
negatively related to manufacturing employment. Finally, we control for inflation 
measured as the annual rate of change in the cost of living index.? Along with 
unemployment, inflation controls for the business cycle. 

Our study aims to improve upon existing studies of globalization and 
manufacturing employment. Hence, three concerns guide our operationalization 
of globalization (Brady et al. 2005). First, we only examine actual economic 
exchange and are not interested in just the political or legal possibility of economic 
exchange.'? Like past research (Alderson 1999), we concentrate on the actual 
flows of finance, goods and services across international boundaries. 

Second, we focus on the largest dimensions of international economic 
exchange. Thus, we do not limit ourfocus to exchange with developing countries. 
Sometimes, analysts do so in order to test precise hypotheses concerning 
investment fleeing to or imports from developing countries (Alderson 1999; 
Rowthorn and Ramaswamy 1997, 1998). Unfortunately, doing so omits the vast 
majority of globalization experienced by affluent democracies. Most international 
economic exchange occurs among affluent democracies, so it is essential to 
incorporate investment in and trade with all countries (Gilpin 2001, Hirst and 
Thompson 1996). 

Third, we attempt to provide a comprehensive analysis of the multiple facets of 
globalization. Often, globalization has been measured with only a few convenient 
indicators like trade openness. We embrace the reality that the measurement 
of globalization is contested, and the literature has yet to converge on a single 
measure (Guillén 2001a, Sklair 1999). Thus, our analysis features a wide variety 
of globalization indicators. This is essential because we are interested in both 
globalization that might directly influence manufacturing as competition (e.g., 
imports and outward FDI) and a highly globalized economy that might indirectly 
influence manufacturing through mimetic isomorphism and spatialization. We 
do not think it is possible (or advisable) to select only a few indicators as perfect 
tests of globalization's effects (Guillén 2001). 

Concretely, we examine 12 indicators of international investment and trade as 
measures of globalization. Each is a measure of the amount of globalization in a 
country (i) in a year (t). For example, we have data on the level of exports in the 
United States in each year from 1960-2001. Each is expressed as a percent of 
GDP and we examine both the actual values and their squared terms. (1) Outward 
portfolio investment (PI) is bond or equity investment that amount to less than a 
10 percent ownership of foreign firms by domestic investors. (2) /nward portfolio 
investment (PI) is bond or equity investment representing less than 10 percent 
ownership of domestic firms by foreign investors. (3) Outward foreign direct 
investment (FDI) represents capital flows where a domestic firm acquires at least 
a 10 percent ownership share and management of a foreign firm or facility. (4)/ 
Inward foreign direct investment (FDI) represents foreign acquisition of at least 
a 10 percent ownership share and management of a domestic firm or facility." 
(BY Investment openness is the sum of inward and outward PI and FDI. (6) Net 
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investment is the sum of inward PI and FDI minus the sum of outward PI and 
FDI. (7) Exports are the value of all goods and services flowing out of a domestic 
economy and into a foreign economy. (8) /mports are the value of all goods and 
services flowing into a domestic economy from a foreign economy. (9) 7rade 
openness is the sum of exports and imports. (10) Net trade is the difference 
between exports and imports — positive values indicate a trade surplus and 
negative values indicate a trade deficit. (11) 7ota/ globalization sums trade 
openness and investment openness. (12) sums net investment and net trade, 
and represents the extent to which countries export more than they import and 
receive more investment than they send out. 

We use two-tailed tests of significance for all variables. The 12 globalization 
indicators collectively represent our differentiation-saturation model. Reflecting 
differentiation, we anticipate that all 12 will have positive effects in the linear term. 
The different indicators should also capture the various dimensions of saturation 
— competition, isomorphism and spatialization — through negative effects in the 
squared term. Outward (portfolio and direct) investment, imports, negative net 
investment, negative net trade and negative net globalization reflect the concepts 
of competition and spatialization. Though positive manifestations of globalization 
and hence reflecting differentiation, inward investment, exports, positive net 
investment, positive net trade and positive net globalization also reflect a highly 
globalized environment where mimetic isomorphism of manufacturing decline 
may occur. Finally, investment openness, trade openness and total globalization 
are measures of the extent of globalization of a nation's economy and hence 
are direct measures of the mimetic isomorphism element of our differentiation- 
saturation model. 


Results 


We first present the basic model with the two techniques in the two periods 
(1960-2001 and 1975-2001). Second, we examine the linear effects of the 12 
globalization indicators net of the basic model - including one globalization 
measure at a time. Third, we examine the curvilinear relationship between 
globalization and manufacturing employment net of the basic model. Finally, 
we analyze subsamples to explore the conditional effects of globalization in 
different contexts. For all variables, we present the unstandardized coefficient, 
the standardized coefficient in bold and italics, and the tscore in parentheses. ' 


Basic Models 


In Table 1, we present models with all five control variables for two techniques 
(FE-AR1 and first difference OLS-PCSE) and two periods (1960-2001 and 1975- 
2001). Consistent with past research, our analysis provides evidence that there 
is an inverted U-Shaped relationship between per capita GDP and manufacturing 
employment. With FE-AR1, GDP per capita has a significant positive effect and the 
squared term has a significant negative effect for both 1960-2001 and 1975-2001. 
This finding, however, is less robust in the OLS-PCSE models. In the 1960-2001 
period, the squared term is only near statistical significance. In the 1975-2001 
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period, both the linear and squared GDP measures are near, but not statistically, 
significant. After detrending the data and using OLS-PCSE, our study provides 
only limited evidence of a curvilinear relationship between GDP per capita and 
manufacturing employment. 


Table 1: Basic Models of Manufacturing Employment as a Percentage of the Labor Force in 
18 Affluent Democracies 





1960-2001 1975-2001 
First Difference First Difference 
FE-AR1 OLS-PCSE FE-AR1 OLS-PCSE 
GDP Per Capita .0003** .0002* .0004*** .0002 
.216 .203 .326 .245 
(2.31) (1.82) (2.69) (1.57) 
GDP Per Capita’ -4.49 E-9** -3,42 E-9 -9.60 E-9*** -5.02 E-9 
-.151 -.164 -.316 -.249 
(-2.07) (-1.50) (-3.03) (-1.59) 
Unemployment EIS -.570*** -.502*** -.526"* 
-.294 -.625 -.332 -.666 
(-20.45) (-19.66) (-17.91) (-18.40) 
Agriculture as a -,519*** $5539 -.260** -.341** 
% Labor Force -.462 -.266 -.163 -.114 
(-7.60) (-9.50) (-2.45) (-3.35) 
Inflation .045*** .039*** 10324 We ee 
.026 .109 .026 .088 
(4.58) (3.55) (2.61) (2.47) 
Constant -1.469*** -474*** 16.280*** -.428*** 
(-38.72) (-11.92) (231.08) (-12.42) 
Between R° .089 .091 
Within R° .523 .541 
Overall R° .013 .539 .116 557 
N 738 738 468 468 
ow D] 2n S05 fo 10 


Notes: For each independent variable, the unstandardized coefficient, standardized coefficient 
in bold, and t-score in parentheses are displayed. 


The other three variables are significant in every model. Unemployment 
significantly decreases manufacturing employment, and inflation has significant 
positive effects. Also, we find that agricultural employment has a significant 
negative effect. As a share of the total labor force, the decline of agricultural 
employment coincided with an expansion of manufacturing employment. 
Some countries experienced slower or delayed industrialization because of the 
persistence of agriculture, and the rise of manufacturing was partly influenced 
by the decline of agricultural employment. In turn, it is valuable to control for 
agricultural employment. | | 

One can interpret the magnitude of these effects with the standardized 
coefficients. Unlike past research, our analysis does not clearly provide evidence 
that economic development and rising productivity (GDP and GDP2) are the most 
powerful influences on manufacturing employment. In terms of standardized 
coefficients, the main differences across models appear across the two historical 
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eras. In the 1960-2001 period, manufacturing employment is principally accounted 
for by the decline in agricultural employment and unemployment. Economic 
development has a sizable influence, but it is secondary to those variables. 
Inflation has a smaller effect. In the 1975-2001 period, economic development and 
unemployment appear to be the most influential causes, followed by agricultural 
employment and inflation. 

In contrast to past research (Alderson 1999; Rowthorn and Ramaswamy 1997, 
1998), the fit of our models is considerably weaker. This is partly because different 
statistical packages generate different estimates of conventional statistics like 
R2 (e.g., Alderson used LIMDEP; we use Stata). This also could be due to the 
fact that others logarithmically transform the independent variables in order to 
minimize the effect of outlying values. Finally, this could be due to the longer 
time-period we analyze and the resultant increased heterogeneity. Regardless, 
the FE-AR1 1960-2001 model has a surprisingly weak fit. The FE-AR1 1975-2001 
model fits the data much better, but is still relatively weak. The OLS-PCSE models 
fit the data dramatically better than the FE-AR1. models - regardless if one is 
comparing the 1960-2001 or 1975-2001 models. Plausibly, the better fit of the 
OLS-PCSE models might suggest that these models are at least as valuable as 
the conventional FE-AR1 models for understanding variation in manufacturing 
employment. 


Linear Globalization Effects Models 


In Table 2, we display the FE-AR1 results from 12 different models. In each of these 
models, we include one of the 12 globalization measures and the five control 
variables. In Table 2, five of the coefficients reach statistical significance, while 
- seven — including the total globalization summary measure - are insignificant. 
The significant coefficients are not in a direction that suggests that globalization 
is causing deindustrialization. Outward Pl, imports and trade openness all 
significantly increase manufacturing employment. As domestic firms increase 
their portfolio investment abroad, as more imports flow into an economy, and 
as an economy experiences more total trade, manufacturing increases. Unlike 
earlier claims that globalization had linear negative effects on manufacturing, this 
suggests that globalization increases manufacturing. Even the negative effects 
for net trade and net globalization suggest that globalization is not initially causing 
deindustrialization. The more countries export relative to importing and receive 
rather than send out globalization, the less manufacturing employment exists. 
These results are consistent with our argument that globalization augments 
manufacturing through differentiation. 

All five significant globalization variables have noticeably small standardized 
coefficients. With a standard deviation increase in outward Pl, manufacturing 
employment is expected to increase by about .01 standard deviations, holding 
all other variables constant at their means. With a standard deviation increase in 
imports, manufacturing is expected to increase by .06 standard deviations. With 
a standard deviation increase in trade openness, manufacturing employment 
is expected to increase by .04 standard deviations. With a standard deviation 
increase in net trade and net globalization, manufacturing employment is 
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expected to decline by .02 and .01 standard deviations. The only control variable 
that has a smaller effect than these globalization coefficients is inflation, and 
this is only for two of the significant globalization coefficients. By contrast, 
GDP per capita and its square, unemployment and agricultural employment as 
a percent of the labor force (occasionally insignificant) have far larger effects 
than the significant globalization effects. Thus, even where globalization is 
significant, its effect is relatively small in comparison with other predictors of 
manufacturing employment. 

At the same time, it is noteworthy that GDP per capita and/or its square are 
insignificant in some models. In the models featuring inward PI, investment 
openness, net investment, total globalization and net globalization, GDP per 
capita is near but does not reach statistical significance. In the models featuring 
investment openness, net investment, total globalization and net globalization, 
neither GDP per capita nor its square is statistically significant. Since the 
curvilinear relationship between GDP per capita and its square is probably the 
dominant explanation for deindustrialization, these non-significant results are 
quite surprising. Controlling for some globalization variables, GDP per capita and 
its square lose significance and the evidence for the Rowthorn model is weaker. 

In Table 3, we analyze the linear globalization effects using OLS-PCSE. These 
results are remarkably consistent with the FE-AR1 results in Table 2. Again, 
the same five globalization variables have significant effects, while seven are - 
insignificant. Though their t-scores are slightly different from the FE-AR1 models, 
outward Pl, imports, and trade openness significantly increase manufacturing 
employment, while net trade and net globalization significantly reduce it. Again, the 
globalization coefficients do not show that globalization causes deindustrialization. 
Rather, the evidence suggests globalization entails differentiation and increased 
manufacturing. In the OLS-PCSE models, the significant globalization coefficients 
have slightly larger effects than under FE-AR1 estimation. Still, like the FE-AR1 
models, the significant globalization variables are comparable with the effects 
of inflation and are considerably smaller than unemployment and agricultural 
employment. Similarly, the globalization coefficients have smaller effects than 
the significant GDP per capita effects. 

The GDP per capita variables have even more surprising results in the OLS- 
PCSE models than in the FE-AR1 models. GDP per capita has a significant effect 
in only two of the models. In those models (featuring exports and net trade), the 
GDP per capita is only significant at the .10 level. GDP per capita squared does 
not have a significant effect in any of the models. The GDP per capita variables 
are only significant in two of the four models featuring trade variables with the full 
738 cases. As demonstrated in Table 1, the GDP per capita variables are simply 
not as significant under OLS-PCSE estimation. This provides further evidence that 
the Rowthorn model is less robust with this alternative estimation technique. 


Curvilinear Globalization Effects Models 


In Table 4, we display the FE-AR1 results from the 12 different models. In each 
model, we include one of the globalization measures, its square and the five 
controls. For seven of the globalization measures, either the globalization measure 
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or its square is significant. In three models — featuring investment openness, net 
trade and net globalization — the main globalization effect is significantly positive 
while the squared term is insignificant. In the models for investment openness 
and net trade, the squared term is nearly significantly negative. Nevertheless, for 
these three measures, globalization appears to have a linear effect. These three 
findings are generally consistent with Table 2. With a standard deviation increase 
in investment openness, manufacturing employment is expected to increase 
by .03 standard deviations. With a standard deviation increase in net trade, 
manufacturing is expected to decline by .02 standard deviations. With a standard 
deviation increase in net globalization, manufacturing is expected to decrease 
by .01 standard deviations. All three of these linear effects are small — only 
comparable with inflation, and smaller than the other independent variables. 

More importantly, in four models, globalization has a curvilinear effect 
on manufacturing employment. Exports, imports, trade openness and total 
globalization have a significant positive effect in the main term and a significant 
negative effect in the squared term. The main effect of exports has a standardized 
coefficient (beta) of .11 and its square has a beta of -.09. The main effect of 
imports has a beta of .20 and its square has a beta of -.13. The main term for 
trade openness has a beta of .20 and its square has a beta of -.13. The main 
effect of total globalization has a beta of .08 and its square has a beta of -.04. 
These curvilinear effects are larger than the significant linear effects above. In 
these models, the significant curvilinear effects are larger than inflation, but 
smaller than unemployment and agricultural employment. Notably, the effects 
for imports and trade openness are similar in size to the GDP per capita variables. 
Thus, imports and trade openness have similar effects to economic development 
and productivity. Exports and total globalization have smaller effects than the 
GDP per capita variables. 

The analyses of curvilinear effects of globalization in Table 4 provide further 
scrutiny of the Rowthorn model. Consistent with Tables 2 and 3, unemployment 
and agricultural employment principally influence manufacturing employment. 
Those two have the largest standardized coefficients in most models, and are 
the most robustly significant. Interestingly, the GDP per capita variables have 
nearly identical significance levels as in Table 2. The only departure is that GDP 
per capita squared is not quite significant in the investment openness model. 
As in Table 2, either GDP per capita or its square fails to reach significance 
in five of the twelve models. Thus, these analyses provide further evidence 
that GDP per capita and its square do not have completely robust effects on 
manufacturing employment. 

In Table 5, we display the OLS-PCSE results featuring each globalization 
variable and its square. The results with OLS-PCSE are very consistent with 
the FE-AR1 results. Again, seven of the globalization indicators have significant 
effects. The only difference is that both investment openness and its square are 
statistically significant, while the square of investment openness was previously 
only near significant. In addition, a curvilinear relationship between globalization 
and manufacturing employment is observed for exports, imports, trade 
openness, and total globalization. Consistent with every other table, net trade 
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and net globalization have significant negative effects in the main term. For those 
two, globalization has a relatively small effect on manufacturing (betas of -.03 
and -.05). Though globalization has a curvilinear effect in five models, its various 
effects are only larger than inflation and occasionally agricultural employment 
and the GDP variables. The main term for investment openness has a beta of .10 
and its square has a beta of -.09. Exports has standardized coefficients of .12 and 
-.10; imports has standardized coefficients of .28 and -.20; trade openness has 
standardized coefficients of .23 and -.17; and, total globalization has standardized 
coefficients of .20 and -.17. 

As with the earlier tables, unemployment consistently has the most powerful 
effect. The beta for agricultural employment fluctuates, but is generally the second 
largest significant effect. GDP per capita is only significant in two models. Though 
occasionally near significant, GDP per capita squared is never significant. After 
controlling for the globalization variables and their squares in OLS-PCSE models, 
GDP per capita has little relevance to explaining manufacturing employment. 
Hence, these results challenge the Rowthorn account. Ultimately, manufacturing 
is shaped principally by unemployment and agricultural employment and 
secondarily by globalization and inflation. 


Conditional Effects Across Subsamples 


In Table 6, we decompose our sample into liberal/uncoordinated (LME) and ` 
coordinated market economies (CME), Europe and non-Europe countries, and 
1960-1975 and 1976-2001 periods.'? These analyses explore whether globalization 
has similar effects in each of these different contexts. While the effects of all 12 
globalization indicators and the FE-AR1 models are consistent, we confine our 
presentation to total globalization in OLS-PCSE. All models control for the same 
variables as above (not shown). 

The first rows of the first two columns show that total globalization had a 
positive linear effect in LMEs but did not have a significant linear effect in CMEs 
and the model fit is much better for LMEs. The second set of rows reveals that 
total globalization has a positive and negative curvilinear effect in LMEs. In CMEs, 
total globalization has a nearly significant positive linear term and a significantly 
negative squared term. In both contexts, globalization reduces manufacturing 
when globalization reaches higher levels. Despite many fewer LME cases, 
globalization appears to have more powerful effects in LMEs than CMEs. 

The next two columns display the decomposition of European and non- 
European countries. Total globalization has a linear positive effect in European 
countries, and does not have a significant linear effect in non-European countries. 
Total globalization has positive and negative significant curvilinear effects in 
European countries, but does not have significant curvilinear effects in non- 
European countries. Interestingly, however, the model fit is much better in both 
sets for non-European countries. 

On one hand, these results show a pattern where our differentiation- 
saturation model better explains the deindustrialization of LMEs and 
European countries. Plausibly, the Rowthorn model better explains the 
deindustrialization of CMEs and non-European countries. On the other hand, 
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it is perplexing that globalization has significant effects in LMEs and Europe, 
but not in CMEs and non-Europe. With the exception of Ireland and the United 
Kingdom, the LMEs are all outside Europe. Ultimately, the evidence is mixed 
and we cannot simply conclude if our differentiation-saturation model works 
more effectively in a discrete set of countries. 

The final two columns display results for the 1960-1975 and 1976-2001 periods. 
Total globalization has a significant positive effect in the earlier period, but notin the 
later period. In the second set of rows, total globalization has a significant positive 
effect in the linear term in the earlier period but is not significant in the squared 
term. Even at higher levels, globalization does not cause any deindustrialization 
in the earlier period. Consistent with our differentiation-saturation model, when 
globalization first expanded from 1960 through 1975, international investment 
and trade (at lower levels) contributed to industrialization, not deindustrialization. 
After 1975, total globalization has significant (positive and negative) curvilinear 
effects. Our evidence suggests that globalization triggered deindustrialization in 
the latter period when globalization reached higher levels. 


Discussion 


This study aims to provide an improved analysis of the relationship between 
globalization, industrialization and deindustrialization. The study is based on . 
the largest, longest and most recent sample of its kind and incorporates the 
most comprehensive set of globalization measures. Moreover, this study 
uniquely incorporates agricultural employment and inflation. To our knowledge, 
our study is the first to model a curvilinear relationship between globalization 
and manufacturing employment. As well, we are the first to propose a theory 
— the differentiation-saturation model - for why globalization should have such 
curvilinear effects. Ultimately, we provide a more rigorous examination of the 
causes of variation in manufacturing employment. 

Our baseline models confirm but innovate beyond recent studies. We 
provide some evidence that GDP per capita has a curvilinear relationship with 
manufacturing employment. As others contend, this reflects the long-term trends 
in economic development and worker productivity. Nevertheless, some of the 
main models fail to support the Rowthorn model. GDP per capita and its square 
are less influential in the OLS-PCSE than in the FE-AR1 models. The evidence 
for the Rowthorn model is weaker in the longer time period (1960-2001) than 
in the most recent period (1975-2001). Though previous studies did not include 
measures of agricultural employment, we find it has a powerful impact. Much of 
the variation observed in manufacturing can be explained by the extent and trends 
in declining agricultural employment. Unemployment also has powerful effects, 
while inflation has a lesser influence. While previous research has converged on 
the Rowthorn model, our empirical analysis provides only partial support. 

Our analysis shows that some aspects of globalization significantly affect 
manufacturing employment, though several measures of globalization fail to reach 
significance. Five globalization measures have significant linear effects in both 
FE-AR1 and OLS-PCSE models. Interestingly, none of these coefficients suggest 
that globalization causes deindustrialization. Rather, these effects consistently 
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show that globalization increased manufacturing employment. We theorized that 
globalization cultivates differentiation and specialization, which contributes to 
manufacturing's share of the labor force. Outward PI, imports and trade openness 
have positive effects, while net trade and net globalization have negative effects. 
As countries send out PI, import goods and services, and engage in international 
trade, manufacturing increases. As countries export more than they import, and 
receive more than they send out international flows, manufacturing declines. These 
results support our differentiation-saturation model, reflecting our expectation that 
globalization initially breeds specialization and increased manufacturing. Though 
the effects are small, including the globalization measures generally weakens the 
influence of GDP per capita and its square. The effects of agricultural employment 
and unemployment remain significant. 

Our analysis of the curvilinear relationship between globalization and 
manufacturing yields several interesting and novel findings. While a few 
globalization measures continue to have significant linear effects, both the main 
term and the square are significant for several globalization measures. Exports, 
imports, trade openness, investment openness (in OLS-PCSE only) and total 
globalization have significant positive and negative effects. These effects are 
larger than the aforementioned linear effects and comparable to the effects of 
GDP per capita and its square. These results provide evidence that low levels of 
globalization result in differentiation and specialization, increasing manufacturing. . 
At higher levels, globalization subsequently results in saturation, reducing 
manufacturing's share of the labor force. 

Our analyses of subsamples provide suggestive evidence that globalization's 
effects slightly differ across national and historical contexts. Globalization 
has significant effects in LMEs and European countries, but not CMEs and 
non-European countries. Globalization had positive effects on manufacturing 
employment in the earlier period from 1960 to 1975. In the initial stages of 
globalization, manufacturing expanded as international trade and investment grew. 
In the later period from 1976 to 2001, however, globalization and manufacturing 
employment had the curvilinear relationship expected by our differentiation- 
saturation model. Of course, these analyses of subsamples should be read with 
caution given the varying samples across models. 

Our study partially supports the Rowthorn model and partially supports our 
differentiation-saturation model. As countries shifted away from agricultural 
employment, and worker productivity and development rose, manufacturing's 
share of the labor force increased. As worker productivity and development 
continued to rise even further, manufacturing declined. At the same time, 
globalization does have significant effects and the evidence forthe Rowthorn model 
is weaker upon considering globalization. Initially, at lower levels, globalization 
leads to differentiation and specialization and increases manufacturing's share of 
the labor force. Subsequently, at higher levels, globalization results in saturation, 
which undermines manufacturing employment. At higher levels, globalization 
generates competition for domestic manufacturers, who find it more difficult 
to maintain their workforces. Globalization breeds mimetic isomorphism as 
manufacturing firms follow other globalizing firms and contract their workforces 
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or shift production overseas. Finally, globalization contributes to spatialization 
by shifting power relations in favor of capitalists and managers and against 
labor. The management and owners of manufacturing firms can force labor to 
accept reduced workforces because their threats of global relocation are more 
credible in a highly globalized economy. Our study suggests that low levels of 
globalization initially cause industrialization, and that higher levels of globalization 
subsequently cause deindustrialization. 

Since our study is the first to explore the later 1990s and model a curvilinear 
relationship between globalization and manufacturing employment, more 
research is certainly warranted. We encourage others to scrutinize the Rowthorn 
model and further explore a curvilinear relationship. Several of our globalization 
measures fail to reach significance, and our results challenge the Rowthorn 
consensus. Hence, our results are ultimately cautious about general explanations 
of deindustrialization and implicitly call for further research. Our study is 
unable to evaluate the causal mechanisms (competition, isomorphism and 
spatialization) that we theorize underlie higher globalization's negative influence 
on manufacturing. One promising direction might be to study the behavior, 
practices and workers of manufacturing firms as their economy or environment 
- and not simply their production — becomes highly globalized (Guillén 2001b). 
Future research with alternative modeling techniques that allow for conjunctural 
causation and multiple causal pathways could be another avenue of research. 
Also, more research can be done on the national-level like most previous studies 
and ours (Alderson 1999). At the very least, our study shows it is mistaken to 
dismiss globalization as a possible influence on deindustrialization. 


Notes 


1. Doner (1991) argues that when Japanese firms invested in Southeast Asia, 
the result was boosted consumer demand in Southeast Asia for Japanese 
products and lessened decline of manufacturing jobs in Japan. 


2. Gilpin (2001:209) notes, "Americans, for example, traditionally like big cars, 
and Europeans, small ones; Americans have tended to possess a compara- 
tive advantage in the former and Europeans in the latter. Yet, there is a mar- 
ket in the United States for small European cars and vice versa." 


3. Consistent with our argument that early globalization increases manufactur- 
ing employment, Carroll and Hannan (2000:9-11) suggest that globalization 
may enhance the development of manufacturing firms when density is low 
in a domestic economy. Globalization may spread the cultural legitimacy of 
a manufacturing industry, so "late" countries can benefit by institutionaliz- 
ing those industries more rapidly. At high levels of globalization, however, 
domestic manufacturing firms face heightened international competition as 
organizational density rises. 


4. We appreciate that mimetic isomorphism connotes the copying of formal 
organizational structures (frequently decoupled from "on the ground prac- 
tices) under conditions of uncertainty and in order to achieve legitimacy from 
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some third party. However, we also believe mimetic isomorphism can involve 
the copying of practices (e.g., downsizing) under conditions of uncertainty 
(e.g., rising globalization). Firms may copy others in order to behave stra- 
tegically" in order to impress shareholders, competitors, creditors and sup- 
pliers, or simply to follow the leaders. To be clear, we do not observe firm 
behavior as that is beyond the scope of this paper. 


Within a highly globalized environment, there is likely to be a shift in prac- 
tices, such as global outsourcing and a focus on core competencies, and 
a growing attention to how firms in other countries manufacture (e.g., the 
implementation of just-in-time manufacturing, etc.), all of which should be 
reflected in manufacturing decline. 


Our analysis includes a maximum of 738 cases and a minimum of 499. At 
our minimum, the number of cases is comparable to Rowthorn and Ramas- 
wamy's (1998) 510 cases, dwarfs Rowthorn and Ramaswamy's (1997) 145 
cases and is greater than Alderson's (1999) roughly 400 cases. Most previ- 
ous studies include 18 affluent democracies (Alderson 1999, Rowthorn and 
Ramaswamy 1998) while Rowthorn and Ramaswamy (1997) analyzed 21 
countries. Previous studies had a shorter time-period and none included data 
after the early 1990s. 


Because the sample includes more years (42) than countries (18), Beck (2001) - 
would argue that the dataset should be considered a pooled time series rath- 
er than a panel. This is the main reason we supplement the panel analysis 
technique (FE AR1) with this pooled time series technique (OLS-PCSE). 


Rowthorn and Ramaswamy (1998:11) acknowledge, "The rising share of 
employment in manufacturing in the industrialization stage of development 
represents to a large degree the movement of employment from agriculture 
to industry." 


In analyses available upon request, we estimated all models while exclud- 
ing agricultural employment and inflation. Doing so does not change our 
findings for the globalization variables. At the same time, however, GDP per 
capita and its square would actually have less robustly significant and small- 
er effects. 


We do not analyze legal measures of investment or trade openness. We ex- 
perimented with a dummy for fixed exchange rates (available upon request). 
If added to the models in Tables 2-5, this dummy is never significant in the 
FE-AR1 models but is occasionally significantly positive in the OLS-PCSE 
models. This variable did not substantially improve the model fit, and most 
of its effect is captured by our globalization indicators. 


. Our delineation between types of investment distinguishes between controlling 


interest (direct) versus investment that is more likely to be speculative, short- 
term and without an ownership stake (portfolio) (Brady and Wallace 2000). 
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12. We calculate standardized coefficients by multiplying the coefficient by the 
standard deviation of the independent variable and dividing by the standard 
deviation of the dependent variable. 


13. We code Australia, Canada, Ireland, New Zealand, the United Kingdom and 
the United States as liberal/ uncoordinated (Hall and Soskice 2001). Descrip- 
tive statistics for the subsamples are available on request. 
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Appendix: Descriptive Statistics, Number of Cases and Sources for Variables 
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IMF, International Financial Statistics 
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Appendix (continued) 
Net Trade 0.607 756 1.254 486 
(3.440) (3.738) 
Net Trade” 12.183 756 15.519 486 
(30.906) (35.428) 
Total Globalization 71.648 517 75.408 447 
(49.583) (51.602) 
Total Globalization" 7,587.202 517 8,343,145 447 
(17,612.1) (18,801.17) 
Net Globalization 1.055 517 1.067 447 
(4.078) (4.299) 
Net Globalization" 17.714 517 19.582 447 


(62.543) (67.028) 
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Redressing Past Human Rights Violations: 
Global Dimensions of Contemporary Social Movements 


Kiyoteru Tsutsui, Stony Brook University 


This paper proposes a theoretical framework to analyze global dimensions of contemporary 
social movements and attempts to answer the empirical question: why did the social 
movement for former comfort women emerge in the late 1980s after more than 40 years 
of silence? The theoretical framework integrates the world polity approach into social 
movement theory to argue that global political and cultural transformations in recent 
years have expanded political opportunities at the global level and intensified international 
flows of mobilizational resources and discursive frames, increasing the potential for social 
movements on globally legitimated issues such as human rights. The empirical analysis on 
the rise and development of the comfort women movement shows that these global factors 
have been crucial in the emergence and success of the movement. 


This paper proposes a theoretical framework to understand the impact of newly 
emerging global norms on contemporary social movements and applies the 
framework to an empirical case of social movement for former "comfort women." 
In the past several decades, new normative expectations áround progressive 
issues such as human rights and environmentalism have congealed into fairly 
coherent global norms and influential international instruments. The progressive 
ideas have diffused and changed political dynamics across the globe, engendering 
many new social movements that pressure governments and corporations to 
change their practice. Scholars in international relations and social movement 
research have taken notice and produced many studies that examine the impact 
of the new global norms and institutions on contemporary social movements. 
This paper builds on this burgeoning literature and seeks a theoretical synthesis 
of the existing studies on global dimensions of contemporary social movements, 
incorporating ideas in the world polity research. 

The theoretical framework will then be applied to a case study on social 
movements for former "comfort women." The term “comfort women” refers to 
an estimated number of 80,000 to 200,000 women who were drafted from the 
then-Japanese territories with varying degrees of coercion and deception and 
brought to “comfort stations” to provide sexual services for Japanese soldiers 
during the Asia-Pacific War (Hicks 1995, Lie 1997, Yoshimi 2000 [1995]). While 
women from many of the Japanese colonies were victimized, more than 80 
percent of them are said to have been Korean women, making it a particularly 
thorny diplomatic issue between Japan and Korea (Chai 1993, Soh 1996). They 
were subjected to coerced sexual labor and terrible living conditions, and their 


An earlier version of the paper was presented at the American Sociological Association 
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Ramirez and Jackie Smith as well as participants in the comparative workshop at Stanford 
University for valuable comments on earlier drafts. Direct correspondence to Kiyoteru Tsu- 
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suffering lasted much longer than the comfort women system. However, this 
systematic violence against women had been forgotten for morethan 40 years. 
The silence was abruptly broken in the late 1980s, when social movements for 
comfort women emerged and swiftly achieved remarkable success in eliciting 
admissions of guilt and substantial amounts of compensation from Japan. Why 
did activism on this issue emerge in the late 1980s after more than 40 years of 
silence and how did it achieve such remarkable success? Global dynamics of 
contemporary social movements hold the key to answering this question. 


Global Dimensions of Contemporary Social Movements 


The past decade has seen dramatic growth in research on global dimensions of 
social movements (Bandy and Smith 2004, Della Porta et al. 1999, Della Porta 
and Tarrow 2004, Guidry et al. 2000, Smith and Johnston 2002, Tarrow 2005). 
Empirical studies have examined international political opportunity structures 
(Imig and Tarrow 1999, Marks and McAdam 1996, McAdam 1998), transnational 
flows of human and material resources (Keck and Sikkink 1998a, Smith et 
al. 1997), and new patterns of framing with sensitivity to global audiences 
(Klandermans et al. 1999). Concurrently, a few incipient attempts to theorize 
global dynamics of contentious politics have emerged (Keck and Sikkink 1998a, 
1998b; McCarthy 1997; Tarrow 1998). However, a systematic examination of 
how existing conceptual and theoretical tools in the field might apply to the ` 
international level is yet to be done. Here, | attempt a theoretical synthesis of the 
burgeoning literature, organizing the argument along three main approaches in 
social movement studies, political opportunity structures, resource mobilization 
and framing processes. 


. Global Political Opportunity Structures 


Political opportunity structures are conceptualized along four dimensions in the 
literature: (1) relative openness of the institutionalized political system, (2) cracks 
in elite alignments in a polity, (3) the presence of elite allies, and (4) the state's 
capacity and propensity for repression (McAdam 1996:27).? | examine the global 
dynamics of political opportunities along the first three dimensions, all of which 
are expected to facilitate social movements; the fourth dimension is less relevant 
because the global polity does not have the state-like capacity for repression. 
First, open access to international political institutions should encourage 
appeals to the global polity and facilitate social movements. Contemporary social 
movements use the international arenas to influence policy-making processes 
both at supranational and national levels. In the past few decades, corresponding 
to the rise of supranational political bodies’ power and influence, an increasing 
number of social movements have directly targeted them (Smith et al. 1997). 
Most salient examples include recent protests at the meetings of powerful 
international institutions such as the World Trade Organization. The growth of 
these tactics parallels the emergence of national social movements in the era of 
the rise of nation-states (Tilly 1984). To the extent that global entities affect local 
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lives, social movements have become commensurate in their scale, making the 
global institutions their targets. 

Social movements for local or national goals also appeal to global forums 
to pressure unresponsive targets (Keck and Sikkink 1998a). The pressure can 
be applied by formally appealing to international institutions such as the U.N. 
Commission on Human Rights or more informally by publicizing the issue through 
media and other information outlets.š Even when the target is not responsive, 
contemporary activists can appeal to the international political arena to pressure 
the target government or corporation. Thus, those social movements that 
might not have surfaced for lack of /ocal political opportunities before the era of 
globalization can now seize g/oba/ political opportunities to become viable. 

The second and third dimensions of political opportunity structures have to 
do with the relationship between social movements and elites in a polity. Elite 
cleavages at the global level, such as discords among powerful countries and 
institutions, can provide political opportunities. For example, the civil rights 
movement in the United States unwittingly benefited from the cleavage between 
the two superpowers, as Soviet Union's criticism of racism in the United States 
put additional pressure on the U.S. government to address the situation (Layton 
2000, McAdam 1999). Alliances with individual elites at powerful international 
organizations such as the United Nations and in powerful countries such as 
the United States encourage and facilitate social movements as well. Their 
endorsement legitimates the movement and compels the target to take the 
movement seriously. 

These global political opportunities have expanded in the past few decades 
as international institutions and elites have become increasingly sympathetic to 
local struggles by disadvantaged and disaffected populations. Globalization has 
encouraged and facilitated local activists' taking advantage of these expanding 
political opportunities, thus giving rise to more social movements. 


International Dimensions of Resources and Networks for Mobilization 


Research on mobilizing structures of movements has found that prospects of 
social movements change with the flow of resources for mobilization (Zald and 
McCarthy 1987) and that social networks and organizational structures play 
crucial roles in mobilizing individuals to movements (Gould 1991, McAdam and 
Paulsen 1993). 

At the global level, first, international society provides challengers with new 
resources for mobilization. The most direct form of this type of international 
influence is provision of material resources for mobilization. Typically, international 
nongovernmental organizations (INGOs) based in developed countries support 
activists in developing countries to improve local practices on human rights and 
environmental protection. Grants for human rights work by Ford Foundation or 
MacArthur Foundation are prime examples of such material support (Bob 2005, 
Keck and Sikkink 1998a). In addition, INGOs also provide human resources and 
tactical repertoires for effective mobilization, sending professional activists to 
assist local organizations in staging social movements. These human and material 
resources directly propel social movements across the globe. 


334 * Social Forces Volume 85, Number 1 * September 2006 


Second, transnational human and organizational networks help social 
movements much the same way as neighborhood networks and local 
interpersonal ties mobilize people into social movements (Gould 1991, McAdam 
and Paulsen 1993). For example, the global expansion of activities to protect 
indigenous peoples' rights in the past few decades has created international 
activist networks from which indigenous peoples draw resources for their political 
mobilization (Feldman 2002, Makuxi et al. 2003). Similar organizational ties 
connect environmental organizations in different countries and provide support 
for each other's mobilization (Reimann 2001, Schofer and Hironaka 2005). An 
increasing number of international conferences and meetings, especially those 
on topical issues such as human rights and environmentalism, have enhanced 
international ties and facilitated collective action. 

As globalization accelerates the flow of mobilizational resources and enhances 
international networks of activists, potential for social movements grows across 
the globe. 


Framing Processes at the Global Level 


How do framing processes of movements, through which activists legitimate 
their goals and maximize public support (Snow et al. 1986) play out at the global 
level? While few studies have considered the issue of where the ideas and 
materials for framing come from, the assumption in the literature is often that 
framing processes take place in the local cultural context (Noonan 1995, Zuo and 
Benford 1999). Rapid globalization has rendered this assumption untenable. 

An increasing number of activists draw on international "vocabularies of 
motive" in their framing efforts.^ As the flow of information across national 
borders has increased and accelerated, more vocabularies from external sources 
 areavailable today for activists. Activists employ globally legitimated vocabularies, 
such as "human rights" and "environmentalism," to redefine and reinvigorate their 
movements often with great success. For example, "female genital mutilation" 
became a much more salient issue after activists replaced the technical and 
neutral framing of the issue, female circumcision or clitoriodectomy, with the 
"mutilation" frame, which effectively rendered the practice a violation of human 
rights (Boyle 2002, Keck and Sikkink 1998a). 

Also, as local publics become sensitized to international standards of human 
rights or environmentalism, their judgment on the legitimacy of social movements 
is increasingly based on these international standards. While there is a delicate 
interplay between global norms and local cultures, the growing vocabularies of 
social justice at the global level enable activists to frame and reframe their causes 
to more effectively gain public attention and support, both locally and globally, 
thus increasing the potential for social movements in the contemporary world. 


Global Human Rights Norms and the World Polity 


The foregoing described how the growing interconnectedness of nations 
across the globe and the steady expansion of international institutions enhance 
opportunities, resources and vocabularies for social movements. However, 
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globalization, itself, does not necessarily increase political mobilization. | argue 
that an underlying cultural transformation that accompanied globalization and 
promoted progressive issues such as human rights has increased the potential 
for social movements (Donnelly 1998, Held and McGrew 2003, Lauren 1998). To 
make this point, | focus on global human rights norms and draw on the world 
polity approach, which has examined how global norms and models influence 
local politics through international institutions and networks (Meyer et al. 
1997a). According to this approach, the strength of global norms and linkage to 
international society circumscribe the impact of global norms on local politics. 

First, the global polity itself is an ever-changing political entity, in which certain 
ideas become strong norms while others disappear over time (Schofer and 
Hironaka 2005). Thus, depending on the timing of the movement, it may or may 
not be able to take advantage of the global-level opportunities, resources and 
vocabularies. When movement goals click with global norms of the time, the 
movement is more likely to succeed in the global arena. Thus, an understanding 
of the vacillating nature of global norms is essential in examining the impact 
of global factors. Colonialism and slavery are examples of ideas that have lost 
legitimacy over time, while norms on human rights have experienced the most 
dramatic rise in recent decades, rivaled only by environmentalism. Starting with 
the Universal Declaration of Human Rights, the United Nations has adopted many 
human rights declarations and treaties, and an increasing number of states have 
ratified them, creating new political arenas for claim-making by the disadvantaged 
(Tsutsui 2004, Wotipka and Tsutsui 2001). The institutionalization of human rights 
has legitimated the language of human rights, increasing the vocabularies of 
motive for human rights movements (Hafner-Burton and Tsutsui 2005, Morgan 
2004), spawning numerous international human rights organizations, and 
enhancing the international networks of activists (Keck and Sikkink 1998a, Smith 
et al. 1997, Tsutsui and Wotipka 2005). These global-level changes have affected 
local activism greatly in the rapidly globalizing world, encouraging many social 
movements to align themselves with the globally-sanctioned norms. A corollary 
to this argument is that movements that seek goals deemed illegitimate and 
dangerous by international society, such as neo-Nazi and racist movements, are 
discouraged. 

Inaddition, strong global norms have a morefundamental impactin constructing 
social movements (Meyer and Jepperson 2000). Diffusion of global human rights 
norms has produced common global understandings and identities among 
activists and citizens, enabling them to see various human suffering as human 
rights violations. This shared understanding facilitates construction of actorhood 
among those who were resigned to the status quo, encouraging them to engage 
in activism. Thus, in addition to propelling existing movements further, global 
norms give rise to new social movements whose goals are in line with global 
norms. Furthermore, once a movement successfully takes advantage of global 
opportunities and resources, it is more likely to continue to align itself with global 
norms to capitalize on the global elements, especially when it is dependent on 
international support, as is the case with many internationally inspired movements. 
This process reinforces the global norms further and increases activism around 
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these norms, but at the same time, it presents a danger of decoupling between 
global movements and local needs if activists prioritize international supports 
over local necessities. 

Finally, the impact of global factors is stronger in countries with greater linkage 
to international society (Jang and Luo 2000, Meyer et al. 1997b, Ramirez et al. 
1997, Strang 1990, Thomas and Lauderdale 1988). Thus, | expect that the growth 
of social movements, triggered by the global factors discussed above, is more 
dramatic in internationally integrated countries. If the government is tightly 
integrated into international society, it will be more concerned about accusations 
of violations of international codes of conduct, and therefore will be more 
responsive to pressures from the global polity. In addition, if the country is linked 
to the outside world, international flows of mobilizational resources and tools 
for framing will enter more smoothly. Thus, victims of human rights violations in 
integrated countries, such as Canada and Belgium, are more likely to benefit from 
the global opportunities and resources than those in isolated countries, such as 
North Korea and Myanmar. Also, in a single country, its growing integration into 
international society over time increases the impact of global norms on local 
social movements. 


The International Context of the Comfort Women Movement 
Historical Background 


The issue of comfort women had been almost completely forgotten in both Korea 
and Japan in the 40 years since the end of the war. Until the late 1980s, reference 
to comfort women surfaced only in popular culture, where the stories were told 
in an unpoliticized manner.® 

Against this background, political mobilization for comfort women suddenly 
emerged in the late 1980s. The genesis of the movement is said to be two events 
in Korea. One is an international conference on "Women and Tourism” that took 
place on Cheju Island, Korea in April 1988. Professor Yun Chung-Ok, who had 
begun researching comfort women in the 1980s and has led the movement, 
made a presentation that alerted Korean and Japanese women's rights groups to 
the issue (Soh 1996).* The other event was a protest in Korea against the Korean 
government's sending an emissary to Emperor Hirohito's funeral in January 1989. 
During the protest, the issue of comfort women was mentioned as one of the 
unresolved political issues between Korea and Japan (Chai 1993). 

Soon after these events, the movement produced an international coalition 
of Korean and Japanese women's rights activists and organizations that would 
press the Japanese government to apologize for past wrongs. Initially, they 
targeted the Japanese government by publicizing the issue in the national media 
and public forums in both Korea and Japan. They also filed lawsuits against the 
Japanese government after a few former comfort women came forward to be 
plaintiffs. The movement found some support among the general public, but the 
Japanese government turned a deaf ear, dismissing the issue as resolved by the 
1965 normalization treaty between Japan and South Korea and claiming that the 
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comfort women system was established by private actors. At the end of 1991, 
the movement did not seem promising. 

Then, in 1992, two important events took place. First, on the eve of then-Prime 
Minister Miyazawa's visit to Korea, Asahi Newspaper reported that new evidence 
on the Japanese government's involvement in the comfort women system was 
found.” Pressed by this report, Prime Minister Miyazawa had to issue an apology 
recognizing the government's involvement in the system. The admission and 
apology was still partial, however, and the government continued to argue that the 
issue was resolved by the treaty, therefore no compensation was forthcoming. 

Second, to break this impasse, the movement launched an international 
campaign in 1992 and appealed to the U.N. Commission on Human Rights 
and audiences in several major U.S. cities. The issue caught the Commission's 
attention, particularly because another case of violence against women in 
wartime, mass rape in Yugoslavia, was a salient issue. The Commission later 
announced that it would conduct a fact-finding investigation on the issue. 

This process took place against the background of the international 
expansion of women's rights and the emergence of global norms about past 
human rights violations in the preceding decades. The U.N. sponsored many 
international treaties and conferences to protect women’s rights,’ including the 
U.N. International Women’s Year in 1975 and the Decade for Women from 1976 
to 1985 (Berkovitch 1999, Lauren 1998). These activities empowered many 
women's rights activists and organizations, even in countries where women 
were traditionally relegated to submissive and less visible roles in society (Boyle 
2002, Chen 1996, Clark et al. 1998). 

There has also been increasing international effort to uncover and punish past 
human rights violations, leading to a number of apologies and compensations 
by national governments (Gibney and Roxstrom 2001). Germany's apologies for 
Nazi crimes and the U.S. government's apologies for internment of Japanese 
Americans are prime examples. The United Nations institutionalized this trend by 
establishing the International Criminal Tribunals in former Yugoslavia and Rwanda 
and then the International Criminal Court to punish war crimes and facilitate 
reconciliation (Lauren 1998, Rudolph 2001). These official U.N. activities fortified 
the emerging international norms that governments and leaders should be held 
accountable for their "crimes against humanity" regardless of how long ago the 
crimes were committed. 

These developments cultivated fertile ground for attention and support when 
the comfort women's movement appealed to global society. Since then, the 
movement has engaged in much international activity with the U.N. Commission 
on Human Rights as the main locus of contestation? In the next few years the 
movement succeeded in eliciting further admissions of guilt from the Japanese 
government and some reparation for the former comfort women, which 
surpassed any expectations the activists had at the outset (Soh 1996:1238).'° The 
movement continues to pursue a more complete apology and more satisfactory 
reparation, and is becoming even more international as it has diffused to many 
other countries that Japan had occupied: Taiwan, China, the Philippines, Indonesia 
and Papua New Guinea, among others. 
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Global Political Opportunities for the Movement 


` 


The relative openness of the global political system was crucial for the advancement 
of the comfort women movement. Faced with the Japanese government's 
reluctance to take the movement seriously, leaders of the movement first 
appealed to the U.N. Commission on Human Rights in 1992. Multiple petitions 
were submitted to the Commission by Korean and Japanese activists in the first 
few months in 1992, including one by the sociologist Lee Hyo-Chae, co-chair of 
the Korean Council for the Women Drafted for Military Sexual Slavery by Japan, 
a Korean organization for the comfort women movement founded in 1990 (Soh 
1996:1234). The Commission quickly responded, adopting the issue on the agenda 
for the next session, and then in 1994, after preliminary investigations in the Sub- 
Commission for the Prevention of Discrimination and the Protection of Minorities, 
appointing Special Rapporteur Radhika Coomaraswamy to investigate the issue 
along with other issues about violence against women. Her report, issued in 1996, 
was sympathetic to the movement, but the Japanese government denounced it 
as inaccurate and presented its own views to the Commission. Much lobbying 
by the activists and the Japanese government took place behind the scenes, and 
in the end the Commission only gave it a "take note" status, which is the lowest 
level of recognition in the Commission (Hata 1999). Nevertheless, the publicity 
that the proceedings in the Commission created did quite a lot of damage to the 
image of the Japanese government and pressured the government to take a more 
conciliatory approach to the movement. The government was forced to investigate 
the truth about the comfort women system and issued admissions of government 
involvement in the system. It also established the Asian Women's Fund to pay 
compensation for former comfort women with money from private sources." 

The movement's appeal to the international arena and the resulting publicity 
clearly bore fruits. Indeed, the effectiveness of appeals to international society 
can be measured by the amount of coverage a movement receives in the 
media. Figure 1 shows the number of articles about comfort women in the most 
prestigious newspaper in Japan, Asahi Newspaper. The coverage exploded 
in 1992 with the first appeal to the U.N. Commission.'? The level of attention 
declined in the next few years and spiked in 1996 when Coomaraswamy's report 
was submitted to the Commission. Thus, the two years in which the biggest 
international activities took place were the two years in which the movement 
received the greatest attention in Japan. 

Figure 2 shows the number of articles about comfort women in newspapers 
and magazines in three different regions of the world.'^ It demonstrates similar 
patterns of coverage - a rise after 1992 and continuing heavy coverage for the 
next six years - throughout Asia, Europe and the Americas, although the absolute 
volume of coverage is, not surprisingly, greatest in Asia. 

The growing clout of international human rights institutions and the increasing 
publicity given incidents of war crimes and violence against women, bolstered by 
the Yugoslavian rapes and genocide in Rwanda, provided political opportunities 
for the movement. Worldwide, leaders and activists agreed that something 
needed to be done to stop such atrocities; punishing perpetrators was viewed as 
a first step in the right direction (Rudolph 2001). 
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Figure 1. Number of Articles on Comfort Women in Asahi Newspaper 1985-1998 
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Figure 2. Number of Newspaper Articles on Comfort Women in Asia, Europe and Americas 
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International Networks and Resources for the Movement 


` 


When Chung-Ok made her presentation on the comfort women issue at the 
aforementioned conference on Cheju Island, what she did was make a connection 
between the historical incident and modern sex tourism (Chai 1993:79, Soh 
1996:1232). The comfort women issue was understood as a precursor to and 
a root cause of sex tourism by Japanese men traveling to Korea. Korean and 
Japanese women's groups had been protesting sex tourism since 1977. This 
conference also brought together women's rights activists from all over the 
world, leading to international collaboration. The fact that the movement started 
after the conference demonstrates the importance of existing international 
activist networks in giving rise to a new social movement. The international 
network expanded rapidly to other countries in which victims of the comfort 
women system existed, including China, Taiwan, Indonesia, the Philippines and 
the Netherlands. 

The interpersonal and organizational ties among the activists enabled various 
international activities on behalf of the movement. Women's rights groups and 
Korean-American groups in the United States provided forums for publicizing the 
issue many times.'® Established international human rights nongovernmental 
organizations, such as the International Commission of Jurists, also supported the 
movement, issuing statements urging the Japanese government to compensate 
the victims, condemning the government's efforts to appease the movement, and 
bolstering the movement's contention that the statute of limitations does not apply. 

Figure 3 underscores this point further as it demonstrates the growth of 
international attention to Japan among women's rights activists. The figure graphs 
the founding years of women's rights organizations that have Japan as an area of 
interest. The information was drawn primarily from Human Rights Internet (HRI), 
which holds the most comprehensive data base on international human rights 
organizations, accessed July 30, 2005 at: http://www.hri.ca/organizations.'6 The 
figure shows that women's rights organizations interested in Japan began to 
grow in the mid-1970s, when the U.N. Year and Decade for Women started, and 
exploded in the late 1980s, corresponding to the emergence and development 
of the comfort women issue." 

The growing international networks of women's rights organizations eventually 
coalesced into a consortium organization, Violence Against Women in War- 
Network Japan in 1998.'5 This organization was formed after a conference on 
War and Violence Against Women in Tokyo a year earlier, which had participants 
from 20 countries (VAWW-NET Japan 20053). The organization's aims are to 
"eliminate violence against women in war and armed conflicts" and "to build 
a non-militarized world by bearing the responsibility of creating a peace that is 
based on the perspective of women's human rights." (VAWW-NET Japan 2005b) 
This organization was the most visible formalization of the international networks 
of women’s rights activists that have helped push the movement forward at every 
turn. Their collaboration culminated in the Women’s International War Crimes 
Tribunal on Japan's Military Sexual Slavery in 2000, which was a nonbinding 
‘citizens’ tribunal” to adjudicate the Japanese government's responsibilities for 
the comfort women system. 
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Figure 3. Founding Years of Women's Rights Organizations Interested in Japan 
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Table 1 shows the overall number of participants and the number of surviving 
victims of the comfort women system in each of the nine countries represented 
in the conference. It reveals the international nature of the conference and the 
movement. In addition to the participants, the "citizens' tribunal" featured legal 
experts from Argentina, Australia, Kenya, the Netherlands, the United Kingdom 
and the United States, many of whom were practitioners of international law on 
human rights, women's rights and crimes against humanity. Some have been 
directly involved in the tribunals for Rwanda and former-Yugoslavia. Furthermore, 
the consulting members included experts from six more countries (Canada, Costa 
Rica, France, India, Uganda and Yugoslavia), and the preparatory meetings took 
place in Korea, Japan, China, the Philipines, Taiwan and in the Hague during the 
two-and-a-half year organizing period (Puja 2001:616). The tribunal was attended 
by more than 1,000 people on each of the four days and media from all over the 
world. More than 200 reporters from other countries coverd the sessions, which 
ended in guilty verdicts for the Japanese government and the Emperor. 

While critics dismiss the verdict, pointing to the lack of vigorous defense 
and legitimate cross-examination, the tribunal served the movement well. Wide 
participation in the tribunal reflected the global reach of women's rights activists 
as well as the global resonance of the issue. Such international collaboration 
of activists has propelled the movement throughout its history. Material and 
ideological resources for the movement were provided by existing women's 
rights and human rights groups and diffused through networks of activists. 

It is also worth noting that the movement in recent years started expanding 
its goals to aim more broadly to condemn violence against women in conflict 
generally and is seeking to advance that agenda in international society. This 
process parallels the trajectory of local social movement organizations. Just as 
their local counterparts, international social movement organizations expand 
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Table 1: Number of Participants in the Women's International War Crimes Tribunal on 
Japan's Military Sexual Slavery in 2000 


Country Number of Participants Number of Victims 
S. Korea 220 21 
Taiwan 63 12 
Philipines 42 15 
China 28 6 
Indonesia 16 4 
N. Korea 11 2 
East Timor 6 2 
Netherlands 3 2 
Malaysia 1 0 
Total 390 64 





Source: http://www1.jca.apc.org/vaww-net-japan/womens_tribunal_2000/participants.html 


their goals to include wider issues, thus perpetuating themselves even after the 
original goal is largely met. 


The Rise of Global Framing in the Movement 


Global diffusion of human rights discourses provided effective frames for the 
movement to attract attention and gain support. The analysis below focuses on 
the frames the movement employed in its demands to the Japanese government 
and categorizes them into three different types: women's rights frame, human 
rights frame and Japan's responsibility frame. Global influences are evident in the 
first two types, while the third reflects local cultural and historical contexts. 

First, global activities and discourses on women's rights reached both Korea 
and Japan and gave the comfort women movement an ideological basis for 
mobilization. With some help from their western counterparts, Korean and 
Japanese activists raised awareness on women’s rights issues and helped 
women stand up against the traditional male dominance including sexual 
violence. Empowered by international women’s rights discourses, activists 
in Korea and Japan framed the issue of comfort women as a classic case of 
oppression of women and sexual violence that continues to haunt Asian women. 
These discourses were very effective in publicizing the injustice of the comfort 
women system. 

The movement has also successfully framed the issue as a violation of 
women’s rights. An organizer of the tribunal contends that in addition to bringing 
those responsible for the comfort women system to justice, another main goal of 
the tribunal was to “break ‘the cycle of impunity’ of sexual violence that continues 
throughout the world to this day." (Puja 2001:611-2). This women's rights framing 
created shared understanding among activists, enabling the movement to 
overcome nationalism and form a coalition between activists in Korea and Japan, 
and it succeeded in appealing to wider international audiences. 

The second frame, the human rights frame, is an extension of the first frame 
but is more general in its scope. It frames the comfort women issue as an 
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egregious violation of human rights and argues that, because it is a crime against 
humanity, the statutory time limitation does not apply. Because of the emerging 
international norm on redressing past wrongs, the Japanese government 
could no longer claim that the issue had been settled between Korean and 
Japanese governments. Thus, no matter the age of the offense and the legal 
qifficulties of reparations, the movement was able to push the argument that 
the Japanese government should issue formal apologies and compensation. The 
comprehensive term “crimes against humanity,” which describes any egregious 
human rights violations any time, is used to make this connection and eliminate 
the argument for a statute of limitations. 

The human rights frame also helped draw attention to the issue, particularly 
because the investigation on the comfort women issue by the U.N. Commission 
on Human Rights took place amidst the debate over the establishment of the 
International Criminal Tribunals in Rwanda and former Yugoslavia. The linkage to 
more recent violations of human rights helped raise the profile of the issue in the 
international political arena, increasing pressures on the Japanese government 
to address the issue (Hata 1999).** 

Finally, the movement has also framedi its efforts as a part of thelocal movement 
in Korea and Japan that seeks to elicit acknowledgement and apologies from 
the Japanese government for its aggressions in Asia. | call this locally-rooted 
frame Japan's responsibility frame. The issue of comfort women first surfaced 
in the Korean consciousness when Koreans protested against their government 
by sending an emissary to Japan for the funeral of Emperor Hirohito. It was a 
manifestation of Korean nationalism that resents Japan's intransigence to admit 
its past wrongs during its occupation of Korea. This nationalism is still very strong 
in Korea, and in this broader context, Japan's responsibility frame resonates very 
well with the Korean public. In Japan, too, there has been activism promoting 
apologies for aggressions in Asia. It became stronger in the 1980s as the issue 
of textbook descriptions of the aggressions in Asia became a diplomatic problem 
with China. The comfort women issue was the next big target in the 1990s as this 
line of activism grew further. Japan's responsibility frame appeals to the liberal/ 
progressive sector in Japan that has been mobilizing for government's apologies 
for Japanese war crimes. 

If the global discourses have influenced the movement's framing, the women's 
rights frame and the human rights frame should increase and Japan's responsibility 
frame should decline over time. To explore if such shifts in the framing took 
place, | examine newspaper reports of the movement. Newspaper accounts do 
not necessarily reflect movements' claims, because reporters selectively report 
on what they think is important and newsworthy. Several scholars have studied 
the correlation between claims by activists and news reports, but the findings 
seem inconclusive (Earl et al. 2004, Klandermans and Goslinga 1996). Here, it is 
important to note that the newspaper | examine, Asahi Newspaper, is known to 
be sympathetic to the movement even though it seeks to maintain professional 
objectivity expected of the most prestigious newspaper in Japan. Thus, | expect 
their reports to reflect the movement's framing of the issue fairly well. 


344 * Social Forces Volume 85, Number 1 * September 2006 


The three frames are captured in several keywords. First, the women's rights 
frame is captured by "jose/" (women), a term often used to discuss issues 
surrounding women's rights. When referring to "comfort women" a different 
term (ianhu) is used in Japanese and therefore there is little room for confusion 
between the two. Second, the human rights frame is measured by "Jinken" 
(human rights) and “sensou hanzai” (war crimes). The latter is a word that is 
usually used in discussing war crimes in general not in the context of Japan's war 
crimes in particular. Thus, this concept captures the general appeal the movement 
seeks in framing the issue, which allows them to make connection with other 
more recent war crimes. Finally, the Japan's responsibility frame is captured by 
"sensou sekinin" (responsibility for the war) and "shinryaku" (aggression). These 
two terms accompany most of the discourses on Japan's responsibilities for 
aggressions in Asia during the war. 

Figure 4 summarizes the percentages of newspaper articles that use each of 
the five keywords in their reporting out of all the articles about comfort women. 
The data show that while women's rights and human rights frames have increased 
in reporting the comfort women issue, reflecting the growing currency these 
discourses have gained in the international political arena, Japan's responsibility 
frame has receded considerably. The women's rights frame has been used from 
the beginning and has shown a modest increase over time. The human rights 
frame was not a strong presence initially, but grew considerably in the following . 
years and was prevalent by 1998. Japan's responsibility frame, on the other hand, 
was strong until the mid-90s but declined in the following years as the movement 
went increasingly global. | argue that this is because the movement moved into the 
global political arena as it developed, and frames that resonated with international 
audiences took precedence. For example, boycotting of Japanese products was 
one of the movement strategies in Korea, but it was mostly ineffective and the 
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movement rarely mentions it any more. Thus, the change in the framing indicates 
that the movement has been influenced by the global discourses and that the 
movement adjusted its framing in order to appeal to international audiences. 

Figure 5 corroborates this argument further. It reports the percentages of 
newspaper articles in three regions of the world, Asia, Europe and Americas, that 
use the term human rights in their reports on comfort women. The figure shows 
that human rights discourses increased over time in Asian newspapers, while 
the human rights frame in reporting the comfort women issue has increased 
in European and American news sources. This suggests that, outside Asia, the 
movement was understood as a human rights issue, and the movement adjusted 
to this global perception by utilizing the human rights frame in its framing, as 
activists recognized the resonance in international society. 


Changes in Domestic Politics in Korea and Japan 


The foregoing has emphasized the global factors that advanced the movement. 
Figure 5. Framing of the Comfort Women Movement in Newspapers in Asia, Europe and 


Americas, 1990-1998 
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This does not mean that domestic factors were irrelevant. Domestic changes 
in political and social environments in both countries also contributed to the 
movement's growth. In Korea, democratization in the 1980s reduced the risk 
of political mobilization. After the mass protest demonstrations in 1987 forced 
Chun Doo Hwan's regime to undertake democratic reforms, the emergence of 
new domestic political opportunities in the transition to democracy gave rise to 
many new social movements. Combined with the diffusion of the international 
women's rights discourse, the domestic political environment became much 
more conducive to a viable women's rights movement. 
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Before this period, the strong Korean government had prevented 
nongovernmental actors from engaging with the Japanese government directly 
on issues of war crimes and compensation because the Korean government 
resolved these issues in the normalization treaty that provided Korea with roughly 
$300 million in compensation from the Japanese. The Korean government was 
responsible for distributing the money to all the victims of the occupation, 
including the former comfort women. Thus, it deemed the issue of comfort 
women under its jurisdiction and did not like the movement to deal with the 
Japanese government directly. In the new democratized political environment, 
however, the Korean government had to tolerate the movement, thus allowing 
activists to directly target the Japanese government. The importance of the 
democratic setting becomes clearer when we compare the Korean situation 
with that of other Asian countries. In China, Malaysia, Singapore and Indonesia, 
strong governments largely monopolize dealings with the Japanese government. 
Japan has been one of the biggest providers of foreign aid for these countries, 
and the issue of compensation for the war was supposedly settled between 
the governments. In these countries, the government tends to suppress social 
movements regarding the issue of Japan's acts of aggression. 

In Japan, the U.S. occupation force established democratic institutions much 
earlier, and social movements had become routine part of domestic politics 
by the 1960s. The rise of women's rights movements, however, had to wait . 
until the 1980s (Liu and Boyle 2001), and this momentum carried over to the 
comfort women movement (Field 1997:23). Also instrumental was the shift in the 
emphasis of leftist groups begun in the late 1980s, from socialism/communism 
to new issues such as human rights, feminism and environmentalism. After the 
end of the Cold War, leftists in Japan needed to find a new focus for their political 
mobilization, as support for the hardcore Marxist ideology disappeared quickly. 
They sought to adopt causes that were less radical but nevertheless consistent 
with their belief system. Such causes included general issues such as human 
rights; feminism and environmentalism as well as more specific issues such as 
anti-U.S. hegemony and Japan's war crimes. The issue of comfort women was 
particularly appealing to them because it touched on many of the issues they 
cared about, including human rights, feminism and Japan's war crimes. 

The impact of these domestic factors relative to that of the global influences is 
hard to assess precisely. In the case of the comfort women's movement, however, 
the importance of the global factors in enabling the activists to resuscitate 
the issue and turn it into a powerful international human rights movement is 
undeniable. As the evidence shows, the emergence of the new global political 
opportunities and international flows of resources and discourses were crucial in 
the movement's emergence and development. 


Conclusion 


This paper started with two goals, one theoretical and the other empirical. The 
theoretical discussion combined three main approaches in the social movement 
literature and the world polity approach to produce a theoretical framework for 
analysis of global dimensions of contemporary social movements. It predicts a 
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rise in social movements on issues that are considered important in international 
society, as political opportunities at the global level and international flows of 
mobilizational resources and frames have increased over time, especially in 
internationally integrated countries. 

In the empirical analysis, | applied the theoretical framework to the comfort 
women movement to answer the empirical question of the paper: Why did the 
movement for former comfort women emerge in the late 1980s after more than 
40 years of silence? The empirical analysis demonstrated that the movement 
emerged and expanded as global political opportunities, international networks 
of activists and globally sanctioned vocabularies for framing became increasingly 
influential and accessible to the activists. While the issue was dead between the 
Korean and Japanese governments until the 1980s, the movement successfully 
revived it and achieved some impressive success by using the international 
political forum to pressure the Japanese government and by linking the issue 
to contemporary concerns about violence against women and crimes against 
humanity. | argue that it is because these global conditions were in place by the 
late 1980s and continued to develop in the 1990s that the movement emerged 
after 40 years of silence and achieved a surprising level of success. 

| further argue that such global influences on local social movements are not 
unique to this movement but rather a global trend. As discussed in the theory 
section, growth of the global polity has created an international context in which 
local actors can draw on global norms to construct common understandings and 
identities to launch new social movements. They can also advance existing social 
movements by taking advantage of global institutions, resources and discourses 
to promote their causes in the global stage. Thus, even those movements that 
have little prospect in local society, be it because of lack of political opportunities 
or mobilizational resources, can become viable social movements in the 
contemporary world. The theoretical framework developed in the paper found 
support in the empirical analysis of the comfort women movement and should be 
useful in analyzing wider global trends and in understanding other contemporary 
social movements. As more social movement scholars pay attention to the 
global dynamics of social movements, accumulation of research should enable 
a comparative study that examines what types of movements benefit the 
most from the global polity and what kinds of actors are more vulnerable to 
global pressures. For example, movements on issues such as human rights, 
environmentalism and protection of minorities receive strong support from the 
global political arena because of their international legitimacy. Also, consumer- 
oriented companies may be more vulnerable to social movements than industry- 
oriented ones because a boycott campaign will hurt them more, and dependent 
countries may be more vulnerable than advanced economies because economic 
sanctions will have a greater impact on them. 

Formal testing of some of the arguments proposed here and evaluation of the 
impact of international factors relative to the impact of other domestic factors 
on social movements have to await multivariate statistical analyses.*° But the 
current analysis provides some initial support for the importance of global factors 
and points to a promising line of inquiry. 
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Notes 


E 


` 


The term "comfort women" corresponds to the Japanese word “/anhu,” 
which was used euphemistically to refer to these women. It remains the 
most common term today, although alternative terms such as "military sexu- 
al slavery" have been used by activists (Tanaka 2002). 


In recent years, many scholars have attempted to refine the concept of po- 
litical opportunity, which has become overused and abused in the literature, 
and sharpened the distinction between political opportunities and mobiliza- 
tional resources and cognitive dimensions of social movements (Goodwin 
and Jasper 2003, Meyer 2004, Meyer and Minkoff 2004). 


Even when a formal route is used, the practical effect is usually publicity of 
the issue because few international institutions have enforcement power to 
fulfill movements' demands. 


To be sure, this process is not unique to social movements since the late 
20th century. For example, the Sinn Fein movement for self-determination 
in Ireland employed the rhetoric used by the League of Nations to mobilize 
publics (Swart 1995). 


The most visible example is a book by Kako Senda, "Comfort Women," 
which was published in 1978. It did not have a strong political tone and did - 
not make inroads into politics. 


Her research included visits to Japan, Thailand and Papua New Guinea, and 
she was helped by members of the Korean Church Women United (Chai 
1993, Hata 1999). 


The Asahi Newspaper is the most prestigious newspaper in Japan and is 
known to have a liberal leftist slant. It has been sympathetic to the move- 
ment from the beginning and some argue that the timing of this report re- 
flects a collaboration between the newspaper and the scholar (Hata 1999). 
If this was the case, it was a shrewd move by the movement to take advan- 
tage of the international political opportunity created by the Prime Minister's 
visit to Korea. 


Most notable of the treaties is the Convention on Elimination of All Forms of 
Discrimination against Women (1979), which has been ratified by more than 
90 percent of the countries by the turn of the century. And U.N. sponsored 
international women's conferences were held first in Mexico City in 1975, 
and subsequently in Copenhagen in 1980, in Nairobi in 1985, and in Beijing 
in 1995 (Wotipka and Ramirez 2003). 


The U.N. Commission on Human Rights, where state representatives exam- 
ine cases of human rights violations, was considered "the single most impor- 
tant institution of the global human rights regime." (Donnelly 1998:52). 


10. The apologies were not to the movement's complete satisfaction, and the 


compensation money came from contributions by Japanese citizens (which 
amounted to about $5 million), not from the government, although the gov- 
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ernment paid for operation of the foundation and some medical costs for 
former comfort women. 


The scale ranges from “commend,” “welcome,” "take note with apprecia- 
tion," "take note" and "reject." 


The payment from the fund caused splits between activists and survivors. 
Some opposed the fund and urged survivors not to accept the money until 
the government offered complete apologies and payment from its budget, 
while others argued that the survivors did not have much time to wait for 
such complete success and should take the money (Field 1997:27). 


Because new evidence about the involvement of the Japanese government 
was also reported in 1992, the heavy coverage of the movement cannot be 
attributed solely to the international activities. But the appeal to the Commis- 
sion clearly increased the publicity of the issue. 


The data are obtained from the newspapers and magazines archived in the 
Lexis/Nexis data service. The Lexis/Nexis stores news articles from all over 
the world: as of July 18, 2005, it includes 505 European sources, 441 North 
and South American ones, and 363 Asian ones with no known ideological 
tendencies among the newspapers and magazines. 


. For example, in 1992, the aforementioned activist Lee Hyo-Chae addressed 


audiences in New York City, Washington, D.C., and Los Angeles at the invita- 
tion of Korean American Women United and the Women's Association for 
the Commemoration of the March First Korean Independence Movement 
(Chai 1993). 


| also collected additional data on founding years with information on the 
organizations provided by the HRI. 


The decline in the late 1990s is most likely attributable to the time it takes for 
organizations to enter the data base. 


A precursor to this organization was the Comfort Women Issue - Asian Soli- 
darity Conference, founded in Seoul in 1992 by members from Korea, Japan, 
Taiwan, Hong Kong, the Philippines and Thailand. It made appeals to Japan 
and other relevant governments, nongovernmental organizations and the 
U.N. Commission on Human Rights (Chai 1993). 


In an explicit articulation of this frame, a Dutch former comfort woman who 
was part of the movement said: "Now | use my story to accuse all people 
who are at the moment involved in raping, robbing and detainment of in- 
nocent women and children in the former Yugoslav republic. When | see the 
reports on those gang rapes on television, the pain returns immediately. In 
God's name, when are these crimes against humanity going to stop?" 


See Tsutsui (2004) for such a study on ethnic minority social movements. 
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Domestic Violence Against Married Women in Cambodia 
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Abstract 

We evaluate the effects of marital resources and early-life experiences on recent domestic 
violence and attitudes about wife abuse among 2,074 married Cambodian women. 
Household standard of living was negatively associated with physical domestic violence. 
Women with 8-13 fewer years of schooling than their husbands more often experienced 
physical and psychological domestic violence. Women with more living children more 
often experienced physical domestic violence and justified wife beating. Having surviving 
parents and siblings was not associated with domestic violence nor with attitudes about 
wife beating. Women reporting higher paternal schooling, urban childhood residence and 
domestic violence against their mothers had higher odds of experiencing physical and 
psychological domestic violence. Findings underscore the effects of marital resources and 
early-life experiences on domestic violence in Cambodia. 


Despite high levels of domestic violence against women globally, tests of 
competing causal theories are limited in non-Western settings. Scholars have 
argued that men may use force when they lack other resources to control their 
partners' behavior (Felson and Messner 2000, Goode 1971). Others have argued 
that women who are dependent on their partners may be more tolerant of spousal 
abuse (Kalmuss and Straus 1982). Still others have argued that a man with fewer 
resources than his partner may bolster his dominance with force (MacMillan and 
Gartner 1999). Finally, others have asserted that resources in marriage poorly 
predict domestic violence after accounting for early-life experiences and contact 
with kin (Jewkes, Levin and Penn-Kekana 2002). We test the effects of resources in 
marriage and early-life experiences on the odds of domestic violence among 2,074 
married women in Cambodia. An analysis of women's views about wife beating 
aids interpretation of the results. A score for household assets and amenities 
captures household standard of living. A woman's number of living children and 
her age and schooling relative to her husband capture socioeconomic dependence 
or status inconsistency in marriage. Having surviving parents and siblings, and 
the living arrangements of the husband, in-laws and natal or biological family 
capture a woman's family organization and contact with natal kin. A woman's 
early-life experiences include her father's schooling, her childhood residence and 
whether her father ever beat her mother. We use multivariate methods to assess 
the net effects of resources in marriage and early-life experiences on domestic 
violence against women and their attitudes about wife beating in this historically 
violence context. 
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Background : 


Domestic violence refers to "assaultive and coercive behaviors that adults use 
against their intimate partners" (Holden 2003), and surveys show that domestic 
violence against women is widespread (Kishor and Johnson 2004, Levinson 1989; 
Watts and Zimmerman 2002). Twenty-five percent of women in the 1995-1996 
National Violence Against Women Survey in the United States reported that they 
had ever been raped and/or physically assaulted by a current or former spouse, 
cohabiting partner or date (Tjaden and Thoennes 1998). The lifetime prevalence 
of physical domestic violence has ranged from 40 percent to 52 percent among 
ever-married women in parts of Latin America (Ellsberg et al 1999, Kishor and 
Johnson 2004) and from 17 percent to 48 percent among various women in 
Africa (Coker and Richter 1998, Jewkes et al. 2001, Kishor and Johnson 2004, 
Watts and Zimmerman 2002). About one third of ever-married Egyptian women 
have reported being beaten since marriage (El-Zanaty et al. 1996, Yount 2005), 
and 30 percent of married Arab women in Israel have reported recent physical 
or sexual abuse (Haj-Yahia and Edleson 1994). Levels of lifetime and recent 
physical domestic violence against women have been similarly high in parts of 
Asia (Hoffman, Demo and Edwards 1994; Kim and Cho 1992; Koenig et al 2003), 
and 18 percent of ever-married women aged 15-49 in Cambodia have ever been 
beaten by a spouse or partner (Kishor and Johnson 2004). Despite widespread . 
domestic violence against women, tests of competing theories about its causes 
have been limited outside the West. 


Resources Of The Household Or Male Partner 


Goode (1971) and others (Felson and Messner 2000) have argued that overt 
(physical) force and its threat are resources, like money or personal attributes, that 
people may use to control the actions of others. Goode has posited that people will 
rely on overt force when they lack other resources, or when other resources have 
failed to help them achieve their goals. Scholars have tested Goode's hypothesis 
by estimating the effects of husbands' or households' socioeconomic status on 
the physical abuse of wives. In parts of India, the unadjusted odds of wife beating 
have been higher in households owning fewer consumer durables (Jejeebhoy and 
Cook 1997), and landholdings have been negatively associated with wife beating 
by a husband or his family in rural Bangladesh (Koenig et al 2003). In Bangkok, 
Thailand, an index combining family income and the husband's schooling and 
occupational prestige has been negatively associated with the physical abuse 
of wives (Hoffman, Demo and Edwards 1994), and a composite of individual 
income, schooling and occupational prestige has been negatively associated 
with assaults by partners among adult heterosexual couples in the Unite States 
(Williams 1992). In North America, the income of the family or husband has 
been consistently negatively associated with physically abusing a wife, but the 
relationships of men's schooling and employment with wife abuse have been 
inconsistent (Smith 1990). Scholars have argued that observed income-abuse 
associations also corroborate stress theory because low income may induce 
stress, which leads to violence (Figure 1) (Dutton 1988, Gelles 1974, MacMillan 
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Figure 1. Conceptual Model of the Causes of Domestic Violence Against Women 
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Note. This is a conceptual model. Items in italics are not available for analysis. 


and Gartner 1999, Smith 1990). This pathway of effect has been observed in 
urban Thailand (Hoffman et al. 1994). 


A Woman's Social and Economic Dependence on the Partnership 


Critics of Goode have argued that poverty in men alone cannot explain domestic 
violence against women and that women's socioeconomic dependence on 
partnerships may better explain this abuse (Blumberg 1984, Dobash and Dobash 
1979, Kalmuss and Straus 1982). Specifically, married women with children and 
little other financial support may have or perceive to have few alternatives to 
marriage, and thus may be more tolerant of spousal abuse (Figure 1). Having 
sons in parts of South Asia, however, has been either unrelated to or negatively 
associated with domestic violence against women (Koenig et al. 2003, Rao 1997, 
Schuler et al. 1996), and housewives in Kentucky have been less likely than 
working women to be in violent marriages (Hornung, McCullough and Sugimoto 
1981). Yet, data from 40 U.S. families have shown that unemployed and poorly 
educated women seek outside intervention after spousal beatings less often than 
do employed women with more schooling (Gelles 1976). Among cohabiting adults 
in the United States, women's psychological dependence has been positively 
associated with minor physical violence, and women's economic dependence 
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has been positively associated with severe physical violence (Kalmuss and Straus 
1982).! Female participants in group-based savings and credit programs in rural 
Bangladesh have had a two-thirds lower risk of domestic violence than non- 
participants (Schuler et al 1996); however, aggravated violence did occur among 
some participants (Schuler, Hashemi and Badal 1998), and the risk of physical 
beating has not been associated with a married woman's membership in such 
programs elsewhere in rural Bangladesh (Koenig et al. 2003). In Minya, Egypt, 
women with sons and markedly less schooling than their husbands more often 
have experienced physical abuse and have expressed marginally more tolerance 
for such abuse (Yount 2005). 


Status Inconsistency in the Partnership 


Other critics of Goode have argued that models of domestic violence against 
women need to distinguish partnerships in which the distribution of resources 
disrupts status expectations (Anderson 1997, MacMillan and Gartner 1999). 
Namely, if the psychological and/or economic resources of a woman exceed those 
of her male partner or some culturally acceptable level, atypical disadvantages 
in the male partner's status may threaten his masculine identity and motivate 
him to reinstate his dominance with violence (Figure 1) (Connell 1995, Thoits 
1992). Among currently or formerly married women in Kentucky, life-threatening 
violence has been more common among wives whose schooling and occupational - 
attainments exceeded those of the husband (Hornung, McCullough and Sugimoto 
1981). In the United States, men have used physical violence more often against 
female partners with higher incomes, but not with more schooling (Anderson 
1997). In Canada, the adjusted probability that a man will use coercive tactics to 
control his female partner has been greatest when she was working and he was 
not (MacMillan and Gartner 1999). Yet, absolute spousal differences in years of 
schooling and scores for occupational prestige have not been associated with 
physical wife abuse in Bangkok, Thailand after adjusting for the household's 
socioeconomic status and other factors (Hoffman et al. 1994). 

Discrepancies in these findings may result from differences across studies 
in the samples, measures and analytic methods that were used (MacMillan and 
Gartner 1999, Weis 1989). First, reporting on violence may differ according to the 
social acceptability of violence across settings and resource groups (Anderson 
1997, Arias and Beach 1987). Second, the absolute and relative resources at one’s 
disposal may be variously associated with discrete forms of violence (MacMillan 
and Gartner 1999, Weis 1989). A husband with other resources, for example, may 
threaten physical force to control his wife because the costs of using physical 
force are too high (Smith and Leginski 1970). Alternatively, husbands from all 
social classes may use physical and psychological violence if they were similarly 
socialized to believe that men should be dominant (Johnson 1995). Third, some 
scholars have assumed a linear relationship between relative resources and 
domestic violence by using absolute differences in partners’ years of schooling 
or scores for occupational prestige (Diop-Sidibe 2001; Hoffman, Demo and 
Edwards 1994). To permit non-linear effects, one might distinguish women with 
more of a resource from those having the same amount or less of that resource 
than their partner (Anderson 1997). 
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Family Organization and the Woman's Contact with Natal Kin 


Discrepancies in the above findings also may result from differential attention 
to other causes of domestic violence. Family organization and contact with 
natal kin, for example, can alter the risk of domestic violence against women, 
especially where multigenerational coresidence and extended kinship are 
common (Hoffman, Demo and Edwards 1994; Kabeer 1999; Warner, Lee and Lee 
1986). First, the practice of virilocal residence, whereby a couple lives with the 
husband's kin, may reduce a woman's power because the coresiding husband 
and parents-in-law share authority over her actions (Warner, Lee and Lee 1986). 
Empirically, women living with senior marital relatives have had weaker family 
power in rural Bangladesh (Balk 1997), but also /ower odds of domestic violence 
in this context (Koenig et al. 2003). Second, the practice of uxorilocal residence, 
whereby a couple resides with the wife's kin, may strengthen a woman's ties to 
her natal kin and enable her to negotiate marital conflicts (Kabeer 1999, Moors 
1995, Morsy 1993). In India, women with closer ties to natal kin have had more 
control over finances, decision-making power and mobility (Bloom, Wypij and 
Das Gupta 2001), and the presence of the wife's family has been negatively 
associated with domestic violence against women in South India and Minya 
Egypt (Rao 1997, Yount 2005). Third, women whose marriages were arranged 
or who married at young ages (to older mates) may have less marital power and 
thus may be at higher risk of domestic violence (Jejeebhoy and Cook 1997). 


The Early-life Experiences of Partners 


A final group of scholars has argued that adverse experiences in childhood 
may increase the risk of involvement in domestic violence, net of the effects of 
socioeconomic resources in partnerships. Namely, witnessing family violence or 
experiencing direct maltreatment may increase the risk of domestic violence in 
adulthood because such experiences teach children to view violence as normal. 
Such experiences among women also may lower their self esteem and reduce 
their ability to leave abusive partners. In the United States and elsewhere, 
witnessing family violence in childhood, and especially a father's perpetration 
of violence against one's mother, has been associated with an increased risk of 
becoming either a perpetrator or a victim of domestic violence (Jewkes, Levin 
and Penn-Kekana 2002; Kalmuss 1984; Martin et al 2002; Whitfield et al. 2003). 
Although a recent study in Nicaragua has shown contradictory findings (Ellsberg 
et al. 1999), a meta-analysis of 52 case-control studies in the United States 
confirmed that witnessing violence in the wife's family of origin is associated 
with husband-to-wife abuse (Hotaling and Sugarman 1986). Some scholars also 
have shown significant direct and indirect effects of physical abuse in childhood 
on later involvement in domestic violence (Jewkes, Levin and Penn-Kekana 
2002; Schafer, Caetano and Cunradi 2004; Whitfield et al. 2003), but others have 
cautioned that these effects may vary by gender and ethnicity (Herrenkohl et al. 
2004; Schafer, Caetano and Cunradi 2004). In South Africa, household and relative 
spousal resources have had no effect on domestic violence against women after 
controlling for exposure to and experiences of violence in childhood (Jewkes, 
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Levin and Penn-Kekana 2002). J 
Other Determinants of Domestic Violence Against Women 


The empirical literature has pointed to other variables for which analyses of domestic 
violence against women should take into account. They include the prior abuse of 
alcohol and/or drugs by one or both partners (McKenry, Julian and Gavazzi 1995), 
as well as a woman's religious affiliation, age group, location of residence and 
prior experience of a child's death. Among women and men in the United States, 
being Catholic as opposed to nonreligious has been associated with less tolerance 
for wife beating (Ulbrich and Huber 1981), and attitudes about women's roles as 
well as practices such as female genital cutting have varied between Muslims and 
Christians in Egypt (Yount 2002, 2004). In several non-Western settings, women's 
higher age and rural residence have been associated with lower odds of recent 
spousal abuse, but women's duration of marriage has had inconsistent effects 
on such abuse (Kishor and Johnson 2004). In León, Nicaragua, a higher lifetime 
prevalence of physical or sexual abuse has occurred in the mothers of children 
who died before age 5 years (Asling-Monemi et al. 2003). 

A growing number of national surveys have included questions about violence, 
yet its causes are understudied in Southeast Asia. Crosscultural research also 
has not tested explicitly for the net effects of resources in marriage and early- ` 
life experiences on domestic violence against women. The main purpose of this 
paper is to conduct an integrated test of the effects of these variables on the 
odds of physical and psychological domestic violence against married women in 
Cambodia. A second purpose is to enhance the interpretation of these findings 
by exploring the effects of the same variables on available attitudinal measures 
about wife abuse (Figure 1). The discussion above motivates five specific 
hypotheses: 


H1: Women in poorer households will have higher odds of 
experiencing domestic violence. 


H2: Women who have more children and are economically 
dependenton marriage will have higher odds of experiencing 
domestic violence and of justifying abuse against wives. 


H3: Women with more schooling than their husbands will 
have higher odds of experiencing domestic violence. 


H4: Women who married at a younger age, and who lack 
contact with natal kin, will have higher odds of experiencing 
domestic violence. 


H5: Women who report that their father beat their mother 
will have higher odds of experiencing domestic violence 
and of justifying abuse against wives. 
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Setting 


Cambodia has sustained invasion, foreign occupation and civil war since the 
Khmer Empire's fall in the 15th century. In 1863, the king of Cambodia placed the 
country under French protection, and the country became part of French Indochina 
in 1887. After Japanese occupation during World War Il and participation in the 
First Indochina War (1946-54), Cambodia gained full independence from France in 
1954.2As neighboring Laos and South Vietnam came under increasing Communist 
attack, economic conditions in Cambodia worsened, and North Vietnamese and 
Viet Cong troops infiltrated the country. In 1970, Premiere Lon Nol led a successful 
right-wing coup, but the loss of Cambodian villages and civilians because of U.S. 
and South Vietnamese anti-communist actions spurred a rise in the number of 
Cambodian Communists (the Khmer Rouge) and eventually a civil war. By 1972, 
shortages of food in Phnom Penh sparked rioting and increased oppression under 
Lon Nol. Pol Pot and the Khmer Rouge seized Phnom Penh in 1975. Most of the 
country's vehicles and machines were destroyed, and the urban population was 
forced to work as agricultural slave labor. During 1975-1979, the Khmer Rouge 
executed as many as 3 million people, including members of the upper, middle 
or educated classes, and suspected enemies of the Khmer Rouge (de Walque 
2005, Heuveline 1998). 

Vietnam invaded Cambodia in 1978 under the guise of ousting this regime? 
Three Cambodian factions eventually challenged the Vietnamese-installed 
regime, and the Association of Southeast Asian Nations helped these factions to 
form a coalition government, which gained a seat at the United Nations. ASEAN's 
efforts led to U.N. resolutions calling for Vietnam's withdrawal of troops and the 
right of Cambodia to self-determination through elections. The peace process 
began in 1989 and continued until a peace treaty was signed in October 1991. 
The U.N. supervised Cambodia's first democratic elections in 1993, and a new 
constitution reestablished the monarchy. The Khmer Rouge abstained from these 
elections and continued armed opposition, but massive defections by the mid- 
1990s severely weakened the Khmer Rouge even before the imprisonment and 
eventual death of Pol Pot in 1998. 

During its recent history, Cambodia was largely isolated from the outside world 
(Mehta 1997), and no reliable demographic data were collected between the 
official censuses of 1962 and 1998 (Huguet 1991; National Institute of Statistics, 
Directorate General for Health [Cambodia] and ORC Macro 2001; NIS, Ministry 
of Planning 1999). After years of war, the country's population of 13.6 million is 
disproportionately female, as evidenced by a drop in male:female sex ratios from 
101:100 in 1962 to 93:100 in 1993 and 95:100 in 2005 (U.S. Census Bureau 2005). 
Recent surveys also show that Cambodia is the poorest country in Asia, and 
that the health of Cambodians is among the worst in the Pacific Region. Only 16 
percent of its population is urbanized, and the per capita gross domestic product 
is $260 (World Bank Group 2001). Rates of infant and under-five mortality are 
95 and 135 deaths per 1,000 live births, respectively, and the maternal mortality 
ratio is 450 deaths per 100,000 live births (World Bank Group 2002). The average 
number of children per woman is high (4), only 38 percent of pregnant women 
receive prenatal care, and trained providers attend only 32 percent of births 
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(World Bank Group 2002). Less than two-thirds of children are immunized against 
measles and DPT, 35 percent of children less than 3 years of age are stunted 
(meaning they have a height-for-age z-score less than -2 standard deviations below 
the international reference median), and 45 percent of children less than 5 are 
underweight (meaning they have a weight-for-age z-score less than -2 standard 
deviations below the international reference median) (NIS, Directorate General for 
Health [Cambodia] and ORC Macro 2001; World Bank Group 2002). As a result, 
life expectancy at birth in 2000 was low for men (52 years) and women (55 years) 
(World Bank Group 2002). Accidents from landmines are common, and almost 
.5 percent of Cambodians are amputees (NIS, Directorate General for Health 
[Cambodia] and ORC Macro 2001). Recent surveys have included questions about 
violence against women (NIS, Directorate General for Health [Cambodia] and ORC 
Macro 2001; Nelson and Zimmerman 1996; Surtees 2003), but little is known 
about its causes in this historically violent context. 

Ethnographies since the 1960s have described Khmer kinship as bilateral 
(Ledgerwood 1995). The ownership of property and inheritance do not strictly 
follow either the male or the female lines. Khmer marital arrangements include 
an exchange of bridewealth both to transfer “rights” over the woman and to 
provide resources for the new couple. Children belong to both parental lines, and 
newlyweds may live neolocally (in a separate residence) or with the parents of 
the wife or the husband before ideally establishing a separate house. A period of - 
uxorilocality may occur early in marriage if the woman's parents wish to confirm 
her husband's good qualities or if the woman has inherited land or a house. 

War has had a major impact on Cambodian families, and living arrangements 
today are a result of not only bilateral kinship, but also imbalanced sex ratios and 
high rates of widowhood in women (Zimmer and Kim 2001). Only 6 percent of 
ever-married women live with their in-laws (NIS, Directorate General for Health 
[Cambodia] and ORC Macro 2001), and fewer adults aged 60 years and older 
live with a married son versus a married daughter (ratio .93), a never-married son 
vs. a never-married daughter (.87), and a widowed son vs. a widowed daughter 
(.10) (Zimmer and Kim 2001). Perhaps for similar reasons, most women ages 
15-49 live near natal kin (77 percent) and have family who can offer shelter (72 
percent); yet, fewer (56 percent) have family who can offer financial support (NIS, 
Directorate General for Health [Cambodia] and ORC Macro 2001). 

Consistent with historical patterns of kinship, Cambodian women have had a 
long history of economic activity and more autonomy than other Asian women 
(Ledgerwood 1994). Parents have expected daughters to support the family 
financially (Kato 2000), and these expectations have motivated women’s uptake 
of rice cultivation and petty trade in rural areas, and rural-to-urban migration 
often to work in the garment manufacturing industry (Nishigaya 2002). Yet, 
rates of literacy at ages 15-24 are lower for women (74 percent) than men (84 
percent) (World Bank Group 2002), and only 28 percent of women ages 15-49 
own personal assets (NIS, Directorate General for Health [Cambodia] and ORC 
Macro 2001). Khmer ideals of the virtuous woman have fueled a double standard 
that women should abstain from premarital and extramarital sexual relations 
while men may enjoy both (Ledgerwood 1994, Nishigaya 2002). High rates of 
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female illiteracy, low wages in female occupations and a tacit tolerance for the 
commercial sex industry have led some women to supplement their incomes 
with paid sex (Gorbach 1998, Nishigaya 2002). 


Data 
Fieldwork and Sample 


The Cambodia 2000 Demographic and Health Survey is the country's first 
nationally representative survey on population and health. The master sample 
is based on the 1998 Cambodia General Population Census and rural and 
urban participants were chosen separately. For the sample, 471 villages were 
selected directly from the sample frame or with probability proportional to the 
number of households in the village. Five or fewer segments were retained from 
each selected village, and households were selected from listings completed 
in retained segments. Women ages 15-49 years in selected households were 
eligible for interview. Of those families, 98 percent of the heads of households 
(n — 12,236) were interviewed as were 99 percent (n — 15,351) of eligible 
women. Household heads answered questions about the age, gender, relation 
to head, marital status, schooling and occupation of all members; characteristics 
of the dwelling; and assets and consumer goods owned by the household. 
Female respondents answered questions about their family background; work; 
reproductive history; knowledge and use of contraception; prenatal, delivery and 
postnatal care; infant feeding practices; immunization and health of children; 
marriage and sexual activity; fertility preferences; knowledge of HIV/AIDS and 
other sexually transmitted infections; husband's background; and maternal and 
adult mortality. 

A 25 percent sub-sample of households was identified for administration of 
a women's status module and a domestic violence module. Questions about 
domestic violence were asked only of one woman per household, and one 
respondent was selected randomly from households in which more than one 
eligible woman lived. The DVM was administered without the knowledge of 
other household members to ensure the safety of selected respondents. Before 
completing the module, each respondent was informed of the sensitivity of the 
questions and the measures that would be taken to ensure the confidentiality of 
her responses. Interviewers were instructed to discontinue an interview if privacy 
could not be achieved, and presented to abused women who requested help 
a list of organizations providing services or referrals. Almost 98 percent (n = 
2,403) of eligible women completed the women's status and domestic violence 
modules. Relevant questions in the women's status module included those about 
the living arrangements of marital and natal kin. The DVM included a modified 
Conflict Tactics Scale, with questions about psychological, physical and sexual 
aggression, and the consequences of any violence perpetrated by a current or 
former spouse, ever and in the prior year (Straus et al. 1996).^ Also included 
were questions about physical violence perpetrated by others since the woman 
was age 15 years, physical violence during pregnancy, help sought by abused 
women, and reasons for not seeking help (if applicable). Interviewers also asked 


364 * Social Forces Volume 85, Number 1 * September 2006 


women whether they agreed that a husband is justified to beat his wife for a 
range of circumstances. 


Dependent and Independent Variables 


Dependent variables in the multivariate analysis included separate measures 
for physical and psycho-logical domestic violence in the prior year, and a measure 
of women's attitudes about wife beating at the time of interview. Analyzing recent 
rather than /ifetime domestic violence was preferred to ensure a reasonable 
temporal ordering between outcomes and covariates. 

To construct appropriate summary measures for domestic violence against 
women in this context, we estimated the prevalence of each type of psychological, 
physical and sexual violence perpetrated by a current or former husband in the 
prior year. The observed relative frequencies of each form of spousal violence 
motivated the construction of two summary measures. The first captured women's 
experience of any physical or sexual violence perpetrated by a current or former 
spouse in the prior year. This measure was based on a woman's report about 
whether a current or former husband had in the prior year (1) pushed, shaken, or 
thrown something at her; (2) slapped or twisted her arm; (3) punched her with a 
fist or something that could hurt her; (4) kicked or dragged her; (5) tried to strangle 
or burn her; (6) threatened her with a knife, gun or other weapon; (7) attacked her 
with a knife, gun or other weapon; or (8) forced her to have sexual intercourse or ` 
to perform other sexual acts against her wishes. The second measure captured 
a woman's experience of any psychological violence perpetrated by a current or 
former spouse in the prior year, including public humiliation, threats to harm her 
or someone close to her, or swearing at her. A final measure captured whether 
the respondent agreed at interview that a husband is justified to beat his wife if 
she goes out without telling him, neglects the children, argues with him, refuses 
to have sex with him, or prepares food late or poorly.® 

Measures of early-life experiences included the respondent's childhood 
residence (urban, rural), the schooling attainment of the respondent's father 
(none, any primary, any secondary, unknown), and whether the respondent 
reported that, during her childhood, her father ever beat her mother (yes, no, 
unknown). Measures of family organization and availability of kin included whether 
the respondent's mother and father were alive, the respondent's number of living 
siblings, a linear spline for the respondent's age at first marriage in years (which 
allows the slope coefficient to differ before and after any specified "knots," which 
in this case was 25 years), whether the respondent's husband or parents-in-law 
were coresiding atthe time of interview, and whether someone in the respondent's 
natal family lived close enough to visit and return the same day. A measure for 
household standard of living was constructed from indicators for assets and 
amenities of the household in which the respondent lived. Using a method that 
was developed to summarize data on households from DHS-type surveys (Filmer 
and Pritchett 1999), we recoded indicators for the following household assets 
and amenities: a radio, television, refrigerator, bicycle, motorcycle or car; source 
of drinking water (piped, well or surface water, another source), toilet facilities 
(flush, pit latrine, none or other), whether the dwelling had electricity, and whether 
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the main type of floor material was finished.® We then used principal components 
analysis to estimate a set of mutually independent principal components from 
these items (Dunteman 1989). The first principal component accounted for 31 
percent of the variance in the original items, and we used scoring coefficients 
for this component to derive a score for household standard of living. Because 
the distribution of scores was slightly right-skewed, we grouped respondents 
into quartiles according to their score. Measures for socioeconomic dependency 
or status inconsistency in marriage included the respondent's years of schooling 
relative to her spouse (8-13 fewer, 0-’ fewer, more), age relative to her spouse (10 
Or more years younger, 0-9 years younger, older), and number of living children.7 
included the respondent's religion (Buddhist, other), residence at interview 
(urban versus rural), age in years (a linear spline with a knot at 25), and number 
of children that had died. 


Analytic Methods 


The analytic sample includes 2,074 married women ages 15-49 years who 
completed the DVM and had complete data for relevant covariates (2,403 women: 
295 unmarried women, 34 women with missing data). Univariate frequency 
distributions were computed for all covariates, outcomes and variables from 
which analytic variables were derived to assess the quality and completeness of 
the data and the basic characteristics of the sample. For any covariate with more 
than 1 percent non-response, Fisher's exact tests were used to compare the 
characteristics of responders and non-responders, and no significant differences 
were identified. For comparison, relative frequencies of psychological, physical, 
and sexual violence perpetrated by a current or former spouse since the 
respondent's (first) marriage also were estimated. 

Summary measures for physical and psychological domestic violence were 
cross-tabulated with categorical measures of all covariates. Chi-squared tests and 
Fisher's exact tests, where necessary, were computed for each cross-tabulation, 
and unadjusted odds of each type of domestic violence and of ever justifying wife 
beating were computed. Covariates reflecting conceptually important domains, 
and control variables that were associated with measures of domestic violence 
in these bivariate analyses or other studies were retained for the multivariate 
analysis. P-values for all bivariate analyses were adjusted to account for the 
stratified, cluster-sample design (Research Triangle Institute 2004). 

For some covariates, more extensive bivariate analyses with measures of 
domestic violence were conducted to determine the best measurement scale 
for these covariates. Specifically, bivariate interrupted and spline regressions 
of physical domestic violence on the respondent's age, age at first marriage, 
and age and years of schooling relative to her spouse were fitted (Marsh and 
Cormier 2001). The predicted log odds of physical domestic violence from these 
fitted regressions also were plotted with the observed log odds and observed 
values for covariates to assess associations visually. Linearity in the proportions 
of women experiencing physical domestic violence by the level of continuous or 
ordinal covariates also was assessed graphically. 

SAS-callable Sudaan and multivariate logistic regression were used to estimate 
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the log odds of physical and psychological domestic violence in the prior year, 
and of women's ever justification of wife beating at interview, as functions of 
measured early-life experiences, natal and marital family variables, household 
standard of living, relative spousal resources and controls. For each multivariate 
model, an exchangeable correlation structure at the level of the village was 
specified to account for non-independence of the responses arising from the 
stratified, cluster-sample design (RTI 2004). Variables entered into the full model 
were retained in the final model if they were either marginally significant (p < 
.10) or conceptually important control variables. All final models were checked 
for potential problems of multicollinearity among the covariates using variance 
inflation factor statistics. Analyses of residuals and influence diagnostics were 
performed to check for model lack of fit. To obtain estimates of the model log- 
likelihoods for each full model, each model was rerun assuming an independent 
correlation structure and using the Binder method to calculate standard errors 
(RTI 2004). 

Estimated coefficients from models for physical domestic violence were used 
to compute predicted probabilities of physical abuse. This step was performed to 
assess the extent to which early-life experiences or relative resources in marriage 
drive the risk of physical domestic violence against married women from all 
standards of living in this setting. To complete this task, predicted probabilities 
of experiencing physical abuse in a 12-month period, by household standard - 
of living, were estimated for four hypothetical groups of women having early- 
life experiences and relative spousal resources associated with the highest, 
intermediate and lowest likelihoods of abuse. This task was achieved first by 
assigning to the subgroup of women at each level for household standard of 
living sets of values for measures of relative spousal resources and early-life 
experiences that were at least marginally associated with the risk of abuse 
(both highest risk; highest, lowest risks; lowest, highest risk; both lowest risks). 
Second, other covariates assumed their observed values for each woman in 
the subgroup. Third, the linear prediction for each woman in the subgroup was 
estimated and exponentiated, and an average predicted probability for each 
subgroup was computed. (Each subgroup consisted of 104 women, on average 
(median — 25.5), and the range across subgroups was 20-1,034.) 


Results 


Table 1 shows characteristics of the 2,074 married women in Cambodia who 
were in the analytic sample. Women were mostly Buddhist (93 percent), rural 
residents (85 percent), and more than:25 years of age (82 percent). Most women 
(84 percent) had grown up in rural areas. About one-quarter of the women had a 
father who had no schooling, and almost one-third (32percent) did not know the 
schooling attainment of their father. Less than half (43 percent) of the women 
reported that their father had any schooling, and only 9 percent had a father with 
at least some secondary schooling. Ten percent of the women reported that, 
when they were children, their father had beaten their mother, and 14 percent 
said that they did not know whether their father had ever beaten their mother. 

A majority of women (63 percent) had living mothers, but less than half (42 
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Table 1: Characteristics of Married Women Ages 15-49 Years in Cambodia, 2000 


Wife's Early-life Experiences 


% 


. Childhood residence urban (rural) 16.4 
Father's schooling, none 24.8 
any primary 34.6 
any secondary 8.5 
don't know 32.1 
Father ever beat mother, no 76.3 
Yes 10.0 
don't know 13.7 
Wife's Family Organization / Availability of Kin 
Mother alive (no, survival status unknown) 63.0 
Father alive (no, survival status unknown) 42.0 
Number of living siblings, 0 4.6 
1-2 23.8 
3-4 33.3 
25 38.3 
Age at marriage > 25 years (= 25) T2 
Husband coresident (no) 95.8 
Husband's father and/or mother coresident (no)* 3.2 
Member of birth family living close enough to visit & return in same day (no)' 77.0 
Household standard of living score 
1st quartile, range of scores -.67 
2nd quartile, range of scores -.63 to -.45 
3rd quartile, range of scores -.39 to .11 
4th quartile, range of scores .13 to 5.06 
Relative Spousal Resources / Wife's Dependency 
Age relative to husband, 0 to 9 years younger" 72.8 
10 or more years younger 8.5 
Older 18.6 
Schooling relative to husband, 8-13 fewer years 4.2 
0-7 fewer years 80.6 
More years 15.2 
Number of living children, 0 7.1 
1-2 31.8 
3-4 32.4 
255 28.7 
Other Sociodemographic Characteristics 
Religion other (Buddhist) tl 
Residence urban (rural) 15.0 
Age group in years > 25 (< 25) 82.5 
Number of children who have died, 0 65.4 
1-2 28.8 
3-4 49 
>5 1.0 





Note. For dichotomous variables, the percentage of women in the non-reference group only is 


reported, and the composition of the reference group is indicated in parentheses. 


*N = 2,072, ° N = 1,994, <N = 2,061 
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percent) had living fathers. Women tended to have several living siblings, with 
38 percent having more than five and only 5 percent having none. Most women 
(93 percent) had been married before the age of 25 and were living with their 
husband (96 percent) at the time of interview. Few women (3 percent) were living 
with a parent-in-law, whereas most women (77 percent) were living within daily 
visiting distance to a member of their own family. The narrow range of scores 
for household standard of living among women in the lower 50th percentile 
as opposed to the upper 50th percentile (-.67 to -.45 vs. -.39 to 5.06) suggests 
that “wealth” as measured here was concentrated in relatively few Cambodian 
households. Almost 9 percent of women were at least 10 years younger than their 
husbands, and 19 percent of women were older than their husbands. Although 
most women (81 percent) had up to 7 fewer years of schooling than their 
husband, notable minorities of women had markedly fewer years of schooling (4 
percent with 8 to 13 fewer years) and more years of schooling (15 percent) than 
their husbands. Only 7 percent of women had no living children, with almost 
equal distributions of women having one to two (32 percent), three to four (32 
percent), and five or more (29 percent) living children. Most women (65 percent) 
had never experienced a child's death, although almost 6 percent of women had 
lost at least three children. 

Table 2 presents rates of psychological, physical and sexual violence against 
married Cambodian women by a current or former spouse. All rates refer to either - 
the prior year or the period since the woman's (first) marriage. In general, rates of 
all types of spousal violence since marriage were consistent with their respective 
rates for the prior year. About 25 percent of women reportedly had experienced 
some form of psychological, physical or sexual violence since marriage, with 
23 percent reporting such violence in the prior year. The most common form of 
marital violence was psychological (17 percent ever, 16 percent in prior year), with 
similar percentages of women reporting to have experienced public humiliation 
(7 percent, 7 percent), threats (9 percent, 9 percent), and swearing (9 percent, 8 
percent) ever or in the prior year, respectively. Physical violence was the next most 
common form of marital violence (16 percent ever, 15 percent in prior year), with 
the most common types being "slapping" (11 percent, 10 percent) and "pushing" 
(9 percent, 9 percent). Still, more than 5 percent of women reportedly had been 
punched or had been kicked or dragged ever and in the prior year. Although less 
than 4 percent of women reported to have experienced sexual violence since 
marriage, more than 3 percent had experienced such violence in the prior year. 
Over 56 percent of women agreed that a husband is ever justified to beat his 
wife, especially if she goes out without telling him (30 percent) or neglects the 
children (32 percent). 

Table 3 shows the unadjusted odds of physical domestic violence, psychological 
domestic violence, and ever justification of wife beating for the non-reference 
versus reference categories of all covariates. All measures for the early-life 
experiences of married Cambodian women were at least marginally associated 
with some form of domestic violence in the prior year, but were neither strongly nor 
consistently associated with women's attitudes about wife beating. Compared to 
women who did not report father-to-mother physical violence in their childhood, 
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Table 2: Prevalence of Domestic Violence and Beliefs about Wife Beating 


% Reporting 'Yes' Since: 
(First) Last 12 
Marriage Months 
Any psychological, physical, or sexual violence or physical 


consequences of acts by (last) husband° 24.7 29.2 
Any psychological violence by (last) husband° 17.3 16.3 
Say or do anything to humiliate you in front of others? 7.4 6.7 
Threaten you or someone close to you with harm? 9.2 8.7 
Swear at you? 8.6 8.2 
Any physical or sexual violence by (last) husband" 17.1 15.8 
Any physical violence by (last) husband° 16.3 14.9 
Push you, shake you, or throw something at you? 9.5 8.5 
Slap you or twist your arm? 11.3 10.4 
Punch you with his fist or with something that could hurt you? 5.6. 52: 
Kick you or drag you? 5.5 5.1 
Try to strangle you or burn you? 5 ey 
Threaten you with a knife, gun, or other type of weapon? 2 23 
Attack you with a knife, gun, or other type of weapon? 1.0 0.8 
Any sexual violence by (last) husband’ 3.6 3.3 
Physically force you to have sexual intercourse even when you 
did not want to? 3.3 3.0 
Force you to perform types of other sexual acts you did not want 
to? 1,2 1.1 
A husband is justified in hitting or beating his wife if she" 
Goes out without telling him? 30.0 na 
Neglects the children? sod na 
Argues with him? 22.2 na 
Refuses to have sex with him? 11:5, na 
Food is late or not well prepared? 15.9 na 
Ever justified (based factor score)? 56.1" na 


“Total sample of women (married, ages 15-49, in Cambodia in 2000, N = 2,074) with complete 
data on outcomes and covariates. ^ N 2 2,073, * N 2 2,072 


women who reported such violence had higher odds of experiencing physical 
(OR = 2.7) and psychological (OR = 2.2) domestic violence in the prior year. 
Women who did not know whether father-to-mother violence had occurred also 
had 1.6 times higher odds of experiencing physical domestic violence. Compared 
to women whose fathers had no schooling, women whose fathers had at least 
some secondary schooling were more likely to report physical domestic violence 
in the prior year (OR = 1.7) and to report that wife beating is justified (OR = 1.4). 
Women whose fathers had primary schooling were almost 30 percent less likely 
to report that wife beating is justified (OR = 0.7). Having an urban upbringing was 
associated with marginally higher odds of experiencing psychological domestic 
violence in the prior year (OR = 1.3) 

Compared to women with living fathers, women whose fathers were deceased 
or of unknown survival status had 1.3 times higher odds of experiencing 
psychological domestic violence in the prior year, but had similar odds of physical 
domestic violence and of saying that wife beating is ever justified. Having a 
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coresident husband was associated with /ower odds of experiencing physical 
(OR = 0.4) and psychological (OR = 0.5) domestic violence. No other measure for 
family (mother alive, living siblings, in-laws coresident, proximity of natal family) 
was associated with women's experiences of domestic violence or attitudes 
about wife beating. 

Compared to women in the lowest 25th percentile for household standard of 
living, women in the upper 50th percentile were 40 percent less likely to have 
experienced physical domestic violence in the prior year (OR = 0.6). The odds 
of experiencing physical domestic violence in the prior year increased 9 percent 
with each additional living child, and the odds of experiencing physical and 
psychological domestic violence in the prior year increased at least marginally 
with each additional child death (ORs = 1.2 and 1.1, respectively). Women 
with more years of schooling and those with many fewer years of schooling 
than their husbands had higher odds of experiencing psychological domestic 
violence in the prior year (OR = 1.5 and 1.7, respectively), but had similar odds 
of experiencing physical domestic violence and of agreeing that wife beating is 
ever justified. Women who were older than their husbands also were marginally 
more likely to experience psychological domestic violence in the prior year (OR — 
1.3), but a woman's age relative to her spouse was not associated with either her 
experience of physical domestic violence or her attitudes about wife beating. 

Otherwise, being non-Buddhist was associated with almost 60 percent lower ` 
odds of experiencing physical domestic violence in the prior year (OR = .4) and 
with marginally higher odds of agreeing that wife beating is ever justified (OR = 
1.5). Among women aged 25 years or younger, each additional year of age was 
associated with 1.1 times higher odds of experiencing physical and psychological 

domestic violence. After age 25, increasing age was no longer associated with 
- experiencing domestic violence in the prior year and was marginally negatively 
associated with agreeing that wife beating is ever justified (OR = .99). Urban 
residence was associated with 1.5 times higher odds of agreeing that wife beating 
is ever justified, but was not associated with recent experiences of physical or: 
psychological domestic violence. 

Table 4 shows adjusted odds ratios from final multivariate models of domestic 
violence in the prior year and of agreeing that wife beating is ever justified.? 
Adjusting for other covariates, measures for early-life experiences were strongly 
associated with domestic violence in the prior year. Compared to women who 
reported no father-to-mother violence during their childhood, women who 
reported such violence had higher odds of experiencing physical (OR = 2.9) and 
psychological (OR = 2.1) domestic violence in the prior year. Women who did not 
know about any father-to-mother violence during their childhood also had higher 
odds of experiencing physical domestic violence in the prior year (OR = 1.7), but 
these women were less likely to agree that wife beating is ever justified (OR = 
.8). Compared to women whose fathers had no schooling, women whose fathers 
had any secondary schooling had 2.0 times higher odds of experiencing physical 
domestic violence in the prior year, and marginally higher odds of agreeing that 
wife beating is ever justified (OR = 1.4). Women whose fathers had primary 
schooling were 30 percent less likely to have experienced psychological domestic 
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violence in the prior year (OR = .7). Compared to having a rural upbringing, 
naving an urban upbringing was associated with at least marginally higher odds 
of experiencing physical and psychological domestic violence in the prior year 
(ORs = 1.4), but women with rural and urban upbringings had similar odds of 
saying that wife beating is ever justified. 

After adjusting for other covariates, retained measures for family organization 
and the survival status of natal kin generally were not associated with the odds of 
either recent domestic violence or attitudes about wife beating. Women whose 
fathers were dead or of unknown survival status were marginally more likely to 
justify wife beating than were women with living fathers (OR = 1.2). Living with 
one's husband was associated with /ower odds of experiencing physical (OR — 
.4) and psychological (OR = .5) domestic violence in the prior year. The number of 
siblings, survival status of the mother, and age at first marriage were not associated 
with either domestic violence in the prior year or attitudes about wife abuse. 

Measures of absolute household resources and relative spousal resources 
also were associated with measures of recent domestic violence, and/or with 
attitudes about wife beating, after adjusting for other covariates. Compared to 
women in the lowest 25th percentile for household standard of living, women 
in the upper 50th percentile had lower odds of physical domestic violence in the 
prior year (OR = .6), but had similar odds of psychological domestic violence 
and of agreeing that wife beating is ever justified. Compared to women with 
0-7 fewer years of schooling than their husband, women with 8-13 fewer years 
had higher odds of experiencing physical (OR = 1.5) and psychological (OR = 
1.4) domestic violence in the prior year. Women with more schooling than their 
husband also had 1.6 times higher odds of experiencing psychological domestic 
violence in the prior year. Having more living children was associated with higher 
odds of experiencing physical domestic violence in the prior year (OR = 1.2) and 
of agreeing that wife beating is ever justified (OR = 1.1). 

Otherwise, being non-Buddhist was associated with lower odds of experiencing 
recent physical domestic violence (OR = .5), but marginally higher odds of saying 
that wife beating is ever justified (OR = 1.6). Having more children who died 
was associated with higher odds of experiencing recent physical (OR = 1.3) and 
psychological (OR = 1.1) domestic violence. Neither women's age nor current 
residence was associated with the outcomes in this analysis. 

Figure 2 displays predicted probabilities of physical domestic violence for a 12- 
month period. Estimates are presented by level of household standard of living 
for four hypothetical “risk” groups of women having the following combinations 
of early-life experiences and relative spousal resources: (1) high risk early-life 
experiences and resources characteristics, (2) high risk early-life experiences and 
low risk resource characteristics, (3) low risk early-life experiences and high risk 
resource characteristics, and (4) low risk early-life experiences and resources 
characteristics. Following the results in Table 4, “high risk” early-life experiences 
include urban childhood residence, father-to-mother violence in childhood, and 
secondary schooling of the father. “High risk” resource characteristics include 
8-13 fewer years of schooling, five or more living children, and husband not 
coresident. Estimated confidence intervals show that, within each level for 
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standard of living, predicted probabilities of abuse differ significantly for group 
1 vs. groups 2, 3 and 4 and for groups 2 and 3 vs. group 4. Confidence intervals 
for groups 2 and 3 overlap, so predicted probabilities for these groups are not 
statistically different. 

All four lines in Figure 2 show modest declines in the predicted probability of 
experiencing physical domestic violence with increasing household standard of 
living. Within any household standard of living, however, predicted probabilities 
of physical domestic violence differ markedly according to whether a woman has 
adverse early-life experiences or a risky relative resource profile. For example, the 
highest-risk women have between a .77 and .87 predicted probability of physical 
abuse, regardless of their standard of living. By contrast, women with a protective 
early-life and relative resource profile have less than a .12 predicted probability 
of experiencing physical domestic violence, regardless of their standard of living. 
Women with either adverse early-life experiences or an adverse relative resource 
profile have intermediate predicted probabilities of experiencing physical 
domestic violence (.31—.49). Such figures highlight that dependence on marriage 
and adverse early-life experiences drive the risk of physical abuse against married 
women from all standards of living in Cambodia. 


Conclusions 


The purpose of this paper has been to compare the net effects of resources in 
marriage and early-life experiences on domestic violence against married women, 
and on their attitudes about wife beating, in Cambodia. Cambodia is an important 
setting in which to undertake this research because of the high levels of isolation, 
economic hardship and violence that its population has endured in recent history. 
. The availability of recent, national data on population, health and the conditions 
of marriage and family life in Cambodia also provides an exceptional opportunity 
to explore these questions in cross-cultural context. 

Findings reveal that the prevalence of violence against women in Cambodia is 
high. Levels of domestic violence since (first) marriage and in the prior year exceed 
23 percent among married women of reproductive age. Lifetime estimates of 
domestic violence in Cambodia are similar to those from parts of North America 
and Western Europe, Asia, Africa and Latin America (Hoffman, Demo and 
Edwards 1994; Kishor and Johnson 2004; Tjaden and Thoennes 1998; Watts and 
Zimmerman 2002). Estimates of domestic violence in the prior year, however, are 
lower than similar estimates from Korea (38 percent) and Bangladesh (42 percent 
physically beaten by the husband or his family) (Kim and Cho 1992, Koenig et 
al. 2003). Cross-national comparisons warrant caution, however, because of 
differences in the characteristics of samples, procedures used to collect data 
on violence and likely cultural variation in respondents' willingness to disclose 
violence. These caveats suggest a persistent need to apply standard definitions 
and procedures in studies of domestic violence, but results here still reveal that 
levels of spousal and other violence against Cambodian women are not markedly 
different from levels observed elsewhere. 

Findings also show that resources in marriage are important determinants of 
domestic violence against women in Cambodia (Hypotheses 1-3). First, women 
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with a lower household standard of living had higher odds qf experiencing 
physical domestic violence (H1). This finding corroborates results from elsewhere 
(Smith 1990, Yount 2005) and supports two mechanisms of effect that are not 
directly measured. On the one hand, men in poor households who lack economic 
resources may instead use physical force to control their partners (Felson and 
Messner 2000, Goode 1971). On the other hand, a low household standard of 
living may elevate men's stress, which increases the risk of physically abusing 
their wives. The null effect of household standard of living on psychological 
domestic violence, however, suggests that Cambodian men of all classes may 
be similarly socialized to assert their dominance in this way (Johnson 1995). 

Second, the relative resources of spouses had significant and substantial 
net effects on women's experience of domestic violence. Women who had 
fewer opportunities than their husbands because they had 8-13 fewer years of 
schooling more often experienced physical and psychological domestic violence 
(H2). Women with more living children also more often experienced physical 
domestic violence and reported that wife beating is justified (H2). Together, these 
findings corroborate those from Western and non-Western settings (Kalmuss and 
Straus 1982, Yount 2005), and suggest that women who are socioeconomically 
dependent on their partners are more tolerant of partner abuse and at higher risk 
of experiencing it. Third, women who may have challenged status expectations 
in marriage by having more schooling than their husbands had higher odds of - 
experiencing psychological, but not physical domestic violence (H3). This finding 
suggests that a man who is threatened by his wife's higher schooling attainment 
may use nonphysical violence to reinstate his dominance, but a wife with more 
schooling may have sufficient marital power to prevent physical retaliation. 

Unexpectedly (H4), neither a woman's age at first marriage nor the survival 
X Status or proximity of her natal kin was associated with her odds of experiencing 
domestic violence. The non-effects of natal kin contradict findings from India 
and Egypt, where women who are isolated from their family have higher odds of 
physical abuse (Rao 1997, Yount 2005). The non-effects of natal kin in Cambodia 
are even more surprising given its history of bilateral kinship. One interpretation of 
such findings is that measures for the survival status of natal kin are poor proxies 
for the quality of their support (Nelson and Zimmerman 1996). Yet, the non-effect 
of having natal family nearby suggests that uxorilocal living arrangements no 
longer provide their customary protections to daughters. Instead, the post-war 
marriage squeeze and economic hardship may prevent many parents from 
intervening when their daughters experience spousal abuse (Surtees 2003). 
Otherwise, the husband's residential status was the only family-related variable 
that was significantly associated with a woman's risk of domestic violence, and 
women who were living with their husband had lower odds of physical and 
psychological violence. One reason for this result could be that a husband uses 
violence to control his wife's behavior during his absence (Felson and Messner 
2000). Yet, prior domestic violence may have motivated couples to separate by 
the time of the interview, so interpretations of this finding merit caution. 

Finally, women who reported father-to-mother violence in their childhood had 
higher odds of experiencing physical and psychological domestic violence, but 
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did not more often justify wife beating (H5). These findings contradict the idea that 
children who are exposed to domestic violence learn to tolerate abusive behavior, 
and thus are more likely to experience spousal abuse. One explanation for these 
findings is that women who witnessed father-to-mother violence, while not more 
accepting of spousal abuse, may nevertheless be more likely to end up with 
violent husbands, through mate selection processes. Alternatively, these findings 
may be attributable to the high exposure of most respondents to community 
violence during childhood. Specifically, the 83 percent of respondents aged 25- 
49 years in 2000 were youth during the 1970s, and their exposure to violence 
during that period may have fostered the high acceptance of wife beating in this 
sample (56 percent). Similarly, the daughters of urban, educated families in the 
1970s may have been heavily exposed to the violence of that period (de Walque 
2005), and women of secondary-educated fathers in this sample were at least 
marginally more likely to justify wife beating and to experience abuse. 

Certain limitations of this analysis merit remark. First, retrospective accounts 
may underestimate the extent of domestic violence in this setting, and the 
reliability of women's reports is unknown because spouses were not interviewed 
(DeKeseredy and Schwartz 1998). Nevertheless, field teams worked to enhance 
disclosure by following international guidelines for research on domestic violence 
(Kishor and Johnson 2004, NIS et al. 2001), and estimates of domestic violence 
from this study corroborate those from other surveys (Nelson and Zimmerman 
1996).. Second, data were not collected on women's direct experiences of 
abuse in childhood nor on their spouses' exposure to and experiences of 
abuse as children. Because children's exposure to domestic violence and direct 
maltreatment are correlated (Graham-Bermann 2002), the effects of women's 
reports of father-to-mother violence may be overestimated here. Future studies 
in Cambodia should include detailed histories of violence in childhood to estimate 
the net effects of various experiences. Lastly, research has shown that certain 
features of neighborhoods are associated with the risk of domestic violence 
(Bailey and Pederson 1995, Gage 2005, Koenig et al. 2003, Miles-Doan 1998). We 
examined bivariate associations of domestic violence against women in the prior 
year with measures for neighborhood poverty, violence and gender inequality 
of opportunity,'? but no association was significant. These results suggest that 
neighborhood effects are not apparent in settings where violence and economic 
hardship are endemic; however, more historical measures of community context 
may be needed. Thus, further investigation of contextual effects on domestic 
violence in Cambodia is warranted. 

Assessing the findings from this analysis collectively, predicted probabilities 
from multivariate models revealed that women's adverse early-life experiences 
and socioeconomic dependence on marriage markedly increased their odds of 
experiencing physical domestic violence. Thus, even where violence is endemic, 
adverse experiences in childhood and dependence on marriage can perpetuate 
domestic violence against women of all economic strata. 
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Notes 4 


d 


Women's psychological dependence referred to their perception that wives 
would be hurt more than husbands by divorce. Women's economic depen- 
dence referred to whether the wife worked, had young children, and earned 
25 percent or less of the family's total income. 


Cambodia took part in the First Indochina War because of its geographic 
proximity to Vietnam, their colonial and family ties, and ideological ties be- 
tween the Viet Minh and Cambodian Communists (later called the Khmer 
Rouge) through the Indochinese Communist Party led by Ho Chi Minh. Al- 
though Cambodia gained independence in 1953, it gained full independence 
the following year at the Geneva Conference when France gave up financial 
control over the country. 


At first, many Cambodians believed Vietnam's claim that its invasion was 
meant to end the genocidal regime of the Khmer Rouge. Yet, the invasion 
also was motivated by Vietnam's aim to control an Indochinese federation; 
border disputes with the Khmer Rouge, which wanted to reclaim territory for- 
merly taken by Vietnam; and the involvement of both countries in the Sino- 
Soviet dispute, with Vietnam backing the former Soviet Union and Cambodia 
backing China. 


This version of the CTS did not assume that violence occurs only in situations 
of conflict. 


We used these five items and principal components analysis to rank wom- 
en according to their level of acceptance of wife beating (Dunteman 1989). 
The first component accounted for 54 percent of the variation in the origi- 
nal items; yet, derived scores for this component were neither continuous 
nor normally distributed because 44 percent of the women never justified 
wife beating. 


Filmer and Pritchett (1999) also included a measure for the number of rooms 


for sleeping in the dwelling, but this variable was not available in the Cambo- 
dia 2000 DHS. 


Spousal differences in scores for occupational prestige were considered, but 
no form of this measure was associated with either physical or psychological 
domestic violence. 


Log-likelihood statistics are not available for multivariate logistic regression 
models that assume an exchangeable correlation structure. 


Multivariate models not shown indicated non-significant effects of relative 
spousal age, residence with in-laws and proximity of biological kin. Because 
these variables had higher item non-response, they were not retained in the 
final models that are presented in Table 4. 


. We computed these measures by aggregating, respectively, scores for 


household standard of living, spousal differences in years of schooling, and 
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household questions about whether a member had experienced a violent 
injury or death in the prior year. For the first two measures, each respondent 
was assigned the mean value for all other households or residents in the vil- 
lage (excluding the respondent or respondent's household). Because violent 
injury or death in the prior year was uncommon, the cluster mean for each 
respondent included the respondent's data. 
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Communist Cadres and Market Opportunities: 
Entryinto Self-employment in China, 1978-1996 


Xiaogang Wu, Hong Kong University of Science and Technology 


Abstract 

This paper examines the patterns of entry into self-employment in urban and rural China 
and across different reform stages, focusing on how communist cadres have responded to 
new market opportunities. Analysis of data from a national representative survey shows 
that both education and cadre status deter people from entry into self-employment in urban 
areas but not in rural areas. However, urban cadres have become increasingly more likely 
to be self-employed over time, and only those who became self-employed in the late reform 
stage have enjoyed higher income returns. The paper concludes that the labor mobility 
process should be taken into account in studying changes in advantages/disadvantages 
associated with different social groups in post-socialist transition economies. 


The inequality pertaining to social positions per se and that associated with 
people who occupy those positions are conceptually distinct (Sgrensen 1996). 
Structural changes may be reflected not only in changing relationships among 
different social positions, but also in changing distributions of people among 
those positions. In studying how the institutional transition to market economies 
in former state socialist countries has affected the restructuring of inequality, 
few scholars have made a clear distinction between the two processes: on the 
one hand, the transition has re-defined the relationships among different social 
positions; on the other hand, it has facilitated workers’ mobility and thus reshaped 
the distribution of individuals among different positions. 

The mixture of these two processes has hindered our understanding of 
the causal mechanism of changing income distribution, which led to a heated 
debate in the 1990s among sociologists on state socialism (e.g., Bian and Logan 
1996; Cao and Nee 2000; Nee 1989a, 1996; Parish and Michelson 1996; Xie and 
Hannum 1996; Zhou 2000). Central to the debate was a discussion on the fate 
of communist cadres in the market transition (Szelényi and Kostello 1996). Most 
scholars tended to make direct inferences about the institutional mechanism of 
social stratification by examining economic advantages of certain groups without 
regard to the individuals' mobility among different social groups. 
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In this paper, | highlight these processes and recast the social consequences 
of the market transition in terms of different social actors’ mobility in response 
to the changes in opportunity structures in the labor markets. Using data from 
the survey of Life Histories and Social Change in Contemporary China (Treiman 
and Walder 1996), | first examine how the pattern of entry into self-employment 
varies in urban and rural settings and across reform stages, and then demonstrate 
how divergent patterns of entry help decipher the observed patterns of income 
inequality in post-socialist China. 


Communist Cadres and the Market in Transition Economies 


The emergence and subsequent expansion of a private sector in former state 
socialist countries is an integral part of their transition from redistributive 
economies to market economies (Bonnell and Gold 2002). Beginning in the late 
1960s in Eastern Europe and in the late 1970s in China, market mechanisms 
have been gradually introduced into the socialist economic operations to boost 
productivity and to increase people's living standards. 

The development of the private sector has brought about two fundamental 
changes: those in the mechanisms of resource allocation and those in 
opportunity structures. First, as the market and redistribution represent two 
distinct institutional logics of resource distribution, the introduction of the market . 
mechanism into a redistributive economy alters the relationships among different 
social positions (i.e., direct producers are more favored than redistributors in 
economic rewards allocation). Second, the growth of the market sector has 
generated new opportunities. Private entrepreneurship, for example, became "an 
alternative to bureaucratic advancement in state socialism.” (Nee 1989a:667) 

The way these changes affect social stratification happens via two distinct 
processes. The first is a causal effect: the institutional transition directly leads to 
changes in the rules of how different social positions are rewarded. The second 
suggests that a new stratification system may result from changes in opportunity 
structures and individuals’ subsequent mobility across social positions. Early 
studies on market transition in an institutional perspective have largely focused 
on the former process, whereas recent literature in this field has increasingly 
emphasized the latter process in generating diverse stratification outcomes in 
post-socialist countries. 


The Institutional Perspective: Rethinking the Market Transition Debate and Beyond 


A core issue in the market transition literature pertains to the change in 
the economic advantages of old communist cadres relative to new private 
entrepreneurs (Szelényi and Kostello 1996). Because redistributive principles 
favor one set of positions and market principles favor another set (Szelényi 
1978), some scholars predicted that the institutional transition from redistributive 
economies to market economies in former state socialist countries would involve 
a transfer of power that could benefit direct producers/private entrepreneurs 
more than redistributors (Nee 1989a). This claim was supported by empirical 
evidence of the decline in returns to political power and the increase in returns to 
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human capital and to private entrepreneurship. Based on contradictory findings, 
other scholars challenged this argument and proposed alternative explanations 
that emphasized the persistence of political power (Bian and Logan 1996), the 
conversion of political power to economic advantages (Róna-Tas 1994), and the 
co-evolution of politics and markets (Zhou 2000a). 

Notwithstanding the disagreement, scholars involved in the debate seemed to 
believe that they could directly infer the change in the institutional mechanism of 
social stratification based on income inequality observed among social groups. 
The advantages of cadres/party members are attributed to the redistributive 
system, whereas increases in returns to human capital and to private 
entrepreneurship are attributed to the market mechanism. The intellectual leap 
from the observed pattern of inequality to the underlying causal mechanism, 
however, has to rely on many overly simplistic and untested assumptions. 
Without a thorough understanding of the process of individuals’ mobility among 
different social groups, the theoretical debate on the social consequences of the 
market transition is bound to yield no solution (Zhou 2000b). 

In this light, recent research has moved beyond the framework of the market 
transition debate and attempted to explain diverse empirical findings within 
concrete institutional contexts. The changing returns to political power and to 
human capital may be mediated by workplace organizations (Wu 2002, Zhou 
2000a), local government-market relationships (Parish and Michelson 1996) or 
alternative income sources (Walder 2002), rather than associated with the intrinsic 
features of market institutions. Who wins and who loses thus are complicated 
issues, largely contingent on different institutional arrangements between the 
market and the state. 

After specifying the institutional contexts, empirical findings previously 
cited to support different theoretical arguments are no longer contradictory. 
As Szelényi and Kostello’s (1996) pointed out, the rapidly expanding market 
sector created new opportunities, and those who were able to take advantage 
of the expanded new opportunities would become the winners. However, who 
was able to do so varied under different circumstances. In the early stage of 
the reform when participation in the market was highly risky and required little 
skill, most self-employed entrepreneurs came from the lower tiers of the social 
hierarchy. As private economic activities became legal and market competition 
played a greater role in economic operations, people with more human capital 
and political capital began to be involved in business activities. Some cadres also 
managed to convert their political privileges into new economic advantages in 
this stage. On the other hand, as Walder (2003) argued, market reforms not only 
created entrepreneurial opportunities in the private sector, but also generated 
new value for public properties, thereby creating opportunities for elite insiders. 
To what extent the elite members were able to appropriate the assets was 
contingent upon the extent of the regime change and the legal barriers, which 
varied substantially across different countries in different reform stages. 

Hence, these studies suggest that the macro-level institutional transition from 
redistribution to markets had no straightforward effect on the change in elite 
advantages in the post-socialist era. Instead, the effect is largely dependent 
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upon some intermediate processes, among which the change in the opportunity 
structure has received increasingly more attention. It is varying opportunities that 
link the macro-level institutional transition to the diverse stratification outcomes 
among individuals. 


The Opportunity-Mobility Perspective 


While changing opportunity has been recognized in the study of post-socialist 
stratification, the opportunity structure is vaguely conceptualized and its linkage to 
social mobility remains to be investigated. Nee (1989a) correctly pointed out that 
the emergence of a market economy would alter the structure of opportunities 
and give rise to entrepreneurship as an alternative avenue for social mobility, yet 
his empirical analysis of income attainment in rural China is misguided to test the 
effect of market opportunities, denoted by the physical distance from surveyed 
villages to cities and to marketplaces, rather than the two a/ternatives between 
redistribution and the market. 

Within such a dual structure of opportunities, “one could climb the rank order 
of the bureaucratic hierarchy, or one could try the market.” (Szelényi 1988:65). 
Who stayed in the hierarchy and who switched to the market sector? This 
question is crucial to the change in the post-socialist stratification order. Early 
research reported that the main groups involved in self-employment activities 
in rural China and Hungary were from lower tiers of the social hierarchy (Nee ` 
1989a, Szelényi 1988) and that the benefits they gained from the market would 
undermine communist cadres' privileges (Cao and Nee 2000; Nee 1989a, 1996). 
However, cadres did not always stay in the bureaucratic hierarchy. After the 
collapse of the communist regime, Hungarian cadre elites were more likely 
than ordinary workers to convert themselves into corporate entrepreneurs and 
to maintain their economic advantages (Róna-Tas 1994). In post-Soviet Russia, 
former communist party members continued to enjoy income advantages, which 
were hardly attributable to the collapsed socialist institutions (Gerber 1999). 

Recent studies have explicitly incorporated structural changes in accounting for 
the diverse stratification outcomes in post-socialist countries. Structural change 
(changing compositions of available positions) is conceptually different from 
institutional change (shift of property rights). In post-Soviet Russia, Gerber (2002) 
demonstrated that structural changes, induced by institutional changes, directly 
affected individuals’ life chances and thereby yielded variant consequences 
in social stratification. In rural China, Walder (2002) examined the impact of a 
specific structural change - the spread of local wage labor - on the pattern of 
income inequality. As the rapid economic expansion created opportunities for 
wage employment other than through farming and private entrepreneurship, the 
changes in income returns to cadres and to entrepreneurs were dependent upon 
the re/ative importance of wage employment and entrepreneurship in the local 
economy, which was not necessarily related to institutional transitions. 

In spite of structural changes, such as increases in highly rewarded positions 
(e.g., entrepreneurs), individuals may or may not leave their previous positions 
and move to new ones. As Blau (1977:5) put it, "processes of social mobility are 
an essential element in most forms of structural change," understanding how 


Communist Cadres and Market Opportunities * 393 


the inequalities among different social groups were generated requires a close 
examination of the processes of mobility to particular positions. Moving a further 
step, Wu and Xie (2003:429) argued that "workers are dynamic social actors who 
are not simply affected by the market, but rather respond to it by actively situating 
themselves in the labor market." They emphasized the impact of differential 
mobility processes on the changing pattern of income distribution and challenged 
the institutional causal explanations that many scholars previously adopted in 
studying post-socialist stratification. Nevertheless, such differential mobility 
processes, an important basis of Wu and Xie's (2003) alternative explanation, 
have not been empirically demonstrated in their analysis. 

This research explicitly conceptualizes the process of how institutional 
transitions affect social stratification outcomes as the process of how transitions 
alter opportunity structures in labor markets and of how individuals respond 
to changing opportunities by moving among different positions. | focus on the 
process of mobility into self-employment because the emergence of a self- 
employed private sector in former socialist countries resulted directly from the 
institutional transition to the market economy, and income advantages associated 
with cadres and entrepreneurs have been among the most controversial issues 
in the post-socialist stratification literature. | analyze how the patterns of entry 
into self-employment vary between urban and rural areas and across different 
reform stages and demonstrate how the variations help us make sense of the 
changes in income inequality in reform-era China. 


Paths to Self-Employment and Distributional Outcomes in China 


The development of the private sector has come a long way in China. Private 
ownership was largely eradicated during the first 30 years of the People's 
Republic. The economic reforms since 1978 have brought about a revival of self- 
employment and private ownership. Unlike the radical privatization programs 
in post-1989 Eastern Europe and Russia, Chinese reformers have adopted an 
incremental approach to expanding the private sector without first closing down 
or privatizing state-owned enterprises (Qian 2000). 

The government initially imposed various restrictions on the growth of the 
private economy. In the early 1980s, only individual/household businesses (geti hu) 
were granted legal status, and a cap of eight was set on the number of employees 
a geti hu could. hire. Private enterprises (s/ying qiye) on a larger scale, many of 
which evolved from individual/household businesses, were not sanctioned until 
the mid-1980s. After a setback from 1989-1991, the development of the private 
sector gained new momentum after Deng Xiaoping called for further market- 
oriented reforms in his famous tour to southern China in 1992. Private ownership 
became fully legitimized and played an increasingly important role in China's 
economic growth.! Industrial output from the private sector increased from less 
than 2 percent of the national total in 1985 to 34 percent in 1997. Employment 
in the private sector also grew rapidly, from only 2 percent of the national total 
non-agricultural labor force in 1981 to more than 18 percent in 1997 (Garnaut et. 
al. 2001). Clearly, more and more people were taking advantage of the expanding 
market opportunities to achieve socioeconomic mobility. 
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While the emergence of the private sector opened up a new window of 
opportunity in China, the social implications were different for rural and urban 
residents, whose opportunities had been largely structured by the household 
registration (hukou) system, implemented by the government since 1955. In the 
pre-reform era, rural residents were prohibited from moving into cities without 
the government's approval, and they were entitled to few rights and benefits 
that the socialist state conferred upon urban residents (Wu and Treiman 2004). 
Without an urban hukou status, even cadres at the village and township levels 
were not part of the state bureaucratic system and thus not as privileged as 
ordinary urban workers.? 

Soon after the reforms started, the rural collective farming system was largely 
abolished, whereas the urban redistributive sector continued to exist (though 
it shrunk substantially). Consequently, in rural areas, self-employment provided 
a major avenue of mobility for those who were deprived of socioeconomic 
opportunities under socialism. Making money was the only way that rural 
households/residents (including cadres) could reduce their disadvantages. In urban 
areas, a dual opportunity structure gradually evolved in the labor market. With the 
state still remaining the predominant employer in providing social security, fringe 
benefits, and political advancement, the private sector offered an alternative (but 
increasingly attractive) career path for social mobility (Davis 1999). 

Given the opportunity structure they faced, urban and rural residents who - 
became self-employed may have differed in their social backgrounds. Early 
ethnographic studies reported that rural entrepreneurs were mainly production 
team cadres, “sent-down” urban youth who remained in the rural areas, and 
discharged soldiers, all of whom possessed more human capital and/or political 
capital than the general rural population (Nee 1989b). In urban areas, most 
individual/household business owners and private entrepreneurs came mainly 
from marginal social groups, such as migrant peasants, unemployed youth, 
dismissed employees and retirees, who had little to lose but more to gain by 
involving themselves in private businesses (Gold 1991, Li 1993). 

Hence, | expect that the pattern of entry into self-employment in urban areas 
differed from that in rural areas. Specifically, with respect to cadres' involvement 
in this process, | propose to test the following hypothesis: 


Hypothesis 1: Cadres are less likely than non-cadres to 
become self-employed in urban areas, but this is not so in 
rural areas. 


Urban cadres had alternative opportunities within the state sector, which is 
heavily subsidized based on the political hierarchy, whereas rural cadres had 
no such choice after 1978 in an unsubsidized rural economy largely built upon 
social tradition and family relations. Even though rural cadres were inclined to 
get involved in self-employed activities with their social capital, they had no 
monopoly over the entrepreneurial opportunities in the market, as the barriers to 
entry in the countryside tended to be quite low. 
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The opportunity structures not only differed between urban and rural labor 
markets but also changed over time. The Chinese government policies towards 
the market economy have changed in the past two decades. When the private 
sector was restricted to a peripheral role in the redistributive economy, people 
involved in self-employment mainly came from socially marginal groups who had 
little to lose in urban settings. However, as the market gained more legitimacy 
and played a greater role in economic operations, more capable people moved 
into the market as a result of structural changes in opportunities. 

In China, with the Communist Party still holding power, the market economy 
gained full legitimacy in 1992, and entry into self-employment was accelerated. 
More cadres were giving up their career opportunities within the urban 
redistributive sector and "jumping into the sea" (xia hai) (Wu and Xie 2003). 
Therefore, by introducing the time dimension in the process of entry into self- 
employment, | test the following hypothesis: 


Hypothesis 2: Despite the fact that urban cadres may still be 
less likely to become self-employed, they are more likely to 
do so over time as reforms proceed. 


How do such differing transition dynamics account for income distribution 
in post-socialist China? Wu and Xie (2003) found that higher earnings among 
urban workers in the market sector (including the self-employed) are limited to a 
portion of those who entered the market in the late reform stage. Furthermore, 
among the later entrants, those who are more likely to stay in the state sector 
but have voluntarily given up benefits and privileges to enter the market tend to 
have higher earnings (Xie and Wu 2005). Because urban cadres constitute an 
increasing proportion of the self-employed workers as reforms proceed, higher 
earnings among later entrants may come from the higher economic achievements 
among cadre-converted-entrepreneurs. Therefore, | test the following hypothesis 
regarding income distribution: 


Hypothesis 3: Urban cadres who become self-employed in 
the late reform stage tend to receive higher incomes than 
do other groups. 


Hypotheses 2 and 3 focus on urban transition dynamics and its distributional 
consequences for two reasons. First, the dual opportunity structure exists only in 
urban areas. This structure deters workers’ involvement in self-employment. Second, 
the urban and rural reforms have followed distinctive paths. While the rural collective 
farming system was completely abolished soon after the reforms started, the urban 
redistributive sector was gradually shrinking but it still exists even to date. Thus, the 
effect of reform stages on entry into self-employment may be more prominent in 
urban areas than in rural areas. With the market-oriented reforms proceeding further, 
the urban-rural pattern of entry into self-employment may approach a convergence 
in the future. In testing the two hypotheses, parallel analyses for the rural sample are 
compared and contrasted to those for the urban sample. 
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Data, Variables and Methods 


` 


The three hypotheses are tested with data collected in 1996 from the survey of 
Life Histories and Social Change in Contemporary China, a multi-stage stratified 
national probability sample of 6,090 adults ages 20-69 from all regions of 
China (except Tibet) (Treiman and Walder 1996). The survey gathered extensive 
information on respondents' life histories and job activities. Samples from urban 
and rural areas were drawn separately, yielding 3,087 rural cases and 3,003 
urban cases (Treiman 1998:Appendix D). These two samples are combined, 
with appropriate weight, to form a national probability sample of the general 
population in China. 

This paper investigates the process of entry into self-employment from 1978 
to 1996 in both urban and rural areas. Literally, | define “self-employment” as 
working for oneself in non-agricultural sectors, as opposed to wage-earners/ 
farmers (Steinmetz and Wright 1989). While the self-employed may be not 
equivalent to “entrepreneurs,” they represent an easily identifiable social group 
that approximates the spirit implied by entrepreneurial activities (Aldrich and 
Waldinger 1990, Gerber 2001). Workers can also be engaged in entrepreneurial 
activities without necessarily changing their occupations (Davis 1999). Corporate 
entrepreneurs who do not receive regular salaries are not included in the two 
categories above. Hence, | define self-employment broadly on the basis of the 
respondent's main source of income: if it is from “running a business" or "set, 
employed professional" (ziyou zhiye), or "helping out with the family enterprise or 
business," then the respondent is coded as being self-employed? 

Figure 1 plots the cumulative percentages of self-employed workers in the 
urban and rural labor forces from 1978 to 1996.^ The temporal trend is consistent 
with historical observations: few people were involved in self-employment when 
the reforms first began (only 2 percent in both urban and rural areas). As the market 
transition proceeded, the share of self-employed workers increased substantially 
beginning in the mid-1980s, and the urban-rural gap, almost nonexistent prior to 
1985, also widened significantly. By 1996, the share of the self-employed reached 
13.3 percent in urban areas but only 7.5 percent in rural areas. 

The different profiles for urban and rural self-employed workers in contrast 
to non-self-employed workers are presented in Table 1. In urban areas, cadres 
or former cadres constitute 7.44 percent of the self-employed, much lower 
than their percentages among other workers who are not self-employed (24.6 
percent). In rural areas, cadres or former cadres account for 4.92 percent of 
the self-employed, which is slightly higher than their percentage among other 
workers.° Similarly, party members constitute only 2.3 percent of the urban self- 
employed but 20 percent of other urban workers, whereas 4.57 percent of rural 
self-employed and 6.10 percent of other rural workers are party members. In 
urban areas, self-employed workers tend to be less educated than other workers 
in terms of both average years of schooling and educational attainment. In rural 
areas, self-employed workers tend to have more education than do other workers. 
The results suggest two sharply different patterns of entry into self-employment 
in urban and rural areas in regard to some key characteristics. 
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Figure 1. Percentage of the Self-employed in Urban and Rural China, 1978-1996 
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Table 1: Summary Statistics for Selected Characteristics: Self-Employed and Non-Self-Em- 
ployed Workers in Urban and Rural China, 1996 


Urban Rural 
Non- Self- Non- Self- 
self-employed employed self-employed ^ employed 
% in the labor force 86.7 13.3 92.5 7.50 
Cadres/former cadres 24.6 7.44 3.36 4.92 
Party members 20.0 2.30 6.10 4.57 
8.72 8.00 25 7.45 
Years of.schooling (4.09) (3.22) (3.85) (2.98) 
Education level: 
Primary school 28.6 34.1 66.1 40.2 
J. high 39.0 46.8 27.0 49.9 
S. high/vocational 26.3 16.4 6.50 9.10 
College or above 11.6 2.58 40 10 
Rural hukou status 9.15 2151 — = 
Number of cases 2641 415 2772 224 


In the reform era, many rural residents were allowed to move into cities but 
still maintained their rural registration status (see note 4). These rural migrants 
faced serious discrimination in urban labor markets (Wang et al. 2002) and many 
became involved in self-employed activities. In the survey data analyzed here, 
rural migrants account for 31.1 percent of self-employed but only 9.15 percent of 


urban wage employees. 
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To examine the different patterns of entry into self-employment in urban and 
rural areas and the changes over time, | employ discrete hazard models in an 
event history analysis. A discrete hazard model involves a shift of the unit of 
analysis from respondents to life events (i.e., entry into self-employment) at a 
specific time (year). In this case, all non-self-employed workers are considered 
"at risk" of entry into self-employment in each year starting from 1978 or the year 
they turned 20, whichever is later. Those who had not yet become self-employed 
by 1996 are right-censored. 

After restructuring the data, a discrete-time hazard model can be fitted via 
conventional procedures for estimating binary logit models (Allison 1982). The 
dependent variable is whether or not the respondent became self-employed in a 
specific year between 1978 and 1996. It is coded as a dummy variable (yes = 1). 

The independent variables include cadre status, party membership, education, 
gender, age, whether one of the parents was self-employed when the respondent 
was age 14, whether the respondent was a rural migrant, and years since the 
reforms started in 1978. Cadre status and party membership capture the political 
capital an individual possesses. While political credentials (party membership) 
may lead to cadre/administrator status, power is intrinsically associated with the 
latter rather than with the former (Walder, Li and Treiman 2000). Cadres, whose 
skills are in administration and who trade heavily on their social connections, are 
more likely to acquire new advantages in the market by converting themselves. 
into entrepreneurs. Ordinary party members may not have direct advantages in 
cashing in on their privileges in the socialist hierarchy. Hence, this paper focuses 
on the role played by cadres in the process of entry into self-employment, with 
party membership as a control variable. Both variables are coded as dummies 
(yes = 1), which indicate the person's status prior to the year at risk. 

Education, which signifies human capital, is an important determinant of entry 
into self-employment. Because previous research reported that the effect of 
education may not be linear (e.g., Luber et.al [2001] for West European Countries 
and Gerber [2001] for Russia), education is a time-dependent covariate, measured 
by four levels completed by the year at risk (1 = primary school or less; 2 = junior 
high school; 3 = senior high school/vocational school; and 4 = college or more). 
Education is treated as a set of dummy variables in the models. 

Age, gender and parents' self-employment status are also important factors 
that determine the entry into self-employment. Age refers to the difference 
between the year at risk and the respondent's birth year; like cadre status, 
party membership and education, it is also a time-dependent covariate. Gender 
is a dummy variable (male — 1). Parents' self-employment status, a proxy for 
the impact of family background, is coded in the same way as the dependent 
variable is, namely, based on parents' reported main source of income when the 
respondent was 14 years old. 

Because the Chinese economic reforms have in general been moving in a single 
direction without major reversals (see Figure 1), a single continuous variable, 
the elapsed years since 1978, is included in the models to capture changes in 
opportunity structures in regard to the development of the private sector. Table 
2 summarizes the characteristics of workers who became self-employed from 
1978 to 1996. 
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Table 2: Selected Characteristics (Weighted) for Self-Employment Entrants in Different 
Reform Stages 


Urban Rural 
Overall Stage Stage II Overall Stage | Stage ll 
(1978-86) (1987-96) (1978-86) (1987-96) 
Annual entry rate % .97 T1 1.26 54 54 54 
Former cadre % 6.88 4.21 8.51 5.18 6.25 4.03 
Party member % 2.69 3.95 1.91 2.75 1.88 3.69 
Education level: 
Primary school 321 37.7 28.7 37.4 41.3 33.2 
J. high school 44.3 37.2 48.7 48.2 46.3 50.3 
S. high/vocational school 20.5 227 19.1 119 10.9 12.4 
College or above 3.1 24 3.5 2.8 1.5 4.0 
Male 96 54.5 61.3 50.3 70.1 72.5 67.4 
Age 29.2 26.9 30.6 25.0 23.1 27.0 
(11.0) (11.0) (10.8) (9.61) (8.15) (11.6) 
Parent self-employed 12.8 16.1 10.9 6.96 7.81 6.04 
Rural migrant 31.3 27.1 33.9 - - - 
Years since 1978 10.5 5.32 13.6 9.09 4.68 13.8 
(4.69) (2.67) (2.24) (5.18) (2.70) (2.09) 
# of events 438 170 268 243 130 113 


Notes: Data are weighted; 
The figures in parentheses are standard deviations for the continuous variables. 


To see how patterns of entry into self-employment change over time, the period 
between 1978 and 1996 is divided into two reform stages. Stage | (1978-1986) is 
the initial period when the market sector emerged only as a marginal element of 
the redistributive economy and when the reforms were mainly focused on rural 
areas. In Stage II (1987-1996), the private sector grew rapidly and its importance 
in the national economy increased, particularly in cities in the 1990s. Summary 
statistics for self-employed workers are presented separately in the two reform 
stages. 

As shown in Table 2, over the course of the reform, the average annual rate of 
entry into self-employment increased from .71 percent in Stage | to 1.26 percent 
in Stage Il in urban areas, whereas it remained approximately the same in rural 
areas. Notably, in urban areas, the share of (former) cadres among the self- 
employed increased from 4.21 percent in Stage | to 8.51 percent in Stage Il. In 
rural areas, the percentage of former cadres declined slightly instead, from 6.25 
percent in Stage I to 4.03 percent in Stage ll. 

In the following, | first analyze the urban-rural differences in entry into self- 
employment and the changing role of urban cadre status in the process, using 
discrete-time logit models. | then employ linear regression models to investigate 
the implications of different patterns of entry into self-employment on income 
inequality in post-socialist China. Because the samples are clustered within 100 
city districts/counties, an adjustment on standard errors is needed in both logit 
models and hazard models (Walder, Li and Treiman 2000). Robust standard errors 
are reported for all the models to correct the clustering on sampling units. 
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Table 3 presents the estimated coefficients for the discrete-time logit models 
separately for the urban and rural samples. To gauge the temporal variations, | first 
estimate additive models (Model 1a for the urban sample and Model 3a for the 
rural sample) with a continuous variable to represent elapsed years since reforms 
began (reform years). | then include an interaction term between cadre status and 
reform years to test whether cadres are more likely to become self-employed 
over time (Model 1b for the urban sample and Model 3b for the rural sample). 
Because migrants account for more than 30 percent of urban self-employed (see 
Tables 1 and 2), the urban sample excludes rural migrants (Models 2a and 2b). 

Models 1a and 3a demonstrate sharply different patterns of entry into self- 
employment in urban and rural areas. In urban areas, cadre status significantly 
deters one from entry into self-employment. Cadres' odds of becoming self- 
employed are only 47.7 percent (— e?/^') of non-cadres' odds, holding constant 
all other variables (p «.001). In rural areas, cadres indeed are more likely than 
non-cadres to become self-employed, but the difference is not statistically 
significant (probably due to the small sample size). This evidence lends support 
to Hypothesis 1, which suggests that cadres are less likely than non-cadres to 
become self-employed in urban areas but not so in rural areas. A 

Party membership plays a different role. Party members are significantly less 
likely than non-party members to get involved in self-employed activities in urban 
areas. The net odds of being self-employed for a party member are only 22 percent 
(= e1514) of the odds of being self-employed for a non-party member in urban 
areas (p < .001), but the difference is not statistically significant in rural areas. 

The effects of education on the likelihood of becoming self-employed in urban 
and rural areas are both statistically significant but in opposite directions. In 
urban areas, people with more education are less likely to be involved in self- 
employment. The net odds of becoming self-employed are 65.8 percent (— e9418) 
for junior high school graduates, 45.6 percent (— e?799) for senior high school/ 
vocational school graduates, and only 25.3 percent (= e134) for people with 
college education or above, compared with the odds of becoming self-employed 
for people with primary school education. In contrast, people with more education 
are more likely to become self-employed in rural areas: the net odds are 2.3 
times (= e°®8) for junior high school graduates, 2.1 times (— e?759) for senior high 
school/vocational school graduates, and 8 times (— e?) for college graduates, 
as much as those for people with primary school education. 

The effects of remaining variables are not surprising. First, in both urban and 
rural areas, age is negatively associated with the likelihood of entry into self- 
employment. A year increase in age reduces the net odds of becoming self- 
employed by 5.8 percent (— e?9?9-1) in urban areas and by 6.9 percent (= e202) 
in rural areas (p « .001). This is probably because older people have invested 
more in skills specific to the redistributive sector and are thus more averse to 
risks of career change (Gerber 2001, Róna-Tas 1994). Second, as found elsewhere 
(Gerber 2001) and also by others in China (Davis 1999, Entwisle et. al. 1995), 


Communist Cadres and Market Opportunities * 401 


'(siser peprea-o^3) 100" > d,,, I0 di, 0° > d, "poxg8row re eq 
'(SaÀnunoo/siornsrp) syrun Surdures pjedi»urid uo Suuojsnp 10J poisnípe s10119 prepurys jsngol ase sasayuaied ut sainBtq :s9}0N 







GC Lët Sr98€ Deg? ysu je jeeÁ-uosJeg 
Erz 6/2 ger SJUƏAƏ JO JequinN 
GOvl- L0ti- €vSL- Gts- GIL: ES pooulləyii-6o1 
£60 260: 980' Co Pë CR 24 opnəsq 
(c9) (29€) (ee) (rst) (cic) (812) 
x007- 290€ y” E So VEZ ES mole Ee dr C. Jue]suoo 
Leon) v0) (Gro) 
dk = «vor = +180' = sieəÁ ulloJ9),səIpeO 
u0198.Ja]u| 
(vio) Of (61.0) (910) (910) (910°) 
£0" JETT 2/40 990! «060 160 sjeoÁ uuojey 
Ga Goal (6ez:) (Lyc) (coz) (voz) 
Let «062 | «EZ | wxx6CC L xxx600 | SZOL pəÁolduue-Jləs JuaJeg 
(181) (zer) (gel) (sel) (901) (901) 
sxx VOL 991 mA lOr Salt +::809 SEW 
(zio) (c10) (900) (900) (900) (S00) 
O LO0'= s620 = wall wel xxx 190"- xxx 090 - ] JeəÁ ye aby 
(cav) (zzy) (80€) (80€) (962) ez) 
LOL E #910 212206. E xx 58€ bs OBS Ë LE j: 9A0qe JO eDojo? 
(62€) (Lee) (mz) (Loz) (291) (191) 
«094 «684 x» 96 ` xx V 86 7 EE 8 E ¡euoy eso ybly IS 
(191) (091) GIN (v91) (cc) (cz) 
xe HE 1.008. xad 0S8 7 waat H xxbbys +420 Ly- ¡00y9s uBiu if 
[papuo] joouos fueuug :uoneonp3 
(oc) (ops) (cv0 1) (cec) (S6z’) (r6Z') 
188'- zg" NIS E r GY | 209 15 100 ES 19quiau Áped 
(204) (Les) (9€9) (epz) (148) (gez) 
«928 | 9€9' «D9L' |" «609 - «801 1- PINE SIE? 
q£ IƏpoN eg J9poW qz IƏpoN ez po aL po EL po 
SjueJ6iy [en Buipnjox3 sjueJbiy jen Buipnjou] 
Jen ueqin 


a ee eee 
9661-8/Z61 “euruO j[e1myg pue ueqa/) ur jusurko[durq-jos ojur £1juq uo s[;pojX piezeH our -23242s1( 1oJ sə)eurnsq JUIDIYJIO) :£ IQBL 


402 * Social Forces Volume 85, Number 1 * September 2006 


men are more likely to become self-employed than are women. The odds of 
being self-employed for men are 1.7 times (— e**) the odds for women in 
urban areas, and 2.2 times (— e759) the odds for women in rural areas, net 
of other factors (p « .001). Finally, family background significantly affects the 
likelihood of entry into self-employment in both urban and rural areas. The net 
odds of becoming self-employed for people with at least one self-employed 
parent (when they were 14) are 2.8 times (= e**) the odds for people with no 
self-employed parents in urban areas, and 3.6 times (= e' 2%) in rural areas. This 
result confirms previous findings that entrepreneurial spirits can be transferred 
across generations (Szelényi 1988). 

To capture the temporal trend of entry into self-employment, | include a 
continuous variable, reform years, in the event history model. Model 1a and 
Model 3a show that the hazard rate of entry into self-employment increases as 
the economic reforms proceed. In urban areas, the net odds of entry increase 
by 10.2 percent (7 e9997-1) per year after 1978 (p < .001). The transition pace 
was much slower in rural areas: the net odds of entry increase by only 3.8 
percent (= e°°37-1) per year (p < .05). These results are consistent with the 
patterns shown in Figure 1. 

To test further the hypothesis regarding the changing role of cadres in the 
involvement in self-employment over time, | run interaction models between 
cadre status and reform years: Model 1b for the urban sample and Model 3b - 
for the rural sample. While cadres are still less likely to become self-employed 
than are non-cadres in urban areas, they are more likely to do so over time, as 
suggested by the significant coefficient of the interaction term in Model 1b (p < 
.05). Cadres' net odds of entry increase by 9.1 percent (= e 7-1) every year after 
1978. Cadres' net odds of becoming self-employed are only 18.2 percent (7 e 
- 170) of those for non-cadres in 1978, but the figure increases to 87 percent (= 
e1705:6678/087) in 1996. The evidence suggests that urban cadres, despite the dual 
opportunities available to them, are increasingly more likely to get involved into 
self-employed activities over time. Hypothesis 2 is supported. 

The rural transition scenario is the opposite. Model 3b shows that rural cadres' 
net odds of becoming self-employed were 4.6 times (= e!5®) the odds of non- 
cadres (p < .05) in 1978 (reform years = 0); but their advantages diminish quickly 
as the reforms proceed, by 11.3 percent (= €?12-1) every year since then, although 
the decline is only marginally significant (p < .10). This is in sharp contrast to 
the pattern found in urban areas, but confirms that rural cadres, who lacked 
alternative opportunities, were among the first to seize the newly emerging market 
opportunities. Because most cadres with entrepreneurial propensity may have 
already become entrepreneurs in the countryside, they were no long at risk. Other 
social groups’ continuing flow into self-employment in the later years of reforms, 
as indicated by the temporal increase in the entry rate, may have led to the overall 
insignificant difference between cadres and non-cadres in self-employment. 

Hence, the event history analysis reveals two different dynamic patterns of 
entry into self-employment between urban and rural areas and confirms both 
Hypotheses 1 and 2. As the reforms proceed, people are more likely to become 
self-employed in both urban and rural areas. While cadre status, party membership 
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and education significantly deter people from entry into self-employment in urban 
areas, they have either significantly positive or insignificant effects on entry into 
self-employment in rural areas. Furthermore, while urban cadres were initially less 
likely than non-cadres to become self-employed, they were increasingly likely to 
do so as the reforms proceeded. Rural cadres with entrepreneurial propensity, on 
the other hand, had a/ready seized entrepreneurial opportunities at the beginning 
of the reforms. 

It should be noted that the contrast in the transition dynamics between urban 
and rural areas could be confounded by surging migration, particularly from rural 
to urban areas, in the reform era. As presented in Table 1, more than 30 percent of 
people in urban areas did not hold the local hukou status. Lacking opportunities 
in urban labor markets, these rural migrants were more likely to get involved in 
self-employed activities, just as immigrants in other countries were (Aldrich and 
Waldinger 1990). To verify that the pattern of entry into self-employment in urban 
areas was not dominated by migrants, | replicated the analysis excluding those 
migrants. Results in Models 2a and 2b show that the urban transition dynamics 
remain largely the same. 

The above findings from China echo the broad sociological literature on the 
rise of self-employment in western economies (Aldrich and Waldinger 1990, 
Arum and Muller 2004). While some scholars stress the role of social marginality 
in entrepreneurial formation by suggesting that the self-employed come mainly 
from the disadvantaged groups who are excluded from access to pre-existing 
economic and political opportunities. Others emphasize the advantages of 
privileged groups with both financial and intellectual resources (see review in 
Martinelli 1994). Different paths to self-employment also have varying implications 
on inequality and class structure. For example, scholars have debated whether 
the recent growth in self-employment will lead to the rise of a “new middle class” 
(Steinmetz and Wright 1989) or to labor marginalization and proletarianization 
(Arum and Muller 2004). To explore the implications of the different transition 
processes revealed above, | conduct a supplementary analysis on income 
inequality in China. 


Regression Analysis on Income Inequality 


Entry into self-employment is evidently an important avenue for upward 
socioeconomic mobility. In terms of income gains, the self-employed are winners 
in the transition. As of 1996, in urban China, the per capita income for the self- 
employed households was 10,975 RMB yuan (1 RMB yuan = .12 USD), 60 percent 
higher than that for those not engaged in self-employment. In rural areas, the per 
capita income for the self-employed households is 6,666 RMB yuan, which was 
2.4 times that for those not engaged in self-employment.’ 

To investigate how the differing transition dynamics account for the income 
distribution in 1996, the linear regression model of the logarithm of per capita 
income is estimated on a set of independent variables. Following Wu and Xie 
(2003), the two subgroups among the self-employed are identified as those who 
entered early (in or before 1986) and those who entered late (after 1986). 
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Table 4 presents the results from the linear regression analysis. In Model 4a, 
independent variables are: the logged per capita income in 1986, education, party 
membership, cadre status, age, gender and entrant type.* Because the per capita 
income in 1986 is controlled in the equation, the net effect of an independent 
variable on the logged per capita income can be interpreted as that on income 
change. Similar to the results reported by Bian and Logan (1996) and Zhou 
(2000a), education, party membership and cadre status all have positive impacts 
on income growth (p « .001). The effect of age is also positive and significant (p 
« .001). Men enjoyed significantly lower income growth than did women in the 
period from 1986 to 1996. ? 

The most interesting finding in Model 4a is that only those who became self- 
employed after 1986 enjoyed as much as a 17 percent (= e.°'°7-1) higher income 
over those who remained wage employees. This result confirms the argument 
by Wu and Xie (2003) that people with more human capital and political capital 
were self-selected into the market in the late reform stage and that those people 
tended to do better economically (also see Szelényi and Kostello 1996, Xie and 
Wu 2005). 

Specifically, the preceding analysis has shown that urban cadres are more 
likely to become self-employed over time as reforms proceed. To see whether 
the changing composition of the urban self-employed (namely, more cadre- 
entrepreneurs) could account for the advantages held by the later entrants, | - 
include interaction terms between cadre status and entrant type in Model 4b. 
Results show that income advantages enjoyed by later entrants are due entirely 
to the advantages held by cadre-entrepreneurs, who became self-employed in 
the period between 1987 and 1996. They earned 2.06 times (= e9$02:020) the 
income that the regular employed workers received in 1996, and the difference 
is statistically significant (p « .05). Hypothesis 3 is thus confirmed. 

Under different circumstances in rural areas, the distributional outcomes vary. 
As shown in Models 5a and 5b, self-employed workers enjoyed significantly 
higher income than did non-self-employed workers in 1996, by about 68 percent 
(p < .001), regardless of when they became self-employed. The interaction terms 
between cadre and entrant type are not statistically significant, suggesting that 
cadre-converted entrepreneurs are not doing particularly better than entrepreneurs 
from other social backgrounds. 

The different findings from urban and rural areas offer a clear case of how 
the labor mobility process, entrenched in institutional transitions, has shaped 
changes in the income distribution in China. Without taking into account the 
variant labor mobility processes that link institutional transition and income 
inequality, extrapolating institutional changes directly from the pattern of income 
distribution patterns among different groups is unwarranted. 


Summary and Conclusion 


Using an opportunity-mobility framework, this research examines the patterns of 
Chinese workers’ entry into self-employment in urban and rural areas and across 
reform stages and links them to the changing patterns of income distribution in 
China. With particular attention to the role played by communist cadres in the 
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Table 4: OLS Regression Coefficients for Logged Annual Household Income per Capita on 
Selected Variables in Urban and Rural China, 1996 


Urban Rural 
Variables Model 4a Model Ab Model 5a Model 5b 
Income per capita in 1986 (In) .268*** .268*** 434** A34*** 
(.051) (.051) (.033) (.033) 
Education: Primary school [omitted] 
J. high school .294*** 29171 .156* 406; 
(.049) (.048) (.059) (.059) 
S. high/vocational school PIS ek .409*** .408*** 
(.052) (.052) (.066) (.067) 
College or above .482*** ATIE 2 Sf 
(.053) (.053) (.155) (.152) 
Party member (yes = 1) 12 ES .128* .105 .106 
(.039) (.039) (.080) (.078) 
Cadre status (yes = 1) .069** .049 .207 .206 
(.032) (.031) (.110) (.108) 
Male -.070* -.072* -.109** -.110** 
(.032) (.028) (.034) (.034) 
Age*10 .062* .061* -.026 -.026 
(.024) (.024) (.018) (.019) 
Entrant Type (not self-employed [omitted]) 
Early entrants (1978-1986) .024 .018 "D te :516^* 
(.209) (.205) (.107) (.105) 
Later entrants (1987-1996) .157* .120 d? 152612 
(.062) (.067) (.081) (.080) 
Interaction 
Early entry*cadre -— .076 — -.071 
(.360) (.467) 
Late entry*cadre — .602* — .459 
(.255) (.307) 
Constant 6.147*** 0.169377 4.949*** 4.948*** 
(.338) (.334) (.265) (.265) 
R .170 .174 .302 .302 
N 2247 2247 2586 2586 


Notes: Data are restricted to those who entered the labor force in or before 1986; Figures in 
parentheses are standard errors adjusted for clustering on counties. Data are weighted. 
*p«.05 *p<.01  ***p«.001 (two-tailed tests). 


transition process, three hypotheses were confirmed by the data. Results show 
almost diametrically different patterns between urban and rural areas. While 
education deters urban workers' entry into self-employment, it promotes rural 
workers' entry; while party membership and cadre status significantly reduce 
workers' involvement in self-employment in urban areas, neither of them has a 
significant effect on such entry in rural areas (Hypothesis 1). 

There is clear evidence that China's transition trajectory has increased 
entrepreneurial opportunities for urban cadres. In urban areas, while cadres were 
less likely to become self-employed than were non-cadres, they are more likely 
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to do so over time (Hypothesis 2). In rural areas, cadres were among the first to 
seize entrepreneurial opportunities. Later, they seemed to have little advantage. 

The variant patterns of entry into self-employment yield diverse outcomes of 
income stratification. In urban areas, later entrants still enjoyed income advantages, 
whereas early entrants, mostly from the low tiers of the social hierarchy, were 
pushed aside as more qualified persons started entering into self-employment 
(Hypothesis 3). In rural areas, early entrants, mostly from advantaged groups, 
were able to maintain their income advantages. 

Variations in patterns of entry into self-employment within different contexts in 
China show how complicated the processes and social consequences of market 
transitions are. Parties involved in the market transition debate each can find some 
pieces of evidence out of the complicated picture to support their respective 
arguments. The transition scenario in rural areas described above is consistent 
with the market transition theory: human capital (education) increases the chance 
of entry into self-employment, whereas political attributes have no significant 
impact on access to lucrative market opportunities (Nee 1989a). Beyond the rural 
context, this is the hardly the case in urban areas. Whereas urban cadres are still 
less likely than non-cadres to become self-employed, they are increasingly more 
likely to do so in the later stage of reform, which is consistent with the ongoing 
process of “power conversion” suggested by Róna-Tas (1994). 

The controversy stems not from empirical evidence, but from the theoretical - 
framework scholars have adopted to interpret the facts. If results from rural areas 
that cadres had little or no net advantages in entering into private entrepreneurship 
(Nee 19892) reflect the effect of marketization, then the finding that urban cadres 
were even more disadvantaged in access to market opportunities becomes a 
puzzle in the same interpretative framework because redistributive power is 
supposed to be much more prevalent in urban areas. On the other hand, the 
observed income advantages of cadre-entrepreneurs may not necessarily result 
from the power they used to have, but they might indeed reflect the self-selection 
effects (Gerber 1999, Xie and Wu 2005). 

Hence, rather than assuming that institutions have intrinsic implications on 
stratification outcomes, this research emphasizes both structural changes in 
opportunities and individuals’ selective mobility as an intermediate process 
between the macro institutional transition and the changes in social stratification 
outcomes. The consequences of market transitions depend on how the expanding 
market sector alters the opportunity structures for various social actors and on 
their reactions to the changes in the labor market. Those who can better situate 
themselves in the changing structures will prevail, remaking themselves and 
regenerating a new social order. 

Two caveats are provided for the present analysis. First, self-employment is 
defined broadly, which may include very heterogeneous activities, particularly in 
the urban and rural contexts. More detailed specifications are needed to analyze 
the socioeconomic implications of mobility into self-employment. Second, some 
fundamental changes in China's economic transition since the mid-1990s are 
not covered by the data analyzed here. For example, many managers in the rural 
public sector now can legally convert themselves into private entrepreneurs (Li 
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and Rozelle 2003); urban cadres may illicitly strip state assets to set up their own 
enterprises (Walder 2003). Further reforms in state-owned enterprises have also 
led to mass layoffs and rising unemployment, pushing some workers into small 
self-employed businesses. Under these circumstances, self-employment may 
provide "a refuge from poverty," rather than "a road to riches" (Hanley 2000); 
the opportunity structure and mobility mechanism may change accordingly. 
Future research needs to tie the changing opportunity structure to some specific 
variables and to further test the opportunity-mobility framework proposed here. 


Notes 


E 


The private sector was elevated to a status equal to that of the state sector 
in the 15th National Congress of the Chinese Communist Party in 1997 and 
endorsed in the Constitution in 1999; protection of private property was in- 
corporated into the Constitution by a recent amendment in 2004 (Lu 2004). 


In the reform era, despite the fact that geographic mobility and employment 
changes became relatively easier, the social concomitants of the hukou sys- 
tem still persisted. For example, rural migrant workers in cities were still 
classified as "peasant-workers" and discriminated against in the urban labor 
markets (Wang et al. 2002). 


The group is dominated by the first category, "running a business," with only 
a few cases falling in the second and third categories. 


Here, urban vs. rural are defined based on the residence when the samples 
were selected. In the reform era, the place where one resides is likely dif- 
ferent from the place where the person is registered. According to the 1996 
survey data, 12.4 percent of the urban samples held rural registration status 
(e.g., rural migrants in cities), whereas 4.1 percent of the rural samples held 
urban registration status (e.g., school teachers working in rural areas). 


Cadre status is coded slightly differently for the urban and rural samples. 
In urban areas, those who reported their occupation as "middle-rank man- 
ager/administrator," "high-rank manager/administrator" or that their position 
at the rank of section chief (gu //) are coded as "cadres." Rural cadres include 
both township cadres and village cadres, positions that are directly reported 
by respondents (see also Walder 2002:238-39). 


A squared term is added to the model to capture the second-order effect, 
which is statistically insignificant. 


The household per capita income is computed based on the reported annual 
household income divided by the number of working family members. 


Provinces are included in the model as a set of dummies, which are the sig- 
nificant predictors of income. The coefficients for individual characteristics 
remain largely the same. To conserve space, estimated coefficients without 


province dummies are reported here. 
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9. In other words, the gender gap in per capita income narrowed over the de- 
cade. This finding deserves further research. 
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Heavy Drinking and Suicide in Russia 


William Alex Pridemore, Indiana University 


Abstract 

Russian levels of alcohol consumption and suicide are among the highest in the world. 
While observers have long suspected an association between the two, they were unable 
to investigate this hypothesis until recently due to past Soviet secrecy and thus a lack of 
data. This study took advantage of the newly available data during the post-Soviet era 
to examine the cross-sectional association between heavy drinking and suicide mortality 
in Russia. Aggregate mortality data for the Russian regions (n — 78) for the year 2000 
were used to measure heavy drinking and suicide rates. Government data were used to 
control for the regional economic situation and strength of social institutions. Ordinary 
Least Squares regression was employed to estimate the effect of a proxy for heavy drinking 
on overall and sex-specific age-adjusted suicide rates. The results showed a positive and 
significant association between the two, and the association held for overall, male and 
female rates. These results not only confirmed an association between heavy drinking and 
suicide in Russia, but when compared to findings from previous studies of other countries 
they led to the hypothesis that a nation's beverage preference may be as important as its 
wet/dry drinking culture in the sensitivity of its suicide rates to alcohol consumption. 


Rates of alcohol consumption and suicide in Russia were high before the 
dissolution of the Soviet Union, rose sharply during the early 1990s, and are 
currently among the highest in the world. Annual alcohol consumption in Russia is 
estimated to be nearly 15 liters per person (Nemtsov 2000, Treml 1997) compared 
to average rates of about 10 and 7 liters per person in the European Union and the 
United States, respectively (World Advertising Research Center 2002). The age- 
standardized suicide rate in Russia in 2000 of about 38 per 100,000 persons was 
second only to Lithuania and was two to three times higher than the European 
Union average and in the United States. Violent death, including suicide, was 
also one of the leading causes of the Russian mortality crisis during the 1990s, 
especially for males (Notzon et al. 1998, Shkolnikov and Meslé 1996). 

Rossow (1996) and Skog (1993) noted that alcohol has been largely ignored in 
studies of suicide rates. While this has begun to change, most studies of alcohol 
and suicide are still at the individual level, and most aggregate-level studies 
have been time-series analyses of Western (usually European) nations. Although 
time-series designs are powerful, Norstróm and Skog (2001) and Rossow (1995) 
pointed out that analyses of rates aggregated to the national level may conceal 
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important spatial variation within a country. Further, while the empirical literature 
is providing increasing evidence of an ecological association between alcohol and 
suicide rates, most explanations have been reductionist in nature (i.e., employing 
aggregate data but resorting to individual-level explanations). However, some 
countries with high levels of drinking do not exhibit high suicide rates, suggesting 
an important role for cultural and social context beyond that offered by individual- 
level factors (Norstróm 1995a). Recent findings led some scholars to assert that 
wet drinking cultures may be less susceptible to an association between alcohol 
and suicide than dry cultures (Norstróm 1995a, Ramstedt 2001). In contrast, the 
results from some studies lead others to argue that it may not be simply the 
level of consumption but instead heavy and frequent binge drinking (commonly 
defined in the alcohol literature as five drinks for men and four for women in one 
sitting), especially of distilled spirits (e.g., vodka, rum, whiskey and gin), that is 
most important in any association with suicide rates (Gruenewald, Ponicki and 
Mitchell 1995; Razvodovsky 2003; Rossow 2000). Both hypotheses are important 
for Russia given levels of binge drinking (Bobak et al. 1999) and the preference 
for distilled spirits (Nemtsov 2000). 


Previous Research 


Most aggregate-level studies of alcohol and suicide have used time-series 
designs and examined European nations. For example, Norstróm (1988) found 
a positive and significant association over time between alcohol and suicide in 
Norway (see also Rossow 1993) and Sweden, and Skog (1993) and Skog et al. 
(1995) found a similar relationship in Denmark and Portugal. In a comparative 
study, Norstróm (19958) revealed a stronger effect of alcohol on suicide rates in 
Sweden relative to France. He concluded that the influence of alcohol on suicide 
rates may be stronger in dry relative to wet drinking cultures since heavy drinkers 
in the former may be more likely to experience weakened family and community 
bonds because their behavior is viewed as less acceptable (see also Babor et al. 
1992, Norstróm 1988). This idea is consistent with the hypotheses concerning 
social integration and suicide posed by Durkheim ([1897]1966), who was 
among the first to examine systematically the association between drinking and 
suicide at the societal level. A study of 14 European nations by Ramstedt (2001) 
showed similar results for wet/dry drinking cultures, providing further support for 
Norstróm's hypothesis. The few aggregate-level studies in the United States also 
found a positive association over time (Caces and Harford 1998; Gruenewald, 
Ponicki and Mitchell 1995). 

Recent research has begun to examine alcohol and suicide in Eastern Europe š 
(Màkinen 2000). Skog and Elekes (1993) used Hungarian time-series data and 
found a positive association. Wasserman, Vàrnik and Eklund (1998) used Soviet- 
era data and found evidence of a positive association in several Soviet countries. 
Two further studies are key to the present analysis. First, in a time-series analysis 
of Belarus, Razvodovsky (2003, 2001) found a significant relationship between 
alcohol and suicide and, more importantly, that this relationship was stronger 
for consumption of distilled spirits relative to other types of alcohol such as beer 
and wine. Second, Nemtsov (2003a) found a significant association between 
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drinking and suicide in his time-series analysis of Russian data. In contrast to 
the conclusions about wet drinking cultures discussed above, both authors 
concluded that heavy and binge drinking of vodka were responsible for the high 
suicide rates and the changes in rates over time in Belarus and Russia. 

Cross-sectional studies can also be informative when examining the alcohol- 
suicide link at the aggregate level because most nations are not monolithic in their 
drinking habits and suicide rates. Within-nation variation of both consumption 
and suicide may be associated with the spatial distribution of social forces such 
as economic circumstances, social cohesion, and ethnic and religious groups. 
Yet there have been few cross-sectional studies that examine this association. 
The most famous was that by Durkheim more than a century ago. The few recent 
studies that contained a cross-sectional component (Norstróm 1995b: Swedish 
counties, Skog et a/. 1995: Portuguese regions, Rossow 1995: Norwegian 
municipal regions) have found support for an association, though effect sizes 
varied and there were sometimes differential effects for males and females. 
Given Russia's size and its widely varying levels of heavy drinking and suicide, a 
cross-sectional examination should provide valuable insights. 


Alcohol Consumption and Suicide Rates in Russia 


Repeated political and economic crises in 1990s Russia resulted in uncertainty 
and diminished hope for the future at a time when a paradigmatic shift of 
social, cultural, economic and political norms were creating anomic conditions 
(Durkheim [1897]1966). It is generally accepted that the social and individual 
stress resulting from the reforms were the main causes of increased demand 
for alcohol at this time (Leon and Shkolnikov 1998). This demand was met by 
several factors that increased supply. Gorbachev's anti-alcohol campaign in 
1985, for example, had created a large market for illegally produced alcohol. 
Further, when the state monopoly on legal alcohol was repealed in 1992, local 
and international companies quickly began to exploit the new markets (Jernigan 
1997, Reitan 2000). The increased supply, together with poor tax collection by 
the Russian government, meant that alcohol prices rose at a fraction of food and 
other staples. Leon and Shkolnikov (1998) argued that alcohol played a central 
role in the mortality crisis of the 1990s, Notzon et al. (1998) showed that about 
12 percent of the decline in life expectancy between 1990 and 1994 was directly 
due to alcohol-related mortality, and Walberg et al. (1998) found an association 
between regional levels of alcohol consumption and decline in life expectancy. 
The problems associated with heavy drinking in Russia, however, are not new 
to the transition (McKee 1999, White 1996). Three distinctive traits of Russian 
consumption have been the preference for distilled spirits, binge drinking and 
socio-cultural tolerance for heavy drinking and concomitant behavior. Vodka 
represents about 75 percent of the alcohol consumed in the country (Nemtsov 
2000, Trem! 1997), and survey data showed that about one-third of Russian males 
admitted to binge drinking vodka at least once per month (Bobak et al. 1999). 
Heavy episodic drinking and consumption of distilled spirits have been shown 
to be significantly related to suicide (Gruenewald, Ponicki and Mitchell 1995; 
Razvodovsky 2003) and other violent behavior (Norstrom 1998a, Pridemore 2002). 
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Further, acute intoxication may act as a disinhibitor (Norstróm 1995a) in people 
predisposed to suicide for other reasons, especially among^those facing the 
anomic conditions of massive social change and dire economic circumstances. 

Also important for this study is the cross-sectional variation of heavy drinking 
and suicide in Russia. This is an enormous nation that is far from homogeneous 
in levels of consumption (Bobak et al. 1999, Pridemore and Kim 20068). For 
example, regional age-standardized death rates per 100,000 persons due to 
alcohol poisoning (see discussion of this proxy below) in 2000 varied widely 
(regional mean = 28.8, s.d. — 17.5). Similarly, the choropleth map provided by 
Pridemore and Spivak (2003:142) demonstrated the wide range and geographic 
patterning of regional suicide rates in Russia, with rates generally increasing from 
south and west to north and east. Regional age-standardized suicide mortality 
rates in 2000 ranged from a low of 7 to a high of 93 per 100,000 in the Dagestan 
and Altai Republics, respectively (regional mean = 42.1, s.d. = 17.1). The mean 
regional suicide rate for males was 77 per 100,000, ranging from 11 in Dagestan 
to 159 in the Altai Republic. The task of the present study was to determine if 
the spatial distribution of suicide rates covaried with the distribution of heavy 
drinking, and if any relationship held for both men and women. 


Data and Method 


The unit of analysis in this study was the Russian region (N = 89). Data from 
nine of these regions (i.e., the "autonomous okrugs") were covered by the larger 
region within which each was embedded. The Chechen and Ingush Republics 
were discarded because of the ongoing war in and the perennially unreliable data 
from these contiguous regions. This left 78 cases for analysis. Unless otherwise 
noted, all data were from 2000. 

The dependent variables were the regional overall and sex-specific age-adjusted 
suicide rates. Given former Soviet secrecy surrounding these data, as well as 
large swings in mortality rates in Russia over the past 20 years, Russian mortality 
data have been closely scrutinized and several scholars have concluded that the 
Russian overall and violent mortality data are valid (Anderson, Katus and Silver 
1994; Anderson and Silver 1997; Leon et al. 1997; Wasserman and Varnik 1998). 
The Russian suicide data employed here were prepared for the author using data 
from the State Committee for Statistics (Russian Mortality Database 2003). 

Estimating heavy drinking in Russia is difficult. Commonly used areal measures 
of consumption such as production, retail sales and tax receipts are unreliable 
in Russia (Trem! 1997).' For example, illegally produced homemade alcohol 
(samogon) may account for nearly half the alcohol consumed in the country ` 
and the proportion varies from region to region. Sales and tax receipts are also 
suspect because (1) of sales of samogon and (2) businesses often keep two sets 
of books in order to avoid high taxes. Individual-level estimates of consumption 
have been undertaken via surveys such as the Russian Longitudinal Monitoring 
Survey (see Bobak, McKee, Rose and Marmot 1999), but this survey does not 
employ a large enough sample size from each of the Russian regions to produce 
a reliable estimate of consumption or heavy drinking at the regional level. 
Further, not only have the estimates of consumption from this survey been highly 
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criticized (Nemtsov 2003b), but research in Russia comparing biological markers 
of consumption with survey responses showed the latter to be highly unreliable 
(Laatikainen, Alho, Vartiainen, Jousilahti, Sillanaukee and Puska 2002). 

Another approach common to aggregate-level studies has been to use the 
death rate from liver cirrhosis as a proxy (Nemtsov 1998, Norstróm 1998b, Smart 
and Mann 1998). Russian registration habits, however, commonly classify deaths 
caused by the long-term effects of drinking in the more proximate "alcohol 
poisoning" category (Shkolnikov and Meslé 1996), which makes up about 80 
percent of all deaths directly related to alcohol in Russia (Blum and Monnier 
1988; Shkolnikov, Meslé and Vallin 1996; Treml 1997). Further, the combination of 
binge drinking vodka and the consumption of alcohol substitutes and of illegally 
produced alcohol of unregulated quality in Russia (see McKee et al. 2005) results 
in an abnormally high rate of deaths due to true poisoning in the country. These 
facts have made it common to employ the age-adjusted death rate due to alcohol 
poisoning as a proxy for heavy drinking in Russia (Chenet et al. 2001; Pridemore 
2002; Shkolnikov et al. 2002; Shkolnikov, McKee and Leon 2001), and this was 
the proxy used in this study. 

Sociological studies of suicide (Baller and Richardson 2002; Kowalski, Faupel 
and Starr 1987; Norstróm 1995) usually control for a common set of structural 
covariates based upon economic conditions and social cohesion. In this study, 
poverty, inequality and unemployment were used as measures of economic 
conditions. Measures of social cohesion included the strength of institutions 
such as the family, education and polity, as well as the relative size of the urban 
population. Poverty was measured as the proportion of the regional population 
living below the poverty line (2000 data unavailable, 1999 data used), inequality 
as the ratio of the income of the top 20 percent of wage earners to the bottom 20 
percent of wage earners, and unemployment as the proportion of the active labor 
force that was unemployed. Family strength was measured as the proportion of 
households with only one adult and at least one child under the age of 18 (2000 
data unavailable, data from the 1994 micro-census were employed?), education 
as the number of persons enrolled in college per 1,000 residents, and polity as 
the proportion of registered voters who voted in the 2000 Presidential election. 
Urbanism was measured as the proportion of the regional population living in 
cities with more than 100,000 residents. Each measure was obtained from the 
State Committee for Statistics (Goskomstat 2001), with the exception of the 
voting data, which were taken from Orttung (2000). Finally, a dummy variable 
indicating those regions located in the Northern Caucasus was included in the 
models to control for the significantly lower suicide rates in this area compared 
to the rest of the nation.? As is common with highly aggregated data, several of 
these variables (poverty, inequality, unemployment, urban) had positive skews 
and were therefore logged in order to approximate a more normal distribution.^ 

Ordinary Least Squares regression was employed to estimate the effect of 
regional levels of heavy drinking on regional suicide rates. Because Russian 
male suicide rates are much higher and vary more widely than females and past 
studies of other countries have shown differential effects of alcohol consumption 
on male and female suicide rates, models were estimated separately for overall 
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and sex-specific suicide rates. Regression diagnostics were used to test the 
sensitivity of the results. Scatterplots of the independent variables with the 
dependent variables showed no obvious departures from linearity; a scatterplot 
of the residuals with the predicted Y-values suggested that the assumption of 
homoscedasticity holds (and there were also no more residuals with scores 
. above 1.96 than expected in a sample of this size); similarly, a histogram of 
the residuals showed that their distributions did not depart substantially from 
normality; neither the correlations between the independent variables nor the 
variance inflation factors showed signs of multicollinearity, nor did the inferences 
drawn change when certain variables were excluded from the model alone and in 
concert with others; finally, leverage values and DFBETAS showed that extreme 
values did not appear to present a problem and that there were no outlying cases 
that had an undue influence on the regression line. 


Results 


The bottom two rows of Table 1 show the descriptive statistics for the 
dependent and independent variables. The regional mean for the overall suicide 
rate in 2000 was 42 per 100,000 persons. Table 1 also shows the correlation 
matrix. As expected, regional suicide rates were correlated with the proxy for 
heavy drinking, with overall, male, and female correlations of .48, .42 and .61, 
respectively. j 

Table 2 provides the results for model estimation when overall and sex-specific 
suicide rates were regressed on alcohol poisonings and control variables. 
Controlling for the Northern Caucasus added considerable explanatory power 
while not changing the inferences related to alcohol consumption. Therefore, the 
discussion here focuses on Models 2, 4 and 6 (overall, male, and female suicide 
rates, respectively). As expected, the proxy for heavy drinking was positively and 
significantly associated with regional suicide rates net of the other variables in 
the model (B = .26, p = .01 for overall rates). The proxy was also significantly 
associated with the cross-sectional variation of both male (B — .21, p — .04) and 
female (B = .43, p = .001) suicide rates. 

The unemployment rate was positively and significantly associated with 
regional overall (B = .35, p = .002), male (8 = .31, p = .006), and female (B 
= .33, p = .001) suicide rates.® Suicide rates were also significantly higher in 
regions with a larger rural population, which is consistent with prior research 
both in the West and in former Soviet nations (Rancans, Renberg and Jacobsson 
2001). None of the indicators of the strength of social institutions appeared to be 
related to the spatial distribution of Russian suicide rates, which may be partially ` 
explained by the findings of Kowalski, Faupel and Starr (1987), who found that 
measures of social structure may only be helpful in explaining suicide rates in 
metropolitan areas, not in rural settings. Levels of poverty were not associated 
with overall or female suicide rates. However, while the p-value (p = .077) did not 
allow for strong conclusions, the results did suggest that male suicide rates may 
be associated with poverty. Given the response of male mortality to the social, 
political and economic changes during the Russian transition, it may be that male 
suicide rates were more sensitive to socioeconomic change and its effects on 
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Table 2: OLS Estimates of Regional Suicide Rates Regressed on Alcohol Proxyand Control Variables 
A U U 


` 


Overall Male i Female 
Model1 Model2 Model3 Model4 Model5 Model 6 
b (B) b (B) b (B) b (B) b (B) b (B) 


Intercept 23.10 -69 3730 ¿33 16.42 7.31 

Alcohol ao" 20?" ga ge a Ko 
(40) (26) (.33) (21) (.54) (43) 
Poverty 9.79* 7565 19,74% 1504 1.79 1.37 
(22) (.17) (25) (.19) (.11) (.08) 
Inequality 2.60 4.67 3.15 6.64 1.57 2.26 
(.04) (.07) (.03) (.06) (.07) (.10) 

Unemployment 3471 18.07 — 257 2852” — 227 6.15% 
(.07) (.35) (.03) (.31) (.12) (.33) 
Singles 49 -45 121 -.70 06 SE 
(.06) (-.05) (.08) (-.05) (.02) (-.04) 
Education -.03 05 -46 01 05 07 
(-.02) (.04) (-.07) (.00) (11) (.16) 
Polity -49 -28 seri -.39 -25" -19* 
(-.13) (-.08) (-.12) (-.06) (-.19) (-.14) 

Urban 414* AA — -645 7.25 DBA 
(-.21) (-.23) (-.19) (-.21) (-.358) — (-.36) 

N. Caucasus 3,65 56,23 7.60% 
(-.55) (-.57) (-.38) 
Adj. R? 31 49 26 46 51 59 


PL O pO A) 

Notes: Parentheses contain standardized coefficients. Logarithmically transformed values were 
used for poverty, inequality, unemployment, and urban. Results for males in Models 3 and 4 and 
females in Models 5 and 6 used male- and female-specific alcohol proxy, respectively. 


social deregulation (Durkheim [1897]1966) instead of simply to poverty (Brainerd 
2001; Whitley, Gunnell, Dorling and Smith 1999). Finally, after controlling for other 
variables, suicide rates in the Northern Caucasus remained significantly lower 
than in the rest of Russia. Not only did this control add significant explanatory 
power, but several of the independent variables were confounded with regional 
location (see footnote 3 and the correlations with the Northern Caucasus dummy 
in Table 2), making its inclusion important. 


Discussion 


The results presented here show a positive and significant cross-sectional 
association between the proxy for heavy drinking and suicide mortality for overall, 
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male and female rates in Russia. These cross-sectional findings are generally 
consistent with the conclusions drawn from Nemtsov's (2003a) time-series 
analysis of alcohol and suicide in Russia.” 

While several studies of alcohol and suicide employing aggregate data have 
shown them to be associated, these same studies have provided conflicting 
results for men and women (Caces and Harford 1998). For example, Skog et al. 
(1995) and Rossow (1993) found stronger effects of drinking on male suicide rates 
relative to females. On the other hand, Rossow (1995) and Brainerd (2001) found 
that relative to males, female suicide mortality appeared to be more sensitive to 
levels of consumption. In his time-series analysis, Nemtsov's (2003a:166) results 
led him to conclude "that the alcohol-suicide relationship is nearly exclusively 
a male problem" in Russia. The findings presented here show that both male 
and female suicide rates were positively and significantly associated with heavy 
drinking. One possible explanation for the discrepancy between these and 
Nemtsov's results is methodological artifact. While there is considerable cross- 
sectional variation in female suicide mortality in Russia, there was little temporal 
variation in female rates over the past 40 years (Pridemore and Spivak 2003). 
Such a small amount of variation would make it difficult to find a significant 
association. A possible substantive explanation is that female suicide rates are 
sensitive not only to female but also to male drinking patterns. For example, 
alcohol use by the husband has been shown to have significant effects on wife 
abuse in Russia (Cubbins and Vannoy 2005), both of which may be related to 
female suicide rates. 

Several scholars have discussed the impact of socioeconomic change and 
the resulting stress on levels of male mortality in transitional Russia (Leon and 
Shkolnikov 1998, Walberg et al. 1998). Further, Pridemore and Kim (2006b) found 
that socioeconomic change was significantly related to the change in Russian 
homicide rates during the 1990s, and Brainerd (2001:1007) showed that male 
suicide rates were "highly sensitive" and female suicide rates "insensitive to the 
state of the macroeconomy” in her study of several former Soviet transitional 
countries. With recent evidence suggesting Russian males are more likely than 
females to drink heavily in the context of stress control (Koposov et al. 2002), 
research should be undertaken to examine the effects of socioeconomic change 
on suicide, both directly and indirectly via alcohol consumption. It should be 
noted, however, that the cross-sectional association between alcohol and suicide 
was apparent well before the dissolution of the Soviet Union, and thus is not a 
period effect due solely to the conditions of the Russian transition. 


Dry/Wet Drinking Cultures vs. Alcohol Preference 


In their summary of several studies of alcohol and mortality, Norstróm and Skog 
(2001:S16) concluded that “the aggregate link between alcohol and suicide...is 
quite marked in northern Europe, but weak or non-existent in south and central 
Europe." Norstróm (1995) contended that differences in sensitivity of suicide 
rates to alcohol consumption may be due to the level of integration of alcohol 
into the national culture. His main idea was that heavy drinking and accompanying 
behavior are more accepted in wet drinking cultures, while similar behavior in dry 
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cultures is more likely to be perceived as abnormal. Heavy drinkers in dry cultures 
are therefore more likely to be marginalized, to have weaker social and familial 
bonds, and thus to be at higher risk for suicide. This hypothesis has received 
general support from studies of several European countries, especially when 
comparing the strength of association between alcohol and suicide in the wet 
drinking cultures of southern and central Europe to the dry drinking cultures of 
northern Europe (Norstróm 1995, Ramstedt 2001, Skog et al. 1995). 

The positive association between alcohol and suicide in Russia found in this 
study appears to be at odds with Norstróm's hypothesis. Russia is a definitively 
wet drinking culture — not due simply to high levels of alcohol consumption, 
but to the long history of alcohol's integration into Russian culture and daily 
routines — with a high social tolerance for heavy drinking (White 1996). Further, 
the conclusions drawn from the present study coincide with those of Nemtsov 
(20038), who also pointed out the inconsistencies between his results and 
Norstróm's theory. An alternative hypothesis that is consistent with the present 
findings and in accord with the findings from the studies cited above, however, 
is the potential connection between beverage preference and suicide rates. 
Southern and central Europe are considered beer and wine countries, whereas 
distilled spirits are more prevalent in Scandinavia and Russia. This beverage- 
specific hypothesis is consistent with studies that have shown an association 
between spirits and suicide but not between overall or beer/wine consumption 
and suicide (Gruenewald et al. 1998, Razvodovsky 2003). This potential beverage- 
specific association with violence directed at the self is also a theme in a Norstróm 
(1998a) study of violence directed at others, and Pridemore (2002) argued that 
what (i.e., vodka) and how (i.e., binge) Russians drink may be responsible for the 
robust association between alcohol and homicide in the country. The results from 
Russia are also consistent with conclusions drawn by Mákelà (1996) specifically 
and more generally by Rossow's (2000) literature review, both of whom stated 
that the relationship between alcohol and suicide was stronger among those 
who drank more frequently and to a deeper intoxication (see also Norstróm and 
Skog 2001). 


Limitations 


There are a few main limitations that must be considered when interpreting 
these results. The first is the measurement of alcohol consumption in Russia. As 
discussed earlier, the different ways of creating area estimates ot consumption 
used elsewhere would result in invalid estimates in Russia. Nemtsov has in 
several publications created estimates of per capita consumption using various 
methods and has also examined BAC-positive suicides, but these estimates were 
either for the country as a whole or for a small number of regions, and thus were 
not available for all the regions employed in this analysis. In light of this situation, 
Shkolnikov, McKee and Leon (2001:917) stated that "changes in acute alcohol 
poisoning can be regarded as a good estimation of changes in the frequency of 
excessive drinking" in Russia, and it has become common to employ this cause- 
of-death category as a proxy for heavy drinking until valid estimates from more 
direct indicators can be created. 
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Another potential limitation concerns the Russian mortality data, which in the 
past were falsified by the Soviet government? and which have reported dramatic 
changes over the past 15 years. The validity of the Soviet and Russian overall 
and external cause mortality data has been examined closely. Anderson, Katus 
and Silver (1994), Anderson and Silver (1997), Leon et al. (1997), and Wasserman 
and Várnik (1998) all concluded that the data are valid, though subject to the 
same sources of error in measuring suicide as in other nations. However, during 
the 1990s the number of all violent deaths placed in the "violent death, cause 
unspecified" category (number 175 in the Soviet classification that was used 
before Russia switched to ICD-10 in 1999) increased disproportionately. Many of 
these were undoubtedly suicides (Pridemore 2003, Shkolnikov and Chervyakov 
2000), and this phenomenon deserves further research. 

Finally, it may be argued that the association between heavy drinking and 
suicide rates is spurious. The most plausible explanation in this situation is that 
heavy drinking and suicide are both caused by some other variable such as the 
anomic conditions of the Russian transition. 


Conclusion 


Nemtsov (2002:1424) revealed the "incredible rate of alcohol-related mortality in 
Russia" and the relationship over time between alcohol and suicide rates in the 
country (Nemtsov 20033). The results from the present study showed a positive 
cross-sectional association between heavy drinking and regional suicide rates 
after controlling for social structural factors. This association held for both males 
and females. 

This study is unique in several respects. First, Norstróm and Skog (2001) called 
for more country-specific analyses of the aggregate-level alcohol-suicide link. 
Similarly, though some have noted similarities in drinking and suicide trends or 
shown an association between drinking and overall and external cause mortality 
in Russia (Chenet et al. 1998, Gavrilova et al. 2000, Leon and Shkolnikov 1998, 
Nemtsov 1998, Reitan 2000), very few studies have directly tested the association 
between alcohol and suicide (Nemtsov 20032). This research is thus one of the 
few to study heavy drinking and suicide in Russia and is the first of its kind to 
examine their cross-sectional covariation. This is important given (1) past Soviet 
secrecy that restricted valid studies, (2) the very high rates of heavy drinking and 
suicide in the country, and (3) the considerable variation in drinking and suicide 
rates throughout the vast nation. 

A second unique contribution is the beverage-specific hypothesis that may 
tie together seemingly disparate findings. The results presented here appear to 
run counter to Norstróm's (1995a) notion that the alcohol-suicide association is 
weak or absent in wet drinking cultures, but are consistent with research that 
has shown that the type of alcohol consumed plays an important role. However, 
while the beverage-specific hypothesis is theoretically different, it is consistent 
with the empirical findings of Norstróm and others concerning wet/dry drinking 
cultures and work on the relationship between beverage-specific effects on 
suicide and other types of violence (Gruenewald et al. 1995, Nemtsov 2003a, 
Norstróm 1998a, Ramstedt 2001, Razvodovksy 2003). In other words, at least in 
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terms of the alcohol-suicide link, the beverage of choice may be just as important 
as the drinking culture itself. Of course one may reasonably argue that Russia is 
unique, and that this hypothesis is tenuous. Nevertheless, this evidence offers a 
new hypothesis to be tested, and this new hypothesis draws in large part from 
the results of studies of beverage-specific effects on violence and suicide in other 
nations such as the United States, Sweden and Belarus. Finally, these findings 
invite further study of the effects of heavy drinking on sex-specific rates of suicide 
mortality in Russia and on how drinking patterns in the country (e.g., binge 
drinking and the preference for distilled spirits), not just alcohol consumption in 
general, may influence suicide rates and other forms of violent mortality. 


Notes 


1. Although it is true that many of the general limitations of the Russian alcohol 
data on production, sales and taxes are also present in Western countries, in 
most cases the problems with the Russian data are more serious by several 
orders of magnitude. 


2. The most recent year for this measure was 1994 because of the micro-cen- 
sus carried out in Russia that year. Updated information for this measure will 
soon be available from the full Russian census undertaken in 2002, but these 
data would be inappropriate here since the dependent variable was mea- 
sured in 2000. To test the sensitivity of the model and of the alcohol-suicide 
link to this control, an alternative model was estimated using the divorce 
rate from 2000. It should be noted that, for reasons beyond the scope of this 
discussion, this measure is likely less reliable and presents problems of its 
own. Nevertheless, the inferences drawn relative to the association between 
heavy drinking and suicide remained the same when this alternative measure 
was employed. 


3. Notonly were suicide rates significantly lower in the Northern Caucasus, but 
other factors also make it important to employ this as a control. For example, 
a higher proportion of Muslims in these regions result in a different cultural 
context in the Northern Caucasus relative to the rest of Russia, not to men- 
tion religious differences and the Islamic strictures against suicide. Further, 
the intersection of these cultural factors with social institutions meant that 
several of the measures included here as controls were confounded with 
location in this area. Alcohol consumption was also lower in these regions 
and, perhaps more importantly for the new hypothesis provided later in this 
paper, wine products are more often consumed here than in the rest of the 
country, meaning that the preference for vodka is not as strong as elsewhere 
in Russia. Using a dummy variable for these regions should help to control 
for any unobserved heterogeneity due to these and other factors. 


4. Four of the regions had no cities with more than 100,000 people. Because 


the natural logarithm of 0 is undefined, 1 was added to each score before 
taking the log. 


5. See Caces and Harford (1998) for a discussion of the importance of control- 
ling for unemployment in aggregate studies of alcohol and suicide. 
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One important issue to point out here is the potential that age is confounding 
the association between heavy drinking and suicide. For example, the age 
distribution of overall, external, alcohol-related and suicide mortality show 
similarities, including marked peaks in middle-aged males. When models 
were re-estimated including a control for males in this age group, however, it 
was not significant and the inferences did not change for any of the other as- 
sociations (including the alcohol-suicide link). Similar results for heavy drink- 
ing, age and homicide in Russia have been reported elsewhere (see Table 2 
in Pridemore 2002). 


In that study, Nemtsov also reported that estimates of BAC-positive suicides 
in Russia were much higher than in other nations from which such estimates 
are available. 


Further, the quicker and deeper intoxication resulting from heavy consump- 
tion of spirits may act as disinhibitor against self-harm and/or a catalyst for 
(or intensifier of) emotional distress. Further, the ill effects on health and on 
the economic conditions of heavy spirits drinkers (especially in the face of 
the collapse of the social safety net during marketization) could serve to ex- 
acerbate any existing suicidal ideation in those already at risk. 


These vital statistics data were inaccessible until recently. For most of the 
Soviet era, mortality data were provided only to government statisticians for 
creating classified reports (Shkolnikov and Meslé 1996). From 1965 to 1987, 
suicides were among a group of ideologically sensitive causes of death that 
were extracted from the original statistical tables and put into a secret table 
called "5B" (Andreev et al. 1995). There was no pressure to falsify the initial 
cause of death decision, however, and thus the underlying data still con- 
tained valid estimates of homicide and suicide deaths. Nevertheless, this 
count remained classified and the deaths in Table 5B were publicly reported 
in the "other and unknown causes" category. This secrecy ended in the late 
1980s and today suicide data are released annually in various publications 
available from the Russian State Committee for Statistics and the Russian 
Ministry of Health. 
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Consensus and the Creation of Status Beliefs 


Cecilia L. Ridgeway, Stanford University 
Shelley J. Correll, Cornell University 


Abstract 

We develop a new status construction theory argument that apparently valid social 
realities in which a salient social difference is consistently linked to signs of status and 
competence induce participants to form status beliefs. Supporting this social validity 
account, an experiment showed that when an influence hierarchy developed between 
categorically different actors and appeared to be consensually accepted in the situation 
and therefore valid, participants formed strongly differentiated status beliefs about the 
categorical distinction. Yet when slight challenges to these influence hierarchies broke the 
validating consensus, participants’ status beliefs were significantly weaker and less clear. 
An implication is that acts of resistance can disrupt the emergence of new status beliefs, so 
that while some differences become axes of status inequality, others do not. 


How do apparently nominal social differences between people become status 
differences that create axes of inequality in socially accorded esteem? A society's 
status distinctions have social histories after all. Some status distinctions such as 
gender have ancient origins, but others such as educational attainment developed 
more recently. Yet others, like Irishness in the United States, have lost their status 
significance in recent years while new status distinctions have emerged. It may 
be, for example, that the "digital divide" between the computer literate and non- 
literate is currently being transformed into a status distinction. What processes 
transform some, but not all, categorical differences into status distinctions? What 
processes might interrupt this transformation so that a categorical difference does 
not become a status distinction? These are the questions this paper addresses. 

Status distinctions are important in a society because, as Weber (1968) pointed 
out, they act as axes along which inequality in social relations are organized. 
Status is a form of inequality based on differences in social esteem and respect 
that, in turn, yield influence. It has an interesting dual character. As Weber (1968) 
emphasized, status can be understood as an evaluative ranking between social 
groups in which one group (e.g., professionals) is held in the culture to be more 
socially significant and worthy of respect than is another (laborers). Yet status can 
also be seen as an evaluative hierarchy among individuals in which one person is 
more respected, deferred to and influential than another (Goffman 1967). 

Status between groups and status between individuals are linked by a society's 
widely shared status beliefs. A status belief associates greater social esteem and 
competence at the “things that count” in society with people in one category of 
a social difference than another (Berger et al. 1977). In an achievement-oriented 
society, the association of difference with competence is especially significant 
because it legitimates an inequality in esteem. 
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Bylinking differences in esteem and competence with widely recognized social 
differences such as occupation, race, gender or ethnicity, status beliefs define 
the evaluative standing of social groups in a society. Furthermore, as decades of 
research in the expectation states tradition has demonstrated, status hierarchies 
among individuals are largely organized by the way that people's distinguishing 
attributes evoke shared status beliefs about the social categories to which they 
belong (Berger et al.1977, Wagner and Berger 2002). Not surprisingly, most of 
the social distinctions that form significant axes around which inequality in social 
relations is organized in the United States, including occupation, education, 
race, gender, age and ethnicity, are associated in U.S. culture with widely 
shared status beliefs (see Fiske et al. 2002, Ridgeway 2001). The formation of 
widely shared status beliefs about categorical differences among people is key 
to the organization of status inequality whether it be among social groups or 
individuals. 

Status construction theory is an explanation of how status beliefs form and 
spread in a population (Ridgeway 1991, 2000; Troyer 2003; Webster and Hysom 
1998). While there are likely to be others, the theory proposes one set of social 
processes that it argues are sufficient to create shared status beliefs about a 
socially recognized, but not yet consensually evaluated categorical difference. 
The theory focuses on the local contexts of action in which people from different 
categories of a social difference (e.g., different ethnic groups) encounter one 
another. It argues that under certain social structural conditions, these local 
encounters create experiences for their participants that induce them to form 
status beliefs about their social differences. Furthermore, such local encounters 
also spread status beliefs when those who have previously acquired the belief 
“teach” it to others by treating them in accord with the belief. Thus local contexts 
in which people routinely encounter those who differ from them in some socially 
recognized way are, according to the theory, potent arenas in which status beliefs 
can be created, spread, interrupted or refreshed. 

Previous investigations of status construction theory have shown that people 
do indeed form status beliefs from their encounters with socially different others 
(Ridgeway et al. 1998, Ridgeway and Erickson 2000). While these studies show 
that status beliefs form under conditions specified by the theory, they do not fully 
explain how they form. The studies show that status beliefs form in encounters 
that juxtapose difference with situational influence and esteem, but they do not 
clarify why a mere situational juxtaposition causes people to form status beliefs 
about whole categories of people. The next important step, then, is to delineate 
the specific processes through which encounters with different others induce 
their participants to form status beliefs about their difference because, until 
we fully understand how status beliefs are induced, we cannot know how the 
formation of new status beliefs is sometimes interrupted or prevented. Not all 
social differences are transformed into status differences. Why not? What might 
block the formation of status beliefs in encounters with different others and 
prevent them from spreading and becoming axes of inequality? These questions 
are important for understanding how new status beliefs form from everyday 
experience, thereby creating new forms of categorical inequality. 


r 
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Delineating how encounters with different others create status beliefs requires 
us to explain how such encounters induce their participants to take on status 
beliefs not only when they flatter their own social category, but also when the 
beliefs disadvantage their own category. After all, if people only accepted status 
beliefs that favor their own category, there is no set of status beliefs that could 
become widely held in society. 

Inthis paper, we extend status construction theory to develop and experimentally 
test an account of the processes through which encounters between people who 
differ on a salient but not yet consensually evaluated social difference induce them 
to form status beliefs about that difference. In particular, we consider how the 
appearance of consensus about key aspects of a situation creates an apparently 
valid social reality that facilitates the formation of shared beliefs that "most 
people" would view people in one category of the difference as more respected 
and competent than people in another category of that difference. We begin with 
a more detailed discussion of the nature of status beliefs. 


Status Beliefs 


Status beliefs have three distinctive qualities that are useful to keep in mind when 
considering how they might be created. First, status beliefs differ from the simple 
in-group favoritism that is the typical reaction to the recognition of social difference 
(Brewer and Brown 1998). When a status belief forms, both those advantaged 
by the status belief (i.e., the high status group) and those disadvantaged by it 
accept, as a matter of social reality, that the advantaged group is more respected 
and seen as generally more competent than the disadvantaged group. Thus, 
status beliefs transform competing in-group favoritisms into a consensual belief 
that, whether individuals like it or not, people in one group are accorded more 
social respect than are those in another. 

Second, when people form a status belief, they form a generalization, not just 
about specific individuals that they may have encountered, but about the status 
worthiness and competence of whole categories of people who share a salient 
distinguishing characteristic with those individuals. In this sense, status beliefs 
are stereotypes and indeed, when widely shared status beliefs develop, they 
form part of the cultural stereotypes of the categorical groups involved and add a 
distinctive evaluative component to those stereotypes (Fiske et al. 2002). 

Third and importantly, as beliefs about which group is more respected in society, 
status beliefs are a type of social reputation. For this reason, status beliefs are 
necessarily beliefs about what “most people” do or would think about the status 
worthiness and competence of one categorical group compared to another. They 
are assumptions about the beliefs or perspective of the “generalized other” in 
society. We define beliefs about what “most people” think as “third order” beliefs 
to distinguish them from what other status researchers have labeled “first order” 
beliefs (what an actor personally thinks) and “second order” beliefs (what an actor 
thinks specific others in the situation think) (Troyer and Younts 1997, Webster and 
Whitmeyer 1999). 

People who hold status beliefs as third order beliefs often personally endorse 
them as first order beliefs as well, although this is not always the case. Studies of 
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the "false consensus" effect (Marks and Miller 1987) show that people frequently 
do not distinguish between what they personally think and what they think most 
people think both because perceptions of the views of most shape personal 
beliefs, and personal beliefs bias inferences about others' views. Personal 
endorsement of a status belief may be most likely when the belief is flattering to 
a person's own group but occurs as well when the belief is relatively unflattering 
(see Eagly, Wood and Diekman 2000). 

Status beliefs’ character as third order beliefs about what "most people" 
believe is what gives them force in social relations. If | assume that most people 
share a status belief, then | expect that they will act in accord with that belief 
themselves and that they will judge me according to it. As a consequence, | 
must take that belief into account in shaping my own behavior whether or not | 
personally endorse the belief. In this way, the presumption that status beliefs are 
the views of most shapes peoples' mutual expectations for behavior in ways that 
tend to become self-fulfilling, as expectation states research has shown (Wagner 
and Berger 2002). 

To function as a status belief with significant implications for social relations, 
then, the belief must have the character of a third order assumption about what 
"most" believe. It is possible, of course, for people to assume that "most people" 
esteem one group more than another when few at a personal level actually do. 
While this situation of "pluralistic ignorance" is likely to be fragile, it hints at the 
processes through which status beliefs might develop in the first place. Status 
beliefs are likely to be most stable in a society, however, when they are not only 
nearly consensually shared as third order beliefs but also endorsed as first order 
beliefs by a majority of the population. 


Status Construction Theory 


Status construction theory has developed empirically supported propositions 
about the development and spread of status beliefs in a population (Ridgeway 
and Erickson 2000). The theory's basic argument is straightforward (Ridgeway 
1991, 2000). When people from different categories of a recognized social 
difference (e.g., computer literacy) must regularly cooperate to achieve what 
they want or need, hierarchies of influence and esteem are likely to develop in 
these intercategory encounters just as they do in virtually all cooperative, goal- 
oriented encounters (Ridgeway 2001). Such interpersonal hierarchies develop 
implicitly through multiple small reactions that the participants rarely scrutinize. 
Because the actual origins of their influence hierarchy is obscure to them but 
their categorical difference is salient, there is some likelihood that participants will 
associate their apparent difference in worthiness and competence in the situation 
with their categorical difference. The repeated association between social 
difference categories and influence and esteem in such encounters eventually 
induces participants to form generalized status beliefs about the social difference. 
People carry these status beliefs to subsequent encounters with individuals from 
the other category and, by acting on the beliefs, induce some of these others to 
take on the status beliefs as well, creating a diffusion process. 
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Whether the status beliefs individuals form in such local encounters have the 
potential to diffuse broadly in society and become widely shared or dissipate 
in a confusion of conflicting beliefs depends on the structural conditions that 
shape the terms on which people from different categories encounter one 
another. These conditions also determine which category of the social difference 
is cast as higher status. According to the theory, if people from one category are 
structurally advantaged in some way (i.e., by material resources or technology) 
that provides them with a systematic advantage in gaining influence over those 
from another category in intercategory encounters, then status beliefs favoring 
the advantaged group are likely to overwhelm counter beliefs and spread to 
become widely held beliefs (see Ridgeway and Erickson 2000 for a more detailed 
explanation). 

Computer simulations of the diffusion processes have shown that if people 
form and spread status beliefs in local intercategory encounters as the theory 
argues, then nearly consensual status beliefs would indeed be a logical outcome 
under many social structural conditions (Ridgeway and Balkwell 1997). Bolstered 
by this logical support for the theory, empirical investigation of it has centered 
on experiments that examine whether local encounters actually do induce their 
participants to develop and spread status beliefs. These experiments confirm 
that people form status beliefs about a previously unevaluated categorical 
difference from repeated encounters in which the local influence hierarchies 
consistently corresponded to the participants’ categorical difference (Ridgeway 
and Erickson 2000). They have also shown that when the influence hierarchies in 
these encounters were biased by a factor like material resources, the beliefs that 
people formed favored the resource advantaged category as predicted (Ridgeway 
et al. 1998). Further experiments have demonstrated that people spread status 
beliefs by treating someone in a situation according to the belief. Both those who 
are directly treated in accord with the status belief and participants who witness 
such treatment acquire the status belief from such experiences (Ridgeway and 
Erickson 2000). 

Finally, applications of the theory to ongoing social problems suggest 
its explanatory utility. Webster and Hysom (1998) analyze the formation of 
status beliefs about sexual orientation with it. Ridgeway (1997) shows how 
status construction processes not only create, but also recreate and maintain 
status beliefs, contributing to the persistence of gender inequality over major 
transformations in society. 

The empirical studies of status construction theory support its claim that 
encounters with socially different others can induce the participants to form 
status beliefs about their difference (Ridgeway et al. 1998, Ridgeway and 
Erickson 2000). Yet these studies do not reveal how such encounters induce 
people to form status beliefs other than by offering participants a simple 
juxtaposition of standing in local influence hierarchies and social difference. 
What would cause people to form generalized beliefs about whole categories of 
social actors from these localized, individual encounters? It is particularly unclear 
how such encounters motivate participants to take on a generalized status belief 
that disadvantages the participants” own group, as must occur for shared status 
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beliefs to emerge. As we noted earlier, the question of how encounters induce 
status beliefs is important not just for itself, but also because, if we know the 
necessary conditions for the formation of status beliefs, we learn something 
about the processes that can disrupt the formation of status beliefs so that some 
differences do not become status distinctions and axes of social inequality while 
others do. 


Social Validity and the Formation of Status Beliefs 


Consider the kind of situation in which status construction theory argues that 
status beliefs are likely to develop. Categorically different actors, say As and Bs, 
work together on a mutual goal and an influence hierarchy arises between an A 
and B. The formation of this hierarchy creates a shared reality for the participants 
in which a person from one category, say an A, is more active, influential and 
apparently more competent while a person from the other category, a B, is 
cast into the role of reacting to the A. The hierarchy develops implicitly through 
multiple small behaviors so that its origins are obscure for the participants while 
at the same time their social difference is salient. Consequently, they simply 
find it revealed to them, through the unfolding of events, that the A is more 
proactive, influential and competent, and the B is more reactive and apparently 
less competent. : 
Since both As and Bs participate in these events, the pattern that they enact 
appears consensual among them. As a result, the social validity of what the 
pattern reveals about the A and B becomes difficult for the participants to deny. 
When subsequent encounters repeat the same pattern, the association between 
influence and competence and the A/B difference is even clearer and the apparent 
consensual support for this pattern seems even broader, strengthening its 
validity. Eventually, even the Bs who participate in these encounters are forced to 
concede, as matter of social reality, that “most people” would rate As in general 
as higher status and more competent than Bs. In this way, the participants form 
status beliefs that are (1) generalizations about As and Bs rather than specific 
individuals, (2) third order presumptions about what most people think, and (3) 
shared, as third order beliefs, by those they disadvantage as well as advantage. 
Based on this analysis, we argue that local contexts such as intercategory 
encounters create status beliefs by presenting their participants with an apparently 
valid social reality in which a categorical distinction is consistently linked to Signs 
of status and competence such as proactive influence versus reactive acceptance 
of influence. The concept of social validity comes from Weber (1968, pp. 31- 
33) and refers to the collective dimension of legitimacy that is captured by the 
sense that others will accept something in a situation, whether or not the actor 
approves of it. If others will accept a pattern that links the distinction to esteem 
and competence, then the actor has good reason to form a third-order and often 
a first-order status belief about that distinction too, if for no other reason than 
to anticipate and manage the behavior and judgments of others. It is as though 
the appearance of social validity establishes for the actor a socially accepted 
bond of relevance between the categorical distinction and markers of status and 
competence that allows the status implications of those markers to "Spread" 
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to that distinction (Thye 2000). The spread of status from the behavior of 
individuals to their distinguishing categorical difference facilitates the formation 
of a generalized status belief about entire categories of actors who share that 
difference. This, then, is the power of local contexts to create status beliefs. It is 
the power to create apparently valid social realities for participants that pressure 
them to accept that "most people" would rate one group as more status worthy 
and competent than another. 

A social pattern will seem socially valid to actors if it is either supported by 
legitimate authority (e.g., an expert) or appears to be consensually accepted 
by others in the situation, according to legitimacy research (see Zelditch 2001). 
In the everyday encounters that are the focus of status construction theory 
authoritative backing for an association between categorical difference and status 
and competence is not typically present. Instead, social validity arises from the 
appearance of consensus among the actors in the situation about an enacted 
pattern in which members of one category of the difference display higher status 
and competence behaviors in relation to members of the other category, creating 
an influence hierarchy between them. The apparent consensual acceptance of the 
association between difference and status and competence in the /oca/ situation 
in effect "bootstraps" the formation of status beliefs by causing participants to 
presume that others outside the local setting would also accept the association, 
creating a third order belief. Note that this is not just an Asch-style conformity 
process through which participants accept a belief that is already held and 
espoused a local majority. Instead, the appearance of consensus in the situation 
causes participants to similarly form from their shared reality a new belief that 
none of them held previously. 

If validity turns on apparent consensus in intercategory encounters, then 
that suggests a way to test our social validity account of the formation of 
status beliefs. The validity account implies that the broader the appearance 
of consensus in encounters about a link between an unevaluated categorical 
difference and markers of status and competence, the more likely participants are 
to form strongly differentiated status beliefs about that difference. This proposition 
extends status construction theory by offering a specific account of the belief 
formation process in place of the theory's earlier, informal argument about "mere 
juxtaposition." (Ridgeway 2000) It is a clear implication of the proposition and the 
validity argument behind it that challenges to the local consensus supporting an 
association between a categorical distinction and status and competence will 
undermine the formation of clearly differentiated status beliefs. Specifically, we 
hypothesize the following: 


Participants will form strongly differentiated status beliefs 
from encounters in which an enacted association between a 
categorical difference and markers of status and competence 
is consensually accepted. In otherwise similar encounters in 
which such an enacted association is challenged, however, 
participants will form weaker, less differentiated status 
beliefs about the social difference. 
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If this hypothesis is supported, it will clarify not only how encounters create 
status beliefs, but also how that process might be interrupted so that a new 
status distinction does not emerge. 


Methods 


Design Overview 


In the experiment female and male subjects were each categorized by a two- 
category nominal distinction and then took part in two same-sex, task-oriented 
encounters, each time as a member of a different four-person team. Teams 
consisted of one subject and three teammates. In each encounter a clear 
influence hierarchy developed between two of the subjects’ categorically 
different teammates. The third teammate, actually a confederate, supported the 
influence hierarchy in the supportive, consensus conditions. In the nonsupportive, 
challenging conditions, the third teammate challenged the influence hierarchy 
between the two categorically different teammates by making non-supportive 
comments about decision making in the hierarchy. 

The 2 x 2 x 2 experimental design crossed the position in the influence 
hierarchies (high/low) of the subject’s own category on the nominal distinction 
with consensual support or challenge of these influence hierarchies and with 
the sex composition (all male/all female) of the teams. Eighteen to 20 paid- 
undergraduate volunteers were randomly assigned to each condition within sex 
for a total of 157 (80 female and 77 male) subjects. 


Procedures and Experimental Manipulations 


A Categorical Distinction 

Subjects were seated in separate rooms to avoid confounding effects of visible 
status differences. They first completed a task designed to create a salient 
categorical distinction among teammates. Adapted from social identity studies 
of “mere difference,” this task requires subjects to choose between Klee and 
Kandinsky reproductions (Tajfel 1978). As in previous studies, subjects were told 
that this “test” of “personal response style” differentiates between two types of 
people, S2s and Q2s, whose very different response styles were stable aspects 
of their selves (Ridgeway et al. 1998, Ridgeway and Erickson 2000). Supposedly, 
there are roughly equal numbers of S2s and Q2s in the world. 

Subjects were given apparent test results for themselves and their teammates 
and copied this information onto a form. Whether the subject was an S2 or Q2 
was counterbalanced within conditions. Subjects found that their own response 
style was the same as one of their three teammates but different from the other 
two. Although subjects participated in two rounds of the study with different 
teammates, the response style composition of the teams (2 S2s, 2 Q2s) was 
identical for each round. 


Associated Influence Hierarchies | 
Subjects were then told that they would work with their teammates on a 
computerized 10 trial “meaning insight” task that required the team to correctly 
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associate English words with one of three words supposedly from a reconstructed 
language. To study diversity and decision making under different communication 
procedures, subjects heard, teammates had been randomly assigned different 
duties. Each trial would begin with all four team members privately choosing an 
initial answer. Two of the subjects' teammates, the "discussants," were assigned 
the role of then publicly announcing and discussing their initial choices for the 
benefit of the team via an audio link. A third teammate would then comment, 
also via the audio link, on the ideas exchanged by the discussants. As the 
team evaluator, the subject's job was to listen and consider the discussion and 
comments. All four members would then enter their final choices into their 
computer. Subjects were told that the team members’ final choices on each trial 
would be combined and scored as a team for the number who chose the correct 
alternative. Members of the best scoring team would split a $200 bonus. 

Verbal interaction between the team discussants, who always differed on 
the S2/Q2 response style distinction, served to create a consistent association 
between that categorical distinction and an influence hierarchy on both rounds. 
Although presented as live, these interactions were actually audiotapes of 
interactions of a subject and a confederate from previous status construction 
experiments (Ridgeway and Erickson 2000). Subjects heard on these tapes a 
person who was categorically different from them (the taped confederate from 
the previous experiment, henceforth the confederate discussant) interact with a 
person labeled as similar to them (the taped subject from the. previous experiment, 
henceforth the target discussant) in either a confident, assertive, non-deferential 
manner or in a hesitant, deferential manner. These styles of interaction have 
been shown to be associated with perceptions of higher or lower status and 
competence (Carli, LaFleur and Loeber 1995). 

The taped interactions enacted clear influence hierarchies. The confederate 
discussants announced a different initial choice than the target discussants on 
8 of the 10 trials in round one and 7 of the 10 trials in round two. Confederate 
influence was the percentage of these disagreement trials on which the target 
discussants changed their choices to agree with the confederate discussants 
during the discussion with higher percentages indicating more confederate 
influence. When the confederate discussants acted in a confident, non-deferential 
manner, confederate influence was 93 percent on the male tapes and 87 percent 
on the female tapes. When the confederate discussants acted deferentially, 
confederate influence on the male and the female tapes was 33 percent. 

Responses to postround questionnaires confirmed that subjects clearly 
perceived these influence hierarchies. Subjects rated the discussants on a 
nine-point scale from "not at all" to "very" influential. Using similar nine-point 
scales subjects also rated the discussants on the usefulness of their ideas, their 
contribution to high quality decisions, and skill at the task. The later three scales 
were averaged to provide measures of the discussants’ perceived task skill. 
We subtracted ratings of the confederate discussant from ratings of the target 
discussant to yield perceived differences between them in influence and task skill. 
As expected, positive mean differences showed that perceived influence favored 
the target discussants in conditions in which the confederate was deferential 
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(round 1: m = 3.03; round 2: m = 2.18). Negative mean differences indicated that 
perceived influence favored the confederate discussants in conditions in which 
the confederate was assertive and non-deferential (round 1: m = -2.18; round 2: 
m = -3.51), resulting in a significant difference between these conditions (round 
1: F = 129.49, p < .0001; round 2: F = 131.42, p < 0001). As also expected, 
perceptions of the discussants’ relative task skill corresponded to their place 
in the influence hierarchy. Positive mean differences between the discussants' 
rated skill showed that the target discussant was perceived as more competent 
than the confederate discussant when the confederate was deferential and 
low influence (round 1:m = 1.01; round 2: m = .76). The target discussant was 
perceived as less competent than the confederate in conditions in which the 
confederate was nondeferential and influential (round 1:m = -.75; round 2: m = 
-.54) again producing a significant difference between conditions (round 1: F = 
70.32, p.« .001; round 2: F.— 26.91, p «-.001). 

Thus, conditions in which the confederate discussant acted deferentially 
portrayed an association between the subjects' own response style category 
(represented by the target discussant) and behavioral markers of superior 
influence, status and competence in the situation. Conditions in which the 
confederate discussant acted non-deferentially enacted an association between 
the subjects' category and /nferior influence, status and competence. Henceforth, 
we refer to these conditions respectively as own group high influence and own, 
group low influence to indicate the position of the subject's own categorical 
group in the influence hierarchies they experienced. 


Challenges to Consensus 

The use of taped interactions to create the association between the categorical 
distinction and influence hierarchies ensured that subjects in comparable 
conditions were exposed to precisely the same degree of influence differences 
between categorically different teammates. Our theoretical argument and 
hypothesis turn on the degree of consensual support for these matched 
influence hierarchies. To manipulate consensual support for the hierarchies, 
the voice of a commentator teammate, actually a second confederate following 
a script, was edited into the tapes after each trial's discussion. In consensus 
supporting conditions, the commentator expressed general support for the 
influence hierarchy and its decision-making: “Good team work on this one,” “| 
agree with their choice,” “Their reasoning sounds convincing.” In non-supportive, 
challenging conditions, the commentator questioned the influential discussant's 
arguments or asked to hear more from the less influential discussant: “| don't 
know if | agree with Member £2's reasoning,” “Member #3 seems to have 
some good ideas. | wish she would say more." We refer to these responses as 
commentator supportive or commentator non-supportive. 

To minimize extraneous effects other than support or nonsupport for the 
hierarchy, commentators never mentioned their own initial choices or offered 
any persuasive arguments. Their simple statements of support or nonsupport for 
the hierarchy were neutral in tone, never more than two sentences in length, and 
balanced for length across supportive and non-supportive conditions. The brevity 
and neutrality of the commentators' remarks in contrast to the relatively lengthy, 
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apparently consensual exchanges between discussants provides for a stringent 
test of our hypothesis that breaks in consensual support for the hierarchy will 
significantly weaken status beliefs. Confirming the success of the support/ 
nonsupport manipulation, when asked on a postexperiment questionnaire 
whether commentators generally supported the arguments of discussants, all 
subjects in supportive conditions answered yes and 72 of 77 in non-supportive 
conditions answered no or not entirely. 

In order to avoid problematic minority/majority effects, teams were composed 
of two S2s and two O2s. To test our theory, it was also necessary for the two 
discussants to differ on the S2/Q2 distinction. These constraints forced the 
commentator's category to differ from that of the subject. Consequently, the 
commentator represented a low influence group in own group high conditions 
and a high influence group in own group low conditions which might give the 
commentator lesser impact in own group high conditions. Thus, it is a strong test 
of the validity argument to show that the commentator weakens status beliefs 
even in own group high conditions. 


Status Beliefs and Other Measures 


Status beliefs about the categorical distinction were measured by seven-point 
semantic differential items on the post-experiment questionnaire. Subjects 
indicated how "most people" would rate the typical S2 and, on another page, 
the typical Q2 on status and power items (respected/not respected, powerful/ 
powerless, high status/low status, leader/follower), and on competence items 
(competent/incompetent, knowledgeable/unknowledgeable, capable/incapable). 
Scores were summed and averaged to create scales measuring the perceived 
status and competence of the typical S2 and O2, with high scores indicating 
greater status and competence. We then reclassified these scales from ratings of 
S2s and Q2s to ratings of subjects’ own categorical group and the other group. 

Although the above scales constituted our primary measure of status beliefs, 
additional measures tapped related perceptions. Using the same semantic 
differential items, subjects indicated how they personally would rate the status 
and competence of typical S2s and Q2s. After both team rounds, subjects also 
indicated who would be more likely to be: (1) in a position of greater responsibility 
in the university, (2) give the keynote address at a professional conference, (3) 
receive early promotion at a major corporation, and be (4) chosen as foreman of 
a jury for a set of choice categories including an S2 and a Q2. The percentage 
of the questions on which subjects chose someone from their own response 
style category provided a measure of in-group favoritism. Finally, additional 
semantic differential items (considerate/inconsiderate, likable/unlikable, pleasant/ 
unpleasant, cooperative/uncooperative) tapped how subjects thought most 
people and they personally would rate the social considerateness of typical S2s 
and Q2s. All scales were reliable (mean alpha = .86). 


Results 


Our hypothesis predicts that subjects in the supportive, consensus conditions 
will form clearly differentiated status beliefs favoring the category of the S2/Q2 
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distinction that was consistently influential in their encounters. In comparable 
non-supportive conditions, however, subjects' status beliefs about the distinction 
should be significantly weaker and less differentiated. We tested this hypothesis 
with a three-way — own group influence (high/low) by commentator response 
(supportive/non-supportive) by team gender — analyses of variance of the status 
belief and related measures. We expected to see main effects for own group 
influence, indicating that subjects’ status ratings were significantly shaped by the 
nature of the association they experienced between the categorical distinction 
and influence in the encounters. However, we also expected that this main effect 
would be conditioned by an interaction effect between own group influence and 
commentator response, suggesting that the commentators’ supportive or non- 
supportive remarks altered the formation of status beliefs. 


Status Beliefs 


In-group Favoritism 

For status beliefs to form, subjects who saw their own category consistently 
associated with low influence must overcome the in-group favoritism that is the 
typical response to “mere difference” and concede that the other categorical 
group is seen as higher status than their own. As the means for subjects’ 
preference for their own group show, this is just what happened in low influence 
supportive, consensus conditions (left side of Table 1). When unchallenged 
influence hierarchies consistently cast their own response style group as low 
influence, subjects overall showed only a 23 percent preference for their own 
group, meaning that they actually preferred the other group for a high status 
position most of the time. In contrast, subjects in high influence supportive 
conditions preferred their own group 89 percent of the time, producing a 
powerful main effect for own group high/low influence in an analysis of variance 
(F 2 195.13, p « .0001). 

When hierarchies linked to the categorical distinction were challenged by a non- 
supportive commentator, however, differences between subjects' preferences 
for their own group in high and low influence conditions were substantially 
reduced (right side of Table 1), resulting in a strong interaction effect between 
own group influence and support/nonsupport (F — 39.62, p « .001). Although 
both were significantly affected (p « .01), challenges to consensus had much 
stronger effects on subjects who were under pressure to form status beliefs 
that disadvantaged their own categorical group compared to those likely to be 
advantaged. In these own group low conditions, subjects' self interests as well 
as the commentators' association with a high influence category may have made 
challenges to consensus especially effective. 

Subjects whose category was associated with low influence in non-supportive 
conditions rejected the clear preferences for the other group shown by their 
peers in supportive conditions and expressed instead choices near the 50 percent 
point. An in-group favoritism rating of 50 percent indicates that subjects see no 
difference in the likelihood of either group, their own or the other, being in a high 
status position. In contrast, subjects whose category was associated with high 
influence in non-supportive conditions significantly moderated their own-group 
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Table 1: Mean Percent Preference for One's Own Categorical Group 


Experimental Condition 


Supportive Non-supportive 
High Influence Low Influence High Influence Low Influence 

Subjects 

Male 90 (14) 28 (21) 71 (21) 57 (21) 

Female 87 (13) 18 (19) 82 (22) 47 (26) 
BO ke Le E TO (22). 25115 23) 
Notes: 
Standard Deviations in Parentheses. 
(N = 157) 


Main effect of own group influence (High or Low) significant at p < .000 in analysis of variance. 
Interaction effect of own group influence (High or Low) X commentator (Supportive or Non- 
supportive) significant at p < .000 in analysis of variance. 

Interaction effect of own group influence (High or Low) X gender significant at p < .05. 


preference to 76 percent from 89 percent in equivalent supportive conditions. 
(Gender differences were not significant). By this measure of in-group favoritism, 
then, the defining structure of status beliefs, in which both groups agree that one 
group is higher status, developed under consensual conditions but collapsed 
when support for the hierarchy differentiating the categorical groups was 
challenged. These results clearly support our hypothesis. 


Primary Measures of Status Beliefs 

Our hypothesis depends most crucially, however, on subjects’ estimates of how 
“most people” would rate the status and competence of the typical member 
of each categorical group. Table 2 shows the means for these variables by 
commentator response (supportive/non-supportive) and own group influence 
(high/low) pooled across team gender. We did not expect that gender would 
impact the processes by which status beliefs form and the data bear this out. 
There were very few gender effects in these or any other of our analyses and 
none that affected tests of our hypotheses. 

Logically, the first question must be, did strongly differentiated status beliefs 
form in the supportive consensus conditions? The means on the left side of 
Table 2 show that the answer is clearly yes. Subjects in supportive high influence 
conditions thought most people would rate their own categorical group as 
substantially higher status (simple main effect of high/low influence in supportive 
conditions, F = 145.09, p < .0001) and more competent (F = 55.76, p < .001) 
than did subjects in comparable low influence conditions. Subjects in these high 
influence conditions also thought the other group would be seen as considerably 
lower in status (F = 167.06, p < .0001) and less competent (F = 95.90, p < .0001) 
than did those in equivalent low influence conditions. 

The clearest confirmation that strong status beliefs formed in supportive 
conditions comes from examining the differences between how subjects 
estimated that most people would rate their own and the other categorical group 
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Table 2: Mean “Most People's” Evaluations of Own and Other's Categorical Group 


Experimental Condition 
Supportive Non-supportive 

High Influence Low Influence High Influence Low Influence 
Status 
Own Group 5.96 (.64) 3.38 (.91) 5.34 (1.00) 4.25 (1.24) 
**++$ 
Other Group 3.28 (.83) 5.86 (.68) 3.88 (.91) 5,261212) 
We 


Difference 2.68 (1.18) -2.49 (1.28) 146 (1.57) -1.11 (1.80) 
**++ 

Competence 

Own Group 5.85 (.70) 4.27 (.96) 5.09 (.89) 4.93 (1.20) 
445 


Other Group 3.94 (.70) 5.82 (.70) 4.73 (.84) STAT PAD 
**++$ 


Difference 1.91 (84)  -155 (1.16) 37 — (118) -18 (1.49) 
**L+ 

Considerate 

Own Group 45251416) 5.85 (.80) 4.37 (.94) 5.60 (.96) 


Other Group 592 (71 376 (116 571 (86) 396 (1.08) 
ES 
Difference 140 (127) 2.09 (1.60) -1.35 (135 164 (171) 


** 


Notes: Standard Deviations in Parentheses. 

N=157 

Combined results for male and female subjects (teams). 

** Main effect of own group influence (High or Low) significant at p < .000 in analyses of variance. 
++ Interaction effect of own group influence (High or Low) X commentator (Supportive or 
Nonsupportive) significant at p < .000 in analyses of variance. 

$ Main effect of team gender significant at p « .05. 


(Table 2). For those in high influence conditions, these difference scores are 
significantly positive, indicating that subjects thought their own group would be 
rated as higher status (t-test for difference from zero = 14.37, p < .001) and 
more competent (t = 14.36, p < .001) than the other group. Strikingly, however, 
such difference scores for those in low influence conditions are consistently and - 
significantly negative, showing that they thought their own group would be seen 
by most as clearly lower status (t = -12.26, p < .001) and less competent (t 
= -8.44, p < .001). When the influence hierarchies differentiating categorically 
different actors went unchallenged, then, subjects formed strongly differentiated 
status beliefs favoring the categorical group that had a higher position in the 
influence hierarchies. They did so even when these status beliefs cast their own 
group as lower status. 

The critical question, however, is whether challenges to the apparent 
consensus supporting these influence hierarchies caused the status beliefs 
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subjects formed about the categorical distinction to be significantly weaker and 
less differentiated. As an inspection of the means on the right side of Table 2 
shows, the challenging/non-supportive commentator noticeably reduced the 
differences between subjects in high influence and low influence conditions in 
the status and competence they thought most would attribute to their own group 
and the other group. When differences between those in high influence and low 
influence conditions are reduced, status beliefs are less differentiated and weaker. 
The weakening effect of the non-supportive commentator on status beliefs was 
reflected in highly significant interaction effects between support/nonsupport 
and high/low influence in analyses of variance (for own group status, F — 24.04, 
p < .001; other group status, F = 15.09, p < .001; own group competence, F = 
21.79, p < .001; other group competence, F = 30.05, p < .001). 

As with the formation of status beliefs under consensus, the weakening of status 
beliefs when consensus is challenged is best seen by examining the differences 
between how subjects judged that most people would rate the subjects’ own 
group and the other group in terms of status and competence (Table 2). Figure 
1 displays these difference scores and clearly shows the predicted interaction 
effect in which status (F = 31.55, p < .001) and competence (F = 58.26, p < 
.001) differences are both significantly smaller in non-supportive conditions, 
consistent with our hypothesis. 

While significantly smaller, status and competence differences in the eyes 
of “most people” retained the basic form of status beliefs in non-supportive 
conditions. Perhaps because the taped interaction between discussants displayed 
strong influence hierarchies enacted over about two minutes per trial, while the 
commentator’s remarks were limited to a few seconds per trial, status difference 
scores in high influence conditions still significantly favored the subjects’ own 
group (t-test of difference from zero = 5.77, p < .001) and those in low influence 
conditions favored the other group (t = -3.79, p < .001). 

As Figure 1 shows, however, this was not the case for competence difference 
scores, which collapsed, in non-supportive, challenging conditions to near the 
zero point. Note that a difference of zero would indicate that the challenge 
to consensus had completely blocked the formation of status beliefs. In non- 
supportive conditions, competence difference scores did not fully differ from 
zero by conventional standards of significance either for those in high influence 
conditions (t = 1.94, p = .060) or low influence conditions (t = -.75, p = .461). 
As with the preference for own group measure, the collapse of competence 
difference scores was most striking for those in disadvantaging, low influence 
conditions. These results suggest that the commentators’ brief challenges to 
the consensus supporting influence differences between categorically different 
discussants not only weakened status beliefs but essentially destroyed their key 
legitimating component: presumed competence differences. Together, these 
results offer substantial support for our hypothesis. 


Related Beliefs 
Social Considerateness 


Although status beliefs are defined by group differences in status and competence, 
some have argued that such beliefs also typically carry compensatory beliefs that 
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Figure 1. Difference Between Own and Other Group, Most People's Evaluation 
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associate lower status groups with lesser valued but positive qualities such as 
greater warmth and expressiveness (Conway, Pizzamiglio and Mount 1996; Fiske 
et al. 2002). Such compensatory beliefs may arise partly because members of 
low status groups are cast into a reactive, agreeing role in relation to agentic, 
influential members of the high status group (Conway et al. 1996). 

Subjects in this study (see Table 2) did indeed develop beliefs that, in the 
eyes of most people, the higher status, more competent categorical group was 
not as socially considerate as the lower status, less competent group. Similar to 
previous status construction experiments (Ridgeway et al. 1998, Ridgeway and 
Erickson 2000), subjects in low and high influence conditions differed significantly 
in their assessment of how most people would rate the considerateness of the 
subject's own group (F — 67.37, p « .001), the other group (F = 171.08, p < 
.0001), and the difference between own and other group in considerateness 
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(F = 188.78, p < .0001). Interestingly, as Figure 1 shows, the commentators’ 
challenges to consensual support for the hierarchies did not significantly reduce 
compensatory beliefs about considerateness. Perhaps this occurred because the 
commentator did not challenge consensual views about the social behavior of 
the discussants. 


Personal Beliefs 


Although the distinctive nature of status beliefs is best measured by how 
subjects think groups are perceived by most people, it is nevertheless important 
to examine how subjects themselves perceived the status, competence, and 
social considerateness of their own and the other group. As in previous studies, 
personal and “most people” ratings were highly correlated (average r = .71), 
suggesting that “first order” beliefs were generally quite similar to “third order” 


Table 3: Mean Personal Evaluations of Own and Others’ Categorical Group 


Experimental Condition 


Supportive Non-supportive 
High Influence Low Influence X High Influence Low Influence 
Status 
Own Group 5.78 (.85) 371. — (1:10) 551.7 41:04) 3.95 (1.06) 


Other Group 2.98 (.75) 5.58 (.68) 9:92 4714:09) 5.00 (.94) 
**++ 

Difference 281 (141)  Á-187 (1.44) 2.19 be KÉ en «105: 241.41) 
SEA 

Competence 

Own Group 5213 (.97) 4.78 (1.20) 503 0108) 5.03 a5 161.09) 
*+$ a 

Other Group 3.91 (.87) 5.07 (1.04) 4.4 (1.24) 436 (1.17) 
Réi 

Difference 1:632 (1:29) -28 (141) 298. (1.172) 65 (145) 
*++ 

Considerate 

Own Group 5.01 (1.29) 6.04 (.80) 4.61 (1.09) 5.89 (.92) 
GE 

Other Group 6.03 (.92) 3250 dba 5.95 (.73) 323 (1.34) 


Difference -1.03 (1.51) 269. (VDA) 21:33 (1.16) 2b0/. (93) 


** 
A es ee ee 
Notes: Standard Deviations in Parentheses. 

N= 157 

a. Combined results for male and female subjects (teams). 

** or * Main effect of own group influence (High or Low) significant at p « .001 (**) or p «.05 (*) in 
analyses of variance. 

++ or + Interaction effect of own group influence (High or Low) X commentator (Supportive or 
Non-supportive) significant at p < .001 (++) or p « .05 (+) in analyses of variance. 

$ Main effect of team gender significant at p « .05. 
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estimates of most people's views (correlations available on request). While the 
correlations were significant (p « .001) in both the supportive ahd non-supportive 
conditions, they were significantly weaker (p « .05) in the non-supportive 
conditions for some measures, suggesting that a break in consensus about the 
influence hierarchy also encouraged subjects to distinguish their own views from 
the perspective of "most people. 

The structure of personal beliefs is most clearly examined in terms of the 
differences between subjects' ratings of their own and the other group (Table 
3 and Figure 2). Subjects in high influence conditions rated their own group as 
higher status, more competent and less considerate than the other group. Those 
in low influence conditions personally believed their own group to be lower 
status, slightly less competent and much more considerate than the other group, 
producing strong main effects for high/low influence in analyses of variance (for 
status difference, F = 278.08, p < .0001; competence difference, F = 10.36, p = 
.002; considerate difference, F — 248.88, p « .0001). 

Similar to most people's beliefs, personal beliefs about the status and 
competence, but not the considerateness, of subjects own group compared 
to the other group were also significantly weakened by the non-supportive 
commentators' challenges (for interaction effects between high/low influence 
and support/nonsupport, F = 9.37, p = .003 for status differences; F = 23.19, 
p < .001 for competence difference; F = .40, n.s., for considerate difference). 
Once again, the commentators’ challenges most dramatically eroded subjects’ 
ratings of the competence differences between groups. After hearing the 
non-supportive commentator, subjects in low influence conditions refused to 
concede that their own group was less competent than the other. As a result, 
there were no significant differences between subjects in low influence and 
high influence conditions in the relative competence they ascribed to their own 
compared to the other group (simple main effect for high/low influence in non- 
supportive conditions, F = 1.25. p = .265). At a personal level as well, then, the 
commentator's challenges substantially undercut the structure of status beliefs 
by weakening acceptance of status differences and eliminating acceptance of 
competence differences. 


Conclusion 


Fora simple difference between people to be transformed into a status distinction 
that forms an axis of inequality, status beliefs about the difference must develop 
and spread through the population. In an empirically supported account of how . 
this occurs, status construction theory argues that local contexts in which people 
from different categories routinely encounter one another are potent arenas in 
which status beliefs can be both bred and spread (Ridgeway 1991, Ridgeway et 
al. 1998, Ridgeway and Erickson 2000). Until now, however, the specific process 
through which these encounters actually induce their participants to form status 
beliefs about their categorical difference has been unclear. As a result, it has 
been equally unclear how the process is sometimes interrupted, so that some 
differences become status distinctions while others do not. 
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Figure 2. Difference Between Own and Other Group, Personal Evaluation 
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To address these questions, we developed and tested the theoretical claim 
that local contexts such as inter-category encounters create status beliefs by 
presenting their participants with an apparently valid social reality in which a 
categorical distinction is consistently linked to signs of status and competence 
like the assertion versus acceptance of influence. This theoretical claim 
extends status construction theory by offering a specific account of the belief 
formation process in place of the theory's earlier, informal argument about "mere 
juxtaposition." Evidence indicates that a social pattern such as a link between 
a categorical distinction and status and competence will seem socially valid to 
actors if it appears to be consensually accepted by those present (see Zelditch 
2001). Therefore, our social validity argument predicts that when repeated 
encounters present a consistent correspondence between positions in influence 
hierarchies and a categorical distinction, and this correspondence appears to be 
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accepted or unchallenged by those involved, participants will form clear status 
beliefs about the difference. However, when someone in the situation challenges 
that correspondence, the formation of status beliefs will be undermined. 

The results of the experiment clearly supported this hypothesis and with 
it, the social validity argument from which it was derived. As expected, after 
two encounters in which a categorical distinction was consistently associated 
with behavioral markers of status and competence and this association went 
unchallenged, subjects formed strongly differentiated status beliefs about the 
distinction. They formed beliefs that most people would rate the typical member 
of the categorical group that was more influential in the encounters as higher 
status and more competent, but less considerate, than the typical member of 
the less influential group. 

These beliefs had the distinctive, "third order" nature of a social reputation (what 
"most" think) that is characteristic of status beliefs, but were largely endorsed as 
personal, "first order" beliefs as well. Note, too, that these beliefs were not just 
about subjects' teammates but instead were generalizations about whole groups 
of people delineated by the categorical distinction. Subjects formed such status 
beliefs even when the beliefs cast their own categorical group as lower status. 

The theoretical importance of our results, however, turns on the prediction 
that challenges to the consensual acceptance of the influence hierarchy 
between categorically different actors in encounters will significantly undermine 
the formation of status beliefs. Our most important results showed that after 
two encounters identical to those above, except that a teammate briefly and 
neutrally questioned the influence hierarchies between the categorically different 
discussants, status beliefs were significantly weaker. Subjects in these non- 
supportive conditions perceived significantly smaller differences between the 
categorical groups in terms of their likelihood of being in a high status position, 
their assessments of how most people would rate the groups' status and 
competence, and their personal ratings of the groups' status and competence. 

Although challenges to consensus weakened both status and competence 
assessments, beliefs about competence differences between the groups were 
so undermined as to be nearly eliminated. The beliefs of subjects who were under 
pressure to view their own group as lower status were particularly disrupted. The 
greater impact of challenges on these subjects may have partly resulted from 
the challengers' association with the high influence category in these conditions. 
In an especially strong test of the validity argument, however, even those who 
would be advantaged by a status belief bowed significantly to the pressure of the 
shared local reality in which the underpinnings of the association between status 
and competence and the categorical distinction were challenged. 

status beliefs require presumptions about competence differences to 
legitimate status inequality between groups. Status beliefs also require that both 
the advantaged and disadvantaged groups accept, as a matter of social reality, 
that most people perceive such competence differences between the groups. 
That neither fully occurred in conditions in which the link between influence 
hierarchies and the categorical distinction was challenged suggests that, as the 


r 
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validity argument predicts, the complete structure of clearly differentiated and 
shared status beliefs did not emerge. 

This study suggests that while some shared local realities can cause their 
participants to infer broad social support for a link between categorical difference 
and status and competence (i.e., form status beliefs), other shared local realities 
in which this link is challenged have the power to interrupt the formation of new 
status beliefs by undermining their apparent validity. Perhaps the most important 
implication of our results is that they show that resistance counts. Note that 
in our experiment, resistance did not merely affect the resistor (who was a 
confederate), but rather reduced the formation of status beliefs by participants 
who observed this resistance, apparently by changing the construction of the 
shared local reality. Acts of resistance in everyday encounters, these results 
suggest, alter the nature and power of what those contexts teach and, in so 
doing, can interrupt the formation and spread of new status beliefs. 

This point returns us to a key question. If local contexts have the power to foster 
status beliefs about social difference, why is it that not all differences become 
status differences? The answer, by this analysis, is that local contexts have such 
power only under conditions of social validity and social validity can be disrupted. 
The social structural conditions (e.g., group differences in material resources or 
technology) that frame the local contexts in which people from different social 
categories encounter one another become important here. The same structural 
conditions that advantage people from one group in gaining influence over those 
from another group in local contexts may also sometimes constrain the ability of 
those in the disadvantaged group to safely display resistance to that influence. It 
is under these conditions, when structural conditions not only advantage people 
in one group but constrain those in another, that the group difference is most 
likely to be transformed into a status distinction that forms a significant axis of 
inequality. 
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Status on Trial: 
Social Characteristics and Influence in the Jury Room 


Erin York, University of Chicago 
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Abstract 

The American jury is heralded as an institution that is simultaneously representative and 
egalitarian. However, jury studies conducted 50 years ago found that white, upper-class 
men dominate jury deliberations, presumably due to their higher status outside of the jury 
room. Logistic regression analysis of dyadic influence inside the jury room updates this 
research. Results indicate that today upper-class jurors alone — not men, not whites — are 
regarded as most influential in deliberations. Upper-class jurors’ influence is not simply a 
product of status deference. Rather, upper-class jurors seem to influence deliberations due 
to generalized expectations of their competence or their possession of skill sets that enhance 
jury room performance. We conclude that increased statistical representation in the jury 
pool does not guarantee that diverse views will affect verdicts. 


Fifty years ago, research from the Chicago Jury Project demonstrated that 
upper-class men do most of the talking in mock jury deliberations (James 1959; 
Strodtbeck, James and Hawkins 1957; Strodtbeck and Mann 1956). This research 
indicated that recognizable, external status characteristics, such as gender and 
social class, can restructure otherwise undifferentiated small groups. However, 
women and minorities were severely underrepresented in juries and mock jury 
studies of the 1950s, and times have changed. The Jury Selection and Service 
Act of 1968' mandated that jury pools include all eligible voters, and subsequent 
jury selection reforms have increased jury diversity. Are the more diverse juries of 
today actually able to achieve an inclusive and egalitarian interaction, or do jurors 
from historically privileged status groups still dominate deliberations? 

From outsideits doors, the jury room appears to be oneof the mostrepresentative 
and egalitarian spaces in American society. The historical development of the 
jury was guided by the assumption that diverse groups could set aside external 
differences to arrive at a shared verdict. In fact, the earliest English mixed juries 
of the 12th through the 15th centuries decided disputes between persons from 
two communities by bringing together a mix of jurors, such as foreigners and 
citizens, Jews and Christians, and local and foreign merchants (Constable 1994). 
And, in the past several decades, many discriminated groups have argued for 
and won representation in the jury pool on the grounds that jury composition is 
integral to the achievement of justice.” 
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In addition to the historical importance of jury diversity, formal norms of jury 
procedure aim to minimize role differentiation inside the jury room. Unanimous 
verdicts require that all jurors must concur - one juror in disagreement can overturn 
all the others. Juror orientation materials encourage inclusive deliberations with 
instructions such as, "No juror should dominate the discussion. No juror should 
remain quiet and leave the speaking to others. Everyone should participate.”* 
Compared with stratified deliberations, the egalitarian deliberations promoted 
by these procedural norms are more likely to focus on story construction, rather 
than bargaining, and ultimately result in more accurate verdicts (Hastie, Penrod 
and Pennington 1983). 

The jury room, in its idealized form, constitutes a protected space where 
diverse individuals can gather and deliberate as equals. However, sociological 
research suggests that, in small heterogeneous groups, processes such as status 
deference and status generalization may allow members of historically-dominant 
groups to steer discussions.^ Thus, the combination of a diverse jury and the 
expectation of an egalitarian deliberation poses a paradox. A representative jury 
is assumed to draw on the group's diverse perspectives, while overlooking the 
external status differentials that make the group diverse. 

We revisit the Chicago Jury Project's early research to consider whether 
race, class and gender still structure influence in the jury room. We use surveys 
completed by jurors and operationalize influence as a dyadic, or juror-to-juror, 
process. We examine whether status deference and status generalization, key 
processes of status translation in small groups, allow some jurors to have more 
influence than others. If influence in jury deliberations is determined by external 
status roles, then the potential value of jury diversity is not fully achieved. 


Status Processes in Small Groups 


Sociologists have long been preoccupied with the role of status — achieved or 
ascribed, subjective or objective, total or segmental, consistent or inconsistent 
- in structuring social interactions. As Linton (1936:202) noted, "it is extremely 
hard for us to maintain a distinction in our thinking between statuses and the 
people who hold them and who exercise the rights and duties which constitute 
them." Indeed, we are hard pressed to find a status vacuum; there seems to be 
no situation in which status does not, in some way, affect interactions. 


Status Deference 


The concept of status deference originates in the close — almost inextricable 
- linkages between individuals, their statuses and their concomitant rights and 
duties (Shils 1968). Status itself is both constituted by and reflected in interactional 
gestures at the dyadic level. Such acts can be appreciative or derogative. High 
deference reflects one's positive assessment of another, while low deference 
indicates one's negative assessment of another. Shils emphasizes the close 
connections between status and the roles or actions that reflect status by 
suggesting that one's overall status can be described as a "deference-position." 
(107) Taken as a whole, then, status differences are most obvious when we 
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consider the acts of deference that reflect, on the dyadic level, evaluative 
judgments about positions in society. 

Deference-entitling characteristics are derived from one's closeness to, or 
distance from, social centers of power, values, norms and legitimacy. As indicators 
of proximity to societal centers, occupational position and educational attainment 
loom large. According to Shils, ascribed characteristics, such as ethnicity and 
gender, afford deference only because they are frequently concordant with 
occupational and educational distinctions. However, subsequent research 
emphasizes a broader range of characteristics potentially relevant to status 
hierarchy formation. Race, gender, age, ethnicity, religion and even height have 
been found to generate status differentials (Balkwell and Berger 1996; Berger, 
Cohen and Zelditch 1972; Katz and Cohen 1962; Kirchmeyer 1993; Marcus, 
Lyons and Guyton 2000; Meeker and O'Neill 1977; Ridgeway and Smith-Lovin 
1999; Webster and Whitmeyer 2001; and Ziller and Exline 1958). 


Status Generalization 


While Shils (1968) theorized about status deference, Berger, Cohen and Zelditch 
(1966,1972) detailed a broader process of status generalization: 


When a task-oriented group is differentiated with respect 
to some external status characteristic, this status difference 
determines the observable power and prestige within the 
group whether or not the external status characteristic is 
related to the group task (1966:243). 


In other words, social characteristics are laden with expectations about 
generalized competency (Berger, Cohen and Zelditch 1972; Skvoretz and Fararo 
1996). Culturally-derived status distinctions are transposed to valuations and 
behavioral outcomes in small task groups (Berger, Fisek, Norman and Zelditch 
1977; Webster and Foschi 1988). Here, influence in groups flows to persons 
in historically advantaged status categories because others simply assume that 
they are more competent. 

To elaborate this process, status generalization begins when one individual, 
i, observes the status characteristic(s) of another, /. Je observable status 
characteristics lead / to form expectations about /'s general abilities, and these 
expectations are manifest in behavioral outcomes. As a result, individuals in a small 
group are likely to assign low-status positions within the group to persons whose 
characteristics are negatively-evaluated in the larger society. Conversely, they 
assign high-status positions within the group to persons whose characteristics 
are positively viewed in the larger society (Webster and Foschi 1988). 

While status deference relies on recognized status differentials, particularly at 
the dyadic level, the process of status generalization is rooted in the conscious 
or unconscious transposition of external status to determine power and prestige 
orders in small group interactions. Both theoretical approaches have contributed 
to the development of an extensive body of research noting the profound effects 
of status expectations and cultural schemas. As long as status distinctions are 
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recognizable and culturally or socially meaningful, they can betransposed to order 
power and prestige hierarchies within any otherwise undifferentiated group. 

Evidence of the processes of status deference and generalization surfaces in 
many contexts. Previous research has elucidated various behavioral outcomes 
of status differentials and has identified a multitude of settings in which external 
status structures otherwise egalitarian interactions. Status differentials are 
manifest in small group settings as disparate opportunities to perform, rates of 
performance outputs, performance evaluations, influence (Berger, Cohen and 
Zelditch 1972), collective validation (Kalkhoff 2005), leadership roles and respect 
or esteem (Webster and Driskell 1978). Micro-level analyses have indicated that 
social status is predictive of conversational variations such as turn-taking (Smith- 
Lovin and Brody 1989) and topic-changing (Okamoto and Smith-Lovin 2001). 
These behavioral outcomes, stemming from external status differentials, have 
been observed in workplaces (Caudill 1958, Katz and Cohen 1962), summer 
camps (Sherif, White and Harvey 1955), and classrooms (Cohen and Roper 1972), 
as well as innumerable experimental groups and mock juries (Strodtbeck, James 
and Hawkins 1958). 


Status in the Jury Room 


Theories of status hierarchy formation are informative in attempts to understand 
jury room influence. Research examining the effects of social status in the jury 
room long precedes this paper. As early as 1953, the University of Chicago Jury 
Project assembled mock juries to observe whether class, race and gender affect 
participation in deliberations (James 1959; Strodtbeck, James and Hawkins 1957; 
Strodtbeck and Mann 1956). More recent research has considered the effects of 
status on specific features of deliberations, such as turn-taking (Manzo 1996), 
discussion topics (Hastie, Penrod and Pennington 1983), coalition formation 
(Hawkins 1961) and juror narratives (Manzo 1993). 

As in the Chicago Jury Project research, recent studies of power or influence 
in jury deliberations focus on differentials in juror participation (Kirchmeyer 1993; 
Nemeth, Endicott and Wachtler 1976). This approach posits that participation in 
a small group discussion correlates with influence over the group decision or 
product (Strodtbeck, James and Hawkins 1957). Past jury studies reveal surprising 
variation in levels of participation — on average, three jurors account for more than 
half of the total speaking time in a deliberation (James 1959; Strodtbeck, James 
and Hawkins 1957). Who are the most participatory jurors? 

First, early research using mock juries suggests that men speak in deliberations 
more frequently than women, regardless of occupation or race (James 1959; 
Strodtbeck, James and Hawkins 1957; Strodtbeck and Mann 1955), and more 
recent mock jury research echoes this finding (Kirchmeyer 1993; Nemeth, Endicott 
and Wachtler 1976). In fact, one mock jury study suggests that males initiate 
about 40 percent more comments than females (Hastie, Penrod and Pennington 
1983). These findings parallel social psychological research, which has shown 
that gender differences in mixed-sex groups shape performance expectations, 
conversational norms, and behavioral outcomes (Balkwell and Berger 1996, 
Meeker and O'Neill 1977, Ridgeway and Smith-Lovin 1999). 


, 


Status on Trial * 459 


Second, previous jury research indicates that individuals with higher 
occupational statuses and higher levels of education participate more than their 
lower status counterparts (Hastie, Penrod and Pennington 1983; Hawkins 1961; 
James 1959). In mock jury experiments, proprietors and clerical workers spoke 
more frequently than skilled and unskilled laborers (Strodtbeck, James and 
Hawkins 1957). Income levels combine with both educational attainment and 
occupation in their effects on participation in deliberations (Hastie, Penrod and 
Pennington 1983). 

Unfortunately, previous research has not adequately addressed jury deliberation 
participation by race, as past jury studies have severely underrepresented minority 
subjects. However, small group studies indicate that members of minority 
races are often relegated to positions of low status in task groups (Asante and 
Davis 1985). Compared with their white counterparts, minorities receive fewer 
opportunities to participate (Elsass and Graves 1997), and ultimately contribute 
less to decision-making (Kirchmeyer 1993). An experiment using college-age men 
found that whites dominated small groups even after numerous interventions 
designed to increase black members’ assertiveness (Katz and Cohen 1962). 

In sum, a multitude of studies provide strong evidence that status distinctions 
may affect participation in the jury room despite ideals of unanimity and diversity. 
But much of the research on social status in the jury room is quite dated, and 
particular issues, such as the effects of race and whether participation can be 
equated with influence, have not yet been adequately explored. Many of the 
mock jury studies were conducted prior to or concurrent with the Jury Selection 
and Service Act of 1968 and subsequent rulings which increased diversity in 
jury pools. The potential effects of status deference and status generalization 
are intimately linked to social norms, and cultural and social changes of the past 
generation may have altered the effects of social status in the jury room. 


Data 


While there are many benefits of mock jury research - the researcher can 
observe the deliberations and manipulate details of the case with different 
juries, for example — it has several notable shortcomings. Because mock jurors 
are aware that they are not deliberating toward a decision that will have real-life 
consequences, they do not experience the same motivations as actual jurors 
(Kessler 1975). Even in the best of scenarios, mock jury decisions do not exactly 
mirror actual jury decisions. A real jury and a mock jury of the same size may sit in 
a criminal courtroom and listen to the same testimony, but the real jury is simply 
less likely to convict (Diamond and Zeisel 1974:291). If verdict outcomes differ, 
we cannot infer that deliberation interactions of mock juries accurately reflect 
those of real juries. 

Nevertheless, the observation of actual juries is nearly impossible. In an effort 
to validate their mock jury studies, researchers from the Chicago Jury Project 
recorded six jury deliberations in 1954. Although they had gained permission from 
the judges and lawyers involved, the researchers were accused of jury tampering 
and subpoenaed to testify in a congressional investigation. They were ordered 
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to destroy the data, and resulting legislation largely closed the jury room door to 
social scientific researchers. à 

In the absence of actual jury deliberations, post-verdict juror surveys provide a 
way to gauge jurors' perceptions of the deliberations (Marcus, Lyons and Guyton 
2000). Theoretically, one's recollection of influence in an interaction may more 
accurately represent his or her impressions than an actual coding or observation 
of the interaction (Thomas and Thomas 1928). An anonymous paper and pencil 
survey provides a forum in which an individual may express status-based 
discrimination that might be more suppressed in a face-to-face context. Post- 
verdict recollections depart from observation of the actual interaction in that they 
are filtered through respondents' perceptions of social norms, but we suspect 
that these same perceptions also affect the jury deliberation. 

On the other hand, an individual may be more conscious of his or her behavior 
when completing a survey than interacting in face-to-face deliberations. In this 
case, he or she might suppress status-based discrimination when completing 
the post-verdict survey so that results of the survey would provide only a very 
conservative measure of the effects of social status. Further research is needed 
to assess whether post-verdict recollections overestimate or underestimate the 
effects of status. We proceed with this analysis based on our belief that post- 
verdict surveys provide a rare and valuable, albeit incomplete, look behind the 
closed door of the jury room. 

The first author surveyed jurors in Massachusetts courts from November 1998 
to January 1999. Four judges in the Superior Court of Suffolk County and one 
judge in the Superior Court of Middlesex County? were selected using a snowball 
sample. Each judge distributed the surveys to jurors after a verdict had been 
rendered in his or her courtroom. The judges were given a script to introduce 
the surveys and encourage the jurors to complete them as soon as possible after 
leaving the courthouse. They assured the jurors that their responses would remain 
anonymous, as the surveys were coded only by respondent numbers. A stamped, 
addressed envelope was included with each survey to allow its anonymous 
return to the researcher. In total, completed surveys were received from jurors 
who served on 14 different juries. Eleven of the juries decided civil cases (78.6 
percent) and three juries heard criminal trials." The criminal juries consisted of 12 
members, while civil juries had either 12 or 14 jurors. The overall response rate 
was 35 percent, with 62 of a total 177 jurors completing the survey.? 

We are interested in jurors' perceptions of each other and how these 
perceptions might have affected jurors' diagnoses of others' influence over the 
verdict. To facilitate recollection, the first page of the survey provided respondents 
with a diagram of the table in the jury room and asked them to try to remember 
other jurors by using their seating positions (labeled with the letters A-L or A-M, 
depending on jury size). The survey then asked respondents to record the letters 
representing those jurors who were “most influential in the group's decision.” It 
is this judgment that we investigate in this paper. 

On subsequent pages of the survey, respondents rated each juror's amount 
of participation in the deliberations. Respondents were then asked to recall each 
juror's race and gender and to infer his or her occupation and lifestyle.? While 
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jurors may not share information about their occupation and are probably even 
less likely to offer information about their income, we expect that respondents 
can infer such characteristics based on other jurors' dress, speech and casual 
references to experiences (Strodtbeck, James and Hawkins 1957). Thus, class is 
measured using respondents' estimates of other jurors' lifestyles (as a qualitative 
measure of income) and coded into four categories: upper, upper-middle, middle 
and lower class." 


Analytic Strategy 


The dependent variable is one's influence during a jury deliberation, as assessed 
by one's fellow jurors. We first examine the distribution of the influence 
nominations to determine what kinds of jurors were most likely to be seen as 
influential. In particular, we want to determine how internal status roles (such as 
foreperson status and amount of participation) and external status roles (such as 
race, class and gender) affect influence in the jury room. 

Recent research suggests that participation may not be the causal link 
between status and influence in the jury room. Conversation analyses of 
status generalization in small groups indicate that early, frequent or assertive 
participation by low status group members can arouse negative reactions. On 
the other hand, group members tend to respond positively to any participation 
by high status individuals (Meeker and O'Neill 1977, Ridgeway and Berger 1986). 
Thus, we examine both participation and influence in the jury room, and allow 
social status to affect both participation and influence separately. 

This study improves upon research that has examined social status in jury 
deliberations on an aggregate level. Deference, as Shils (1968) theorized, occurs 
when one individual recognizes the higher or lower status of another. In this way, 
a juror exercises influence not on the jury as a whole, but on each of the jurors 
interpersonally. One juror's perception of another's influence may vary according 
to his own social background and the background of the juror he is evaluating. 
If we want to know whether an upper-class juror was seen as influential due 
to her performance or due to the deference gained by her status, we need to 
somehow account for the status of her fellow jurors. Aggregate measures of 
group similarity would not capture this dyadic process. 

To examinethe extentto which jury deliberations are marked by status deference 
or status generalization, we use logistic regression to assess the probability that a 
given juror, /, identified another juror, /, as influential." The unit of analysis is a pair 
of jurors, rather than an individual. In a given setting containing n persons, there 
are n * (n - 1) potential dyads. There were 14 juries surveyed, some comprised of 
12 jurors and others 14. Not everyone returned the questionnaire, so the number 
of dyads reported in each jury varies. We utilize influence judgments made by 62 
(35 percent) of the 177 jurors. Excluding missing cases (e.g., where a respondent 
did not remember one of the other jurors), we observe 828 dyads."? 

The analysis proceeds as a series of nested models. We first consider 
whether status roles internal to the jury process - such as /'s participation in 
the deliberations and whether or not / is the foreperson — are associated with js 
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perceived influence on the verdict. J's foreperson status is modeled with a dummy 
variable, where a positive effect signifies that authority over the formal aspects of 
deliberations causes influence. Note that when jurors select forepersons there is 
a likely additional link between status and participation (Strodtbeck, James and 
Hawkins 1957). In Massachusetts, forepersons are selected by the judge, so this 
linkage is attenuated. 

The second model considers the effects of status roles that are external to the 
jury room. This model includes /'s sex, race, and social class (see Table 1).'? We 
use the lifestyle rating described above as a proxy for class.'^ The respondent's 
(s) characteristics (race, sex and class) are entered in the third model. These 
variables are essentially controls. Without considering them, it is impossible 
to know whether members of a given group, such as males, are judged most 
influential because they were actually more influential or because males were the 
dominant sex group in the jury room, thus allowing them to capitalize on in-group 
preferences. 

The final model assesses whether positive appraisals of influence reflect 
valuations of social status, using variables that indicate the relationship between 
's and /'s attributes (shown in Table 1). For race, sex and class we create one 
dummy variable apiece indicating status asymmetry between / and /, such that / 
has higher status or is in a more historically dominant group than /. If any of these 
dummy variables are positive and significant, then status deference is operating 
on influence appraisals.56 


Results 
Who is Influential in the Jury Room? 


Influence is not exactly a scarce resource in juries, and jurors often vary widely in 
their appraisals of influence (Marcus, Lyons and Guyton 2000). In our study, 44 (or 
about 25 percent) of the 177 jurors were identified as influential by at least one 
respondent. On average, each juror who was seen as influential was nominated 
by 40 percent of his fellow jurors." In half of the juries, the average level of 
agreement about who was influential fell just below 33 percent, with only one 
jury under 25 percent agreement and one above 50 percent agreement. 

Influence can emerge through two key mechanisms: status differentiation 
internal to the jury process and the mapping of external social characteristics 
(and their corresponding status roles) onto jury interactions. With regard to 
internal processes, one would expect the foreperson to be particularly influential. 
Indeed, of the 14 forepersons, eight (57 percent) were named as influential by at 
least one other juror.' 

A second form of internal status is a juror's participation in deliberations. Highly 
participatory jurors are more influential. Overall, about 40 percent of the jurors 
who participated "much more" than others were seen as influential, compared to 
only 14 percent of those who spoke "much less" than others. As Figure 1 shows, 
when we look within each jury, those who talked "much more" than others were 
recognized as influential by about 19 percent of their fellow jurors, on average. 
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Figure 1. Average Proportion of Influence Nominations from Fellow Jurors, by Juror j's 
Participation Level (n = 177) > 
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By contrast, those who spoke “somewhat less” or “much less” than others had 
less than a 4 percent chance of influencing their fellow jurors. 

Influence in the jury room may also stem from external status. Early research 
found that upper-class, male jurors have the most influence over verdicts (Hawkins 
1961; James 1959; Strodtbeck, James and Hawkins 1957; Strodtbeck and Mann 
1956). Our analyses indicate that these associations are not as strong as they 
once were. Overall, 27 percent of male jurors and 24 percent of female jurors 
received at least one influence nomination. (The difference is not statistically 
significant.) In addition, 28 percent of both whites and non-whites were seen as 
influential. 

While influence is conferred nearly equally among males and females, whites 
and non-whites, it is not evenly distributed by class. About 39 percent of upper- 
class and 38 percent of upper-middle-class jurors were regarded as influential, 
compared to just 13 percent of middle class and 8 percent of lower class jurors 
(x? = 17.054, p < .001). Figure 2 indicates that members of the upper classes 
received influence nominations from a relatively high proportion of respondents 
within their juries. Upper-class jurors received nominations from 17 percent of 
the respondents in their juries, whereas lower-class jurors received nominations 
from only 2 percent of their fellow jurors. 

Who, then, has influence in the jury room? Forepersons and highly-participatory 
jurors are influential in deliberations. Influence does not appear to vary by race 
or gender, but it does seem to vary by class. Thus, jurors derive influence from 
statuses both internal and external to the jury. From these analyses, however, it is 
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Figure 2. Average Proportion of Influence Nominations from Fellow Jurors, by Juror j's 
Social Class (n = 177) 
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difficult to tell whether differentials in influence are due to status generalization 
or status deference. Theoretically, both processes allow the transposition of 
external status differentials onto jury room interaction. Interpersonal influence, 
however, implicates the statuses of both the rater and the ratee within the same 
process. We thus turn to the dyadic analysis. 


Influence at the Dyadic Level 


Dyadic logistic regression analysis, presented in Table 2, demonstrates robust 
effects of internal status roles on influence. Forepersons are more than twice as 
likely as non-forepersons to be regarded as influential (odds ratio = 2.3, p < .05). 
Participation has a similarly strong impact on influence. Recall that jurors rated 
each other using five levels of participation. A juror at any given participation level 
was 1.7 times more likely to be influential than a juror at the next lowest level of 
participation, on average. 

Status roles external to the deliberation process affect influence in a limited 
way. The inclusion of /'s external status characteristics significantly improves 
the model's predictive power. (See Table 2 for the decrease in -2 log likelihood 
associated with the addition of these variables.) Of the three characteristics 
— sex, race and social class — only class has a significant impact on influence. 
On average, members of any given social class are 2.3 times more likely to be 
influential than members of the next lowest class. 

Figure 3 clarifies the relationship between social class, participation and 
influence. Neither high social class nor high participation guarantees influence in 
the jury room, but they are both very consequential. For example, the probability 
that an upper-class juror who participates at the highest level will be judged 
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Table 2: Logistic Regression Models Predicting the Likelihood of a Juror (j) Being Judged by 
Another Juror (i) as Most Influential in the Jury's Decision (n — 605 dyads) 


a > RN NE 
j's Characteristics 
Foreperson status 2.219" 2.280* 2.245* 2.229* 
(.428) (.444) (.453) (.452) 
Participation 1.649*** 1:505; 1635" AULA 
(.148) (.151) (.154) (.160) 
Male 1.393 1.396 1.525 
(.332) (.337) (.592) 
White 1.023 1.059 1.439 
(.484) (.490) (.627) 
Social class 2.308** 2 27 1° 3.616** 
(.279) (.282) (.452) 
i's Characteristics 
Male 3.258" 3.447* 
(.459) (.628) 
White .688 .308 
(.567) (.934) 
Social class .955 .620 
(.404) (.501) 
Dyadic Characteristics 
Sex deference .923 
(.719) 
Race deference .303 
(1.012) 
Class deference 313 
(.675) 
Constant .026*** .003*** .002** .004*** 
(.751) (1.085) (1.526) (1.592) 
Decrease in -2LL* -- 11.086' 9.417' 3.902 
Nagelkerke R .196 .231 .261 213 


Note: Standard errors are presented in parentheses below odds ratios. Dummy variables indi- 


cating from which jury the case is taken and controls for /’s expansiveness and /’s attractiveness 
are included, but not shown. 


*p«.05 *p<.01 *** p<.001 (one-tailed tests) 
* Decrease in -2 log likelihood from previous model is evaluated using x? (df = 3), * p « .05. 
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influential is .825, whereas, for middle class jurors who participate at the highest 
level, the probability is .390. At the same time, it appears that class has a much 
stronger effect on influence than participation. Lower-class jurors who participate 
at the highest level barely achieve the influence enjoyed by upper-class jurors 
who participate at the lowest level. Highly participatory members of the upper 
class were the most influential, but we find no interaction between participation 
and social class. 

The third logistic regression model introduces external status characteristics 
— the sex, race and social class - of /, the rater. The predictive power of these 
characteristics rests primarily on /'s sex. The likelihood of male respondents judging 
afellow juror as influential was overthree times that of female respondents. Perhaps 


Figure 3. Predicted Probability of Influence Nomination, by j's Social Class and Participa- 
tion Level (n = 605 dyads) 
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men were more comfortable conferring influence nominations, or they recognized 
influence more readily, than women. Further research is needed to clarify the cause 
of this discrepancy. The external status characteristics of /, however, are not a main 
focus of this study and are included here primarily as controls. 

Finally, we incorporate dyadic characteristics, which are intended to measure 
deference across sex, race and class categories. The key finding here is that the 
inclusion of these variables does not significantly alter the model's predictive 
power (x? = 3.902, p = n.s.). Furthermore, none of the deference variables are 
significant. Status differences between individuals, then, do not appear to create 
a greater propensity to confer influence.'? For example, women were no more 
likely to recognize men as influential than they were to nominate other women, 
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and minorities were no more likely to rate whites as influential than they were to 
nominate other minorities. e 

The final model, with all controls, reiterates the impact of social class on jury 
room influence. Upper-class jurors are three and a half times as likely as lower- 
class jurors to be regarded as influential. As depicted in Table 3, upper- and upper- 
middle class jurors, on the other hand, enjoy a large share of influence votes. By 
contrast, none of the lower-class jurors were seen as influential. This pattern is 


Table 3: Influence Nominations from Juror i to Juror j by Social Class (n = 605 dyads) 





fs Social Class 


Upper-Middle 

Ps Social Class Lower Middle and Upper 
Lower 0% 0% 50.0% 

(n= 5) (n= 5) (n = 2) 
Middle 0% 8.7% 16.4% 

(n= 21) (n = 241) (n = 67) 
Upper-Middle 0% 5.2% 16.5% 
and Upper” (n = 14) (n = 154) (n = 97) 
Total 0% 7.3% 16.9% 

n=40 n= 400 n= 166 


* For the purposes of this table, upper class cases were merged into the upper-middle class 
category due to the small number of upper-class jurors. 


not affected by respondents’ class. These results, then, suggest that external 
status differences do not impose themselves on the otherwise egalitarian jury 
via status deference. Upper-class jurors were the most influential because jurors 
from a// classes recognized them as such. 


Conclusion 


The jury has come a long way in the 50 years since the studies of the Chicago 
Jury Project found that upper-class men dominate deliberations. Minorities and 
women have been brought into the jury room, and these groups now appear 
to participate in jury deliberations and influence jury verdicts as much as their 
white, male counterparts. Status deference does not seem to be operating in 
today's jury room. Our results suggest that members of historically, culturally or 
socially dominant groups, /pso facto, no longer dominate jury discussions. Male 
jurors do not influence female jurors, white jurors do not influence their minority 
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counterparts, and upper-class jurors do not influence lower class jurors solely 
because of differences in social status. 

Nevertheless, jury room influence is not wholly unaffected by status. There 
is a powerful, and singular, effect of social class on influence in the jury room. 
Upper-class jurors exercise more influence than middle- or lower-class jurors. 
This is not due to status deference, as some classical sociological theories might 
suggest. Members of the upper class are more influential because jurors almost 
unanimously recognize them as influential. 

More predictably, participation also appears to contribute to influence in the 
jury room. The present findings depart from earlier studies which posited that 
participation in deliberations is an adequate proxy for influence (Strodtbeck, 
Simon and Hawkins 1957). Upper-class individuals' greater participation does not 
fully explain their above-average levels of influence. Instead, both participation 
and upper-class status simultaneously contribute to influence in the jury room. 

If upper-class jurors' influence cannot be traced to status deference, and it 
is not explained by their higher levels of participation, then to what might it be 
attributed? The most accessible explanation, given the wealth of research on the 
topic, is that status generalization is operating in the jury room. Recall that status 
deference stems from status asymmetries, but status generalization is based on 
broader beliefs and shared expectations about particular groups (Berger, Cohen 
and Zelditch 1966; Berger et al. 1977; Webster and Foschi 1988). 

Social class may translate more readily than race or gender into expectations 
of a juror's performance for two reasons. First, unlike race or gender, class is 
often an achieved status. As such, it is more amenable to expectations about 
generalized skills. Assumptions about the greater competence, intelligence, and 
experience of upper-class persons may cause jurors to value the opinions of 
upper-class jurors, and to doubt lower-class jurors' contributions. 

Second, effortsto addressthe historicaleffects of racism andto create protected 
categories related to visible status differences have decreased the legality and 
social acceptability of translating such statuses to generalized expectations of 
competence. Doing so, in fact, would amount to overt discrimination. As a result, 
social and cultural distinctions of ascribed statuses, such as race and gender, are 
being replaced by the growing inequalities introduced by achieved status (Wilson 
1980). Lines of social class now crosscut racial and gender inequalities. Social 
class, then, may constitute the most salient and socially-acceptable basis for 
status generalization today. 

The influence of upper-class jurors might also stem from the high education 
levels and high-status occupations that are typically correlated with upper-class 
status. Skills such as public speaking, group leadership, and logical reasoning 
could enhance the effectiveness of upper-class jurors’ comments (e.g., Brady, 
Verba and Scholzman 1995). Upper-class jurors also may be more accustomed to 
the types of arguments, technical details and legal complexities presented during 
a trial, resulting in a greater mastery of, and ability to recount, this information. 
Perhaps a cultural affinity among upper-class jurors, attorneys, expert witnesses 
and judges, gives upper-class jurors more clout among their fellow jurors. 
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Further research is needed to identify the source of upper-class jurors' 
influence. This will require distinguishing between the potential effects of 
status generalization and actual skills that are concentrated in the upper class. 
Whatever the source, we also need to consider more fully the legal and social 
implications of upper-class individuals' disproportionate influence within the jury 
room. Does this have an effect on patterns of verdicts for defendants of various 
backgrounds? How would verdicts differ if influence were equally distributed by 
class? A perceived lack of influence by members of the lower and middle classes 
might result in their dissatisfaction with and distrust of the justice system. 

While not all jurors can be equally influential in deliberations, we might take 
some measures to reduce the disproportionate influence of the upper class. 
Experiments attempting to reduce status generalization in mixed-race small 
groups have found that merely reminding participants that all group members are 
equal does not result in egalitarian interactions. Informing group members that 
typically marginalized groups have something unique and valuable to contribute 
to the discussion, though, does lead to more inclusive interactions (Katz and 
Cohen 1962). Thus, if jurors are told that lower- and middle-class individuals, in 
particular, have important viewpoints to contribute to deliberations, they might 
look to these jurors for input. 

Second, giving pre-trial instruction about the legal issues in a case and 
providing a formal structure for deliberations might help to reduce the advantage. 
that upper-class individuals enjoy as a result of acquired skills, such as public 
speaking and group leadership. Aside from the unanimity requirement and 
general guidelines about participation typically found in juror handbooks, few 
formal norms guide the process of deliberation. Without a specified procedure 
for the discussions, upper-class members are likely to adopt a pattern of 
interaction that closely resembles that of "position statements" in organizational 
meetings (Manzo 1996:111). By outlining a more detailed format for structured, 
egalitarian deliberations, reaching a verdict could be a unique and unfamiliar 
experience for all jurors. 

Finally, changes to the process of trial and deliberation could reduce the 
advantage of upper-class jurors. Recent studies have examined the effects of 
trial innovations, such as allowing jurors to take notes and ask questions in 
the courtroom, permitting juror discussion during recesses, providing jurors 
with notebooks of evidence and videotapes of testimony, and presenting more 
detailed verdict instructions (Diamond et al. 2003, Mott 2003). Further research 
is needed to determine how such modifications could alter interactions inside 
the jury room. 

Examining the impact of status characteristics in the jury room is useful for 
more than just opening the black box of jury deliberations or for testing whether 
we can have better deliberations or better justice. Jury room interactions are 
"bound to have a great influence on national character." (Tocqueville [1835] 
1969:274) Small groups, such as juries, allow for the collective development and 
recognition of symbols, communities and identities (Harrington and Fine 2000). 
The jury system makes possible the interaction of diverse individuals within the 
institutionalized authority of the American courtroom, and the emergence of 
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external status hierarchies within this space powerfully reinforces the dominance 
of particular groups. 


Notes 


Ë 


2: 


10. 


28 U.S.C., secs. 1861-69. 


In Taylor v. Louisiana 419 U.S. 522 (1975) the Supreme Court required that 
the jury pool represent a cross-section of the community. And, the Court's 
decisions in Batson v. Kentucky 476 U.S. 79 (1986) and Powers v. Ohio 499 
U.S. 400 (1991) aimed to reduce group-based discrimination in peremptory 
challenges. 


Massachusetts Trial Juror's Handbook. 1984. Boston, MA: Office of the Jury 
Commissioner for the Commonwealth. 


See, for example, summaries on the effects of gender (Meeker and O'Neill 
1977, Ridgeway and Smith-Lovin 1999), education (Webster and Whitmeyer 
2001), race (Katz and Cohen 1962), and occupation (Berger, Cohen and Zeld- 
itch 1972) in small group interactions. 


United States Senate. 1955. Hearings on the Recording of Jury Deliberations 
Before the Subcommittee to Investigate the Administration of the Internal Se- 
curity Act and Other Internal Security Laws of the Senate Committee on the 
Judiciary, 84th Congress, 1st Session. Washington, D.C.: Government Print- 
ing Office. 


In the commonwealth of Massachusetts, Superior Courts have jurisdiction in 
civil cases where the damages exceed $25,000 and in criminal felony cases 
such as murder, armed robbery and rape. 


This proportion of civil and criminal trials correlates with the distribution of 
cases within the Massachusetts Superior Court system, where approximate- 
ly 80 percent of the cases heard each year are civil trials (conversation with 
Judge Peter Lauriat, Oct. 28, 1998). 


Female and white jurors were slightly more likely to respond to the survey, 
but across race and gender, the subsets of jurors and respondents are repre- 
sentative of county populations. (See appendix Table A1 for race and gender 
distributions.) 


The respondent's initial identification of the most influential juror(s) was pur- 
posively located on the first page of the survey, while questions about each 
juror's race, class and gender were asked on following pages. This design 
was intended to reduce the likelihood that recall of status characteristics and 
social and cultural norms regarding their value would affect a respondent's 
likelihood to nominate a particular juror as influential. 


Subjective assessments are a useful estimation of the effects of social sta- 
tus, as it is through individuals’ subjective evaluation of one's status that 
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II 


16. 


dida 


status differentials operate in a small group interaction. Status is a measure 
of deference, and as such, assessments of the audience offering deference 
are an appropriate measure of status (Shils 1968). 


Six respondents identified more than one juror as being the most influential. 
In such cases, all nominations are considered valid. 


. The response rate is low. Therefore, to test for potential selection effects in 


the sample, we used logistic regression to predict whether or not a juror's 
influence affected his decision to complete a survey. We also included as 
predictors sex, race, class, foreperson status, jury number and participation. 
Only class and sex were significant at the .05 level, suggesting that while 
returned surveys are not selective with respect to the dependent variable, 
results disproportionately represent the views of females and those in higher 
classes. These status characteristics are controlled in the analysis. 


. Awhite/non-white race variable is sufficient to capture the effects of being in 


America's historically dominant race group. A more elaborate coding scheme 
involving five race categories was tested, but results were not different. 


. We ran models using a dummy-based operationalization of class. The mod- 


els included upper-middle, middle and lower class, while treating upper class 
as the reference category. Results did not change considerably, so we use 
the ordinal operationalization. 


. To moderate effects of non-independence among cases in dyadic regression 


models we incorporate several parameters that capture structural features of 
the network relative to each dyad (Wasserman and Pattison 1996). One way 
of capturing fixed effects by each sender and receiver is to include two mea- 
sures: 1) 's "expansiveness," or tendency to confer influence, as measured 
by the total number of jurors / nominated, not including j; and 2) /'s "attractive- 
ness" or tendency to attract influence, which is the number of times / was 
nominated by people other than /. This is a parsimonious alternative to the 
roughly 240 parameters that would need to be added in the form of dummy 
variables representing each / and each / (Moody 2001). (There would be 11 or 
13 dummy variables for the js in each jury plus the number of dummy variables 
for /, which would equal the number of respondents.) Controls for the effects 
of expansiveness and attractiveness are included in each of the models. 


There is some evidence of multicollinearity arising from high correlations 
between the variables for race deference and "Is / white?" and between 
the variables representing sex deference and “Is j female?” Thus, we ran 


alternate models dropping the individual-level measures. The results did 
not change. 


To generate this statistic, we calculated the proportion of nominators that 
voted for each juror who received at least one nomination. We then aggre- 
gated this statistic to the jury level and then calculated the overall average. 
These are conservative estimates of agreement, as they do not correct for 
the fact that, in some cases, not all respondents made nominations. Instead 
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of counting these as missing, we assume that no one stood out as influential 
for these respondents. Furthermore, there is tremendous agreement about 
who was not most influential, which is not reflected in this measure. 


18. The forepersons in the 14 cases studied here include eight female (57 per- 
cent), 12 white (85 percent), 7 middle-class (50 percent) and 7 upper-middle- 
class (50 percent) jurors. 


19. We tested whether status similarity, rather than deference, affects perceived 
influence. Results were not significant. However, status similarity was posi- 
tively related to the conferring of influence, whereas deference appears to 
be negatively related to influence. 
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Appendix 


Table A1: Racial Composition of Jurors, Respondents and Overall Population for 
the Counties Included in the Sample 


Race Jurors Respondents Residents* 
Asian 3 2 84,491 
(1.6996) (3.2%) (4.10%) 
Black 30 8 178,916 
(16.95%) (12.9%) (8.68%) 
Hispanic 6 1 116,955 
(3.39%) (1.6%) (5.67%) 
White 138 51 1,670,812 
(77.97%) (82.3%) (81.01%) 
Other 0 0 11,185 
(.54%) 
Total 177 62 2,062,359 
(100.0%) (100.0%) (100.0%) 





Source: 1990 U.S. Census and survey data. 
a “Residents” represents the total residents of Middlesex and Suffolk Counties in 
Massachusetts, from which the jurors were drawn. 
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Social Context and Musical Content of Rap Music, 1979-1995 


Jennifer C. Lena, Vanderbilt University 


Abstract 

There is a link between the context of production and the content of rap music singles. 
This research finds that when independent labels owned most of the charted singles, lyrics 
emphasized features of the local environment and hostility to corporate music production 
and values. In contrast, the major-label dominated market featured lyrics blending “street” 
credibility and commercial success in the "hustler" protagonist. I contribute a new finding 
to research on market concentration and musical diversity: artists’ reactions to the market 
effects musical content. 


Over the past three decades, scholars produced important evidence on the 
role of industry, organizational structure and markets in the creation of cultural 
goods. However, the consequences of these structures for creative people and 
the content of the cultural objects they produce are not yet well understood, as 
noted by Cerulo (1984) and Peterson and Anand (2004). This gap in the literature 
is striking given the emphasis of lead production of culture scholars on the 
impact of the context of production on artistic work (Becker 1982, White and 
White 1965). Yet, there is much evidence for the effect of market concentration 
and the organization of firms on the production of rap music. 

The structure of the contemporary recorded music industry is oligopolistic, 
controlled by multi-national firms with diversified holdings across multiple 
media ànd consumer product markets (Burnett 1995; Burnett and Weber 1989; 
Dowd 2000, 2004; Frith 1987, 1988). These firms restructured their bureaucratic 
infrastructure in the past several decades, establishing semi-autonomous 
divisions that oversee subsidiary labels. Subsidiary labels are typically run 
by freelance producers, who are “delegated the responsibility of producing 
marketable creations, with little or no interference from the front office beyond 
the setting of budgetary limits.” (Hirsch 1972:644) Recently, Dowd (2004) found 
one effect of this “de-centralized” production was to attenuate the negative effect 
of oligopolistic markets on musical diversity.? De-centralized production allows 
subsidiary labels to imitate independent labels” ability to seek out new talent 
and musical styles; at the extreme, they are able to co-opt the type of musical 
products independent labels offer (Dannen 1990, Negus 1999). 

Scholars do not agree, however, on the source of diversity in music markets. 
Anderson and Hesbacher (1980) found independent labels often introduce new 
genres to the marketplace, but these genres have little variability. Independent 


I would like to thank Peter S. Bearman, Laura Carpenter, Daniel Cornfield, Holly Mc- 
Cammon, Gina Neff, Richard A. Peterson, Tammy Smith, two anonymous reviewers for 
Social Forces and Editor Peter Uhlenberg. An earlier draft of this paper was presented 
at the American Sociological Association's Sociology of Culture Panel on Creativity, In- 
novation and Ideas in August 2004. Direct correspondence to Jennifer C. Lena, Vanderbilt 
University, Department of Sociology, VU Station B #351811, Nashville, TN 37235-1811. 
E-mail: jennifer.c.lena@vanderbilt.edu. 
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labels introducing new genres typically record new artists (Dowd 2003, Kennedy 
1994, Lopes 1992, Peterson 1997); however, Dowd (2000) found that hits of new 
performers are not as musically varied and dissimilar as those of established 
performers. Disagreements over the source of diversity are partly attributable 
to the lack of a standard measure: the literature relies both on a variety of proxy 
measures and direct measures of musical content. Proxy measures of musical 
diversity include the appearance of new artists (Lopes 1992, Peterson and Berger 
1975, Rothenbuhler and Dimmick 1982), the appearance of new firms and artists 
(Dowd 2004, Lopes 1992, Peterson and Berger 1975), and the number of female 
(Alexander 1996, Anderson and Hesbacher 1980) and African-American (Dowd 
and Blyler 2002) performers. Direct measures of musical content include lyrical 
diversity (Peterson and Berger 1972, 1975), melodic and chordal structure (Dowd 
1992), musical dissimilarity (Dowd 2000), and musical material (Alexander 1996). 

This research seeks to contribute new measures of diversity, an analysis of 
lyrical and sub-genre musical content. It offers the first analysis of R&B music 
charts and population data on charted rap music as well as evidence of the effect 
of decentralized production by oligopolistic firms on the content of music. There 
are consequences of social context on art production. 


Rap Music 


Rap music was the best case for this study for several reasons. First, the link 
between record label management practices and rap music content has been 
a focus of highly publicized criticisms of the genre. Sen. Joe Lieberman (D- 
Conn.) captured national attention with a letter writing and radio campaign aimed 
at pressuring record companies to cease production of “obscene” rap music 
(Holland 1996:8). Lieberman was joined by fellow Sen. Sam Dunn (D-Ga.) and 
C. Delores Tucker (National Political Congress of Black Women). This followed 
two years of anti-rap record store demonstrations and Congressional Hearings 
on the issue. There is evidence that record labels encouraged rap artists to 
produce prurient content. For example, during this period, Carmen Ashhust- 
Watson described the development strategy of rap label Def Jam: “Right now 
gangsta rappers are the big thing. If [a hypothetical rap group] look like the kind 
of group that has the capacity to do that, then [our label] might suggest they 
do some gangsta-style songs.” (quoted in Rose 1994b:124) The link between 
(objectionable) rap content and record labels is thus of interest to policy makers 
and the public. Second, rap music charted earliest on the Billboard Magazine 
weekly R&B charts, a data source scholars have ignored. Drawing upon these 
charts provides the most complete temporal snapshot of charting rap songs and 
includes more sales heterogeneity than other studies of music markets, with the 
notable exception of Dowd (2004).? 


Data and Methodology 
The primary data are all rap singles that appeared in the weekly 7op 700 R&B 


Billboard Magazine charts from Jan. 1, 1979 through Dec. 31, 19953 The 
compilers of the Billboard charts have based their rankings on a weekly sampling 


Context and Content of Rap Music * 481 


of wholesale record sales, jukebox plays and airplay on radio stations since Aug. 
4, 1958. Although the formula coordinating these three sources of data has 
changed over time (Hesbacher 1974), experts agree that the Billboard charts are 
the fairest and least subjective of the published music charts (Anderson et al. 
1980, Denisoff 1986, Dowd 2004, Lopes 1992, Peterson and Berger 1975). 

Four criteria were used to determine if R&B singles were included in the 
population of rap songs: chart designation, rap interludes, shelving designation 
and expert opinion. First, a song was included if it was designated “rap” in the 
official compilation of these weekly charts (Whitburn 1996). R&B songs with rap 
interludes were designated thusly: “featured performer MC X (rap),” indicating 
a rap within the song, and this was sufficient for inclusion in the population. 
If the text indicated neither the rap genre nor the presence of any interlude, 
the shelving designation on the back cover of the album/CD was considered; 
if either "rap" or "hip-hop" were listed among the first two genres, all the songs 
that charted from that album were included in our population.^ Finally, several 
rap music experts, including major retailers of rap music and radio professionals, 
were asked to determine the genre of ambiguous songs. In total, 1,221 songs 
met at least one of these criteria for inclusion. 

The record label of each rap single was recoded to identify it with the 
appropriate parent corporation. Following Peterson and Berger (1975), Lopes 
(1992), and Dowd (2004), these corporations were determined by issues of 
the Annual Billboard International Buyers Guide and on information from the 
Securities and Exchange Commission (in particular, 10-K Forms and annual 
reports to stockholders). Corporations typically under-report limited partnerships 
with recording companies, but errors were reduced by supplementing these 
sources with Standard and Poor's Daily News reports and trade press articles. 

Rap music content was classified into mutually exclusive, exhaustive 
and independent categories that reflect producers’ and consumers’ axes of 
differentiation. Sociologists of culture have provided evidence of the differentiation 
of artistic tastes that constitute consumers as fan communities (Bourdieu 1984, 
1993, 1995; DiMaggio 1991; Frith 1996; Mark 1998; White 1993; White and White 
1965). Audiences sense - literally hear (or see, etc.) - genres and sub-genres in 
music and by audibly mapping the system and choosing their preferences they 
contribute to these communities. The link between sub-genres of rap music and 
audience groups is established in the work of Schusterman (1991), Gilroy (1993a, 
b), Rose (1994a), Brennan (1994) and Walser (1995). Krims (2000:3) argues, “the 
degree to which a rap (or more generally, hip-hop) fan will defend the authenticity, 
originality, and sophistication of her/his favorite rap style/genre/artist/aloum/song 
is Virtually unparalleled.” 

Little musicological analysis has been done to obtain the dimensions of 
comparison used by rap audiences with the exception of Krims (2000), and it is 
from this work that sub-genre classifications were taken. Krims (2000) performed 
acontent analysis of articles and advertisements in popular magazines, interviews 
with artists, conversations with fans and the music Itself to generate the musical 
and lyrical components that distinguish rap sub-genres. His categories reflect 
the “remarkable consistency in how the field of...rap music is divided and 
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that characteristics are attributed to each [sub-genre]." (Krims 2000:47) These 
distinctions reflect criteria used by producers and consumers to organize the 
field of production into sets of contrasts between songs, artists and albums. 
From these distinctions, a taxonomy based on four criteria was developed. When 
applied to a song, this sorted it into a sub-genre group with a unique admixture 
of attributes: 


1. Flow is "the verbal wordplay of rap" — the rhythmic 
pattern and meter used by the rapper in delivering spoken- 
word texts. There are three kinds of flow: a sung flow is 
marked by rhythmic repetition, on-beat accents and regular 
pauses, and gives the effect of a sing-along. A percussion 
saturated flow employs a sharply attached or crisp delivery 
and offbeat attacks to push the boundaries of meter, 
spilling the text past the caesura of the metric line. A speech 
saturated flow resembles spoken language, but with a great 
number of rhyming syllables. Rappers often begin a new 
four-measure unit with the end of the previous measure or 
rhyme complex, and commonly omit a metric pulse. 


2. Musical Style distinguishes the function performed by 
the "background" music. If instrumentation is backgrouna 
or rhythm for the primary voice or voices then that rap 
has a rhythmic musical style. When pre-recorded music 
("samples") is foregrounded in the composition (even if the 
sound or tempo is distorted), this is a memory musical style. 
When instrumentation or samples function as the "speaking 
voice,” this is an interlocutor musical style. This technique is 
most recognizable when scratching, the practice of using a 
body part or object to slide a record back-and-forth under a 
needle to create a percussive distortion of the music. 


3. Rhythmic Style is characterized by the regularity of 
rhythm. One crucial role for rhythm in rap songs is to make 
possible the audibility of layering, or the multiplication of 
musical tracks within a recording through the assemblage 
of disparate musical sources, creating polyrhythmic (and 
polytimbral) textures (Krims 2000, Rose 1994a)5 Rap 
songs are distinguished based on whether they have a 
unitary rhythm style (one dominant beat), or use multiple 
rhythms. For example, a song with a multiple rhythm style 
will permit the listener to distinguish a piano from a guitar 
from a drum. 


4. Semantic Content distinguishes songs based on lyrical 
content. There are more than 10 tropes found in rap music 
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between 1979 and 1995, including sex, love, violence, 
genderroles, politics, race, partying, money, comedy/parody 
and boasting. These categories are not mutually exclusive 
and exhaustive. 


The complete audio recording or majority segments of each of the singles 
in the population and lyrical content were secured from several archives.? It is 
possible that inaccessible sonic segments of some singles may have caused 
a partial coding to result, and thus an erroneous allocation of those singles 
to sub-genre groups. It is possible that inaccuracies in the text of lyrics could 
have caused errors in the lyrical coding. In addition, the coding of lyrics was 
complicated by the use of slang, metaphor and irony. However, errors of this 
kind are likely to be random, and will affect only the magnitude of effect, not its 
presence or absence. 

Each single was allocated to a sub-genre group using a "best fit" method, 
according each single one and only one category for three of the four coding 
criteria (except semantic content). In order to establish the reliability of the coding 
procedure, 25 percent of the data was recoded a second time after an interval 
of nearly three years. The simplest estimate of stability is the percent of codes 
on which the two waves agree; published articles typically have a minimum of 
90 percent coding reliability, although scores in the high 80th percentile are not 
uncommon (Wimmer and Dominick 1997). The reliability score of this coding 
was 88 percent, indicating adequate reliability. After songs were sorted into sub- 
genres, sub-genre names were assigned that matched the content and style of 
the songs and are commonly used to refer to those songs by fans. Thirteen sub- 
genre groups were identified: booty rap, crossover rap, don rap, dirty south rap, 
east coast gangsta rap, g funk, jazz rap, new jack swing, parody rap, pimp rap, 
race rap, rock rap and west coast gangsta rap (see Table 1). 


Results 
Sub-genre Development 


There were several notable characteristics of sub-genre development. Almost half 
of the charting singles were crossover raps, including the first rap single to chart, 
“Rapper's Delight” (Sugarhill Records 1979) (see Table 2). This is unsurprising 
given that crossover rap's musicological attributes (fast-paced with intelligible 
lyrics and thin instrumentation) resemble popular music of the period. In contrast, 
dirty south rap, a genre notable for its lyrical focus on sex, partying and a twangy 
“southern” enunciation, had the fewest charting songs; this is unsurprising given 
its late development and the right-censoring of the data (Miller 2004). The other 
11 charting rap sub-genres achieved parity in terms of chart success, relative to 
these two sub-genres. 

The second notable characteristic of rap sub-genre development between 1979 
and 1995 is the evolution of stylistic characteristics out of older sub-genres and 
into new ones. For example, in 1982 parody rappers expanded the musical and 
rhythmic styles of crossover rappers to include interlocutor and multiple rhythmic 
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styles, but narrowed the lyrical focus to include only parodies of other rap songs, 
or song-length "toasts." This stylistic transition was smooth compared to the 
break between the first pimp rap in 1983 and its peer genres: new jack swing, 
west and east coast gangsta, and booty rap. While pimp raps resemble crossover 
raps in the use of a sung flow, unitary rhythmic style and rhythmic background 
music, the lyrical focus on sex, money, boasting and violence demarcated it 
from the rest of 1983's charted raps. The single remained sonically and lyrically 
distinct until 1988 when six pimp rap songs charted and were joined by singles 
in other, new rap sub-genres that contained similarly graphic depictions of sex 
and violence. 


Corporations and Puerile Content 


Did major record labels produce the majority of puerile rap? The perceived 
abundance of this lyrical content prompted anti-rap crusaders like Robert H. 
Bork to describe the genre as a “knuckle-dragging sub-pidgin of grunts and 
snarls, capable of fully expressing only the more pointless forms of violence and 
the most brutal forms of sex.” (1996:125) Industry watchers blamed the profit 
motivation of major record labels; they concluded that as long as there were 
customers willing and able to buy, “record labels will continue to put ethics and 
morality aside to release [violent or sexist rap].” (McAdams and Russell 1991: 
R-22) The shock value of this music contributed to its success on the charts 
(Perkins 1996): Billboard Magazine editors argued that corporations invested in 
puerile rap "because [it] was sleazier as a lure, easier as an enterprise, and more 
speedily remunerative at the end of the day." (Anonymous 1996:4) 

For the purposes of evaluating this contention, songs were recoded in eight 
sub-genres (pimp, new jack swing, west and east coast gangsta rap, booty, g 
funk, don and dirty south) as "hardcore" rap because their lyrics contained 
graphic descriptions of sex or violence. The strength of similarity between these 
sub-genres rests in their distinctiveness in the field of rap music production - in 
comparison to other charting rap sub-genres. Scholars have noted that across 
sub-genre styles, hardcore rap lyrics feature a "hustler" as a protagonist (Krims 
2000, Rose 1994a). The hustler dominates or victimizes others using force and 
seduction; hustlers are either "baaadmen" who dominate through force and 
intimidation or pimps (macks or players). The pimp and the baaadman are anti- 
heroes — representations of men who can control others physically or lyrically. As 
anti-heroes, pimps and baaadmen personify accomplishment against the odds; 
they live in a world of reverse power where the oppressed are powerful (Gates 
1988, Kelley 1996, Roberts 1989, Smitherman 1997, Toop 1991). The protagonists 
of hardcore rap have one foot in the world of extreme bourgeois consumption 
and success and the other in the ghetto. 

Among singles charting in the late 1980s, there is a striking profusion of sub- 
genres utilizing very different styles of flow, background music and rhythm, but 
all featuring hustler protagonists. Sub-genres differed in how they deployed the 
hustler narrative. Booty rappers like Miami group 2 Live Crew epitomized sexual 
anti-conformity (in songs like “Dick Almighty," “Me So Horny” and "The Fuck 
Shop” (As Nasty As They Wanna Be, Little Joe Records 1989), while pimp rappers 
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foregrounded money and placed sex as a correlate (Sean "P Diddy" Combs' work 
is an example, particularly "It's All About the Benjamin's" (No Way Out, Bad Boy/ 
Arista Records 1997). In contrast, gangsta rappers emphasized the violent anti- 
social behavior of the baaadmen. For example, on the inner sleeve of their album 
Straight Outta Compton (Priority Records 1988), gangsta rap group N.W.A. (Niggaz 
Wit Attitude) thanked: 


All the gangsters, dope dealers, criminals, thieves, vandals, 
villains, thugs, hoodlums, killers, hustlers, baseheads, hypes, 
winos, bums, arsonists, police, maniacs and bad ass kids for 
listening to our shit... 


Finally, don rappers (like Master P and Junior M.A.FI.A.) combined gangsta rap's 
violence with the emphasis on wealth, individuality and the sexual dominance of 
the pimp persona. 

Table 3 reveals that starting in 1988 the largest record corporations charted 
substantially more “hardcore” rap songs than independent labels. In the eight 
years between 1988 and 1995, majors charted up to five and a half times as 
many hardcore rap singles as all their independent competitors combined. 
Major record labels produced the majority of puerile rap, consistent with the 
hypothesized trend. 


Authenticity and Identity 


While coding the data on lyrics, strong similarities across sub-genres during the 
period before 1988 were readily apparent. Charting artists in crossover, parody 
and rock rap shared a lyrical emphasis on partying, romance, humor and parody, 
and upon closer inspection appeared to use similar slang and regional argot 
as well as references to specific personalities, places, events and concerns. 
Songs that described romantic entanglements identified specific community 
members, as in UTFO’s “Roxanne, Roxanne” (U7FO., Select Records 1985). 
Lyrical competition, or battling, was organized largely around neighborhood 
boundaries as in the contest between Queensbridge and The Bronx captured 
in MC Shan's “The Bridge” (Down By Law, Cold Chillin’ Records 1987) and 
Boogie Down Productions’ answer “The Bridge is Over” (Criminal Minded, B- 
Boy Records 1987). Cautionary tales like Dr. Jeckyll and Mr. Hyde's "Fast Life" 
(Profile Records 1981) or Grandmaster Flash’s "The Message" (7he Message, 
Sugarhill Records 1982) were located in particular surroundings and evoked the 
physical and social deprivation faced by many rappers and their families. These 
rap lyrics linked “representin’ the ghetto" with "keepin' it real." Sincerity and a 
ghetto identity were deeply associated in early rap lyrics, a facet of “authenticity 
work.” (Peterson 2005) The early focus of rap lyrics on local people, places and 
things could reasonably be a function of familiarity alone. These artists used for 
inspiration the source material in their environments. 

Rappers were particularly likely to respond to the impact of the increasing 
commercialization of rap on their local environment. Rappers often forswore the 
corrupt values of commercial enterprise and instead promoted an ethos akin 
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to Bourdieu's conception of “art for art's sake.” (1989) Rapper JDL (Cold Crush 
Brothers) captures the sentiment of many old school rap musicians: 


See, we came from the origin of hip-hop, when it wasn't a 
money thing. It wasn't a political structure that the record 
company builds around hip-hop. It wasn't about getting 
paid and all that. lt was about something you loved. 
(Fernando 1994: 14) 


In the distinction of "money" and "love," JDL contrasts rappers who work in 
a "political structure" that is run by the "record company," to "original" legitimate 
rappers. Early rappers repeatedly echoed the positive value of marginal commercial 
status. Remarkably, even Rick Rubin, producer of several of the earliest gold and 
platinum rap albums, lamented the commercialization of rap: 


At the time that | started it was still kind of a pure thing... 
[people] were doing it out of love for it. And | think that 
changed to people thinking they could get something from 
it. And when that happened the art kind of went away, and 
it became commerce, and people trying to put records 
together to sell. And that's what really ruined it for me. 
(Rubin quoted in Fernando 1994: 169) 


Beyond its ubiquity, what is striking about this anti-commercial ethos is the 
productive context in which it emerged. The rap industry had entered a period 
of consolidation, and rap artists were aware of the oligopolization efforts of 
corporations as early as 1983 when Sugar Hill Records, the earliest and most 
successful rap label, established a contract making it a subsidiary of CBS Records 
(Lena 2003). Early rap artists noted a rise in the number of major label contracts 
and chart spots allocated to rap groups (Fricke and Ahearn 2002). A number of 
high visibility contract disputes during this period still fuel morality tales that warn 
new rap artists of the pitfalls of doing business with the "majors" (e.g., Fricke and 
Ahearn 2002:212-15, 271, 324). 

Rappers developed particular tropes of artistic legitimacy to cope with the threat 
of corporate incursion into the market reflected by the “hustler” character. The 
large-scale shift toward hardcore rap lyrics after 1988 may have been promoted 
by ex-independent rap artists who sought to reconcile their earlier espousal of 
anti-commerciality with their new status as "corporate" artists. They felt pressure 
to craft an identity suitable (and saleable) in the mainstream recording industry 
while "keepin' it real," remaining congruent with an older value system. The 
hustler is still an outsider, but one with a comfortable relationship to commercial 
culture and material success. Despite the surface-level tension that exaggerated 
violence in hardcore rap provokes between reality and art, it still manages to 
represent "a religion and ideology of authenticity." (Perkins 1996:20) To wit, 
hardcore identity is deeply rooted in "being real" (especially found in slogans of 
"real niggaz,” "niggaz for life" and "bein' and stayin' real;" see Perkins 1996:19- 
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20). The shift toward hardcore rap may reflect a re-alignment of rap identity, one 
that reconciles “old school” authenticity with “new school” profits.° 


Conclusion 


Theseresults suggestan unanalyzed dimension of research on market concentration 
and musical diversity: artists react to this environment and this affects musical 
content. In the early period of rap production, lyrics emphasized features of the local 
environment, and producers emphasized anti-corporate values as an extension of 
their vision of authentic artistic production. In contrast, the major-label dominated 
market for rap music featured lyrics that effectively blended "street" credibility 
and commercial success in the hustler protagonist. These results suggest artists’ 
perceptions of the R&B market effect artistic content. 

This research has three goals: (1) to contribute new lyrical and sub-genre 
measures of musical diversity, (2) to offer a novel analysis of R&B charts and rap 
music, and (3) to offer evidence of the contemporary music market's effect on 
rap content. Production and artistic content are clearly linked. While the diversity 
of rap sub-genres over 16 years of production is undeniable, an analysis of rap 
lyrics suggests strong similarities across sub-genre styles. Taken singly, either 
level of analysis would contribute only a partial understanding of the relationship 
between market concentration and musical content. Using past proxy measures 
(e.g., the appearance of new songs, artists and firms), and some direct measures 
of musical content (e.g., background music, flow and rhythm) would lead one to 
conclude that diversity is thriving in post-1988 rap music. However, this analysis 
moves beyond such a partial understanding by illustrating critical and strong 
similarities across styles and artists. 

Future research necessitates case studies of artistic production. Only by 
turning to the reflections of artists and experts was the nature of the link between 
social context and musical content in rap made clear. These results illustrate the 
critical need for studies that join specific social contexts to the production of 
art works. Following Cerulo (1984:886), | exhort sociologists to engage in this 
research because "if a composer is a product of a particular environment, and 
hence, subject to its influences, it is reasonable to expect that a specific social 
context will provide insight into the processes of musical construction." 


Notes 


1. Using a range of historical and ecological factors hypothesized to effect musi- 
cal diversity, Dowd (2004) found growing levels of decentralized production ` 
reduced the negative effect of concentration on carriers of diversity, and at 
the most expansive levels of decentralization, concentration no longer had 
any impact on these carriers (2004:1444). In fact, Dowd found distinct ef- 
fects on two forms of "producer" diversity: the number of new performers 
was more sensitive to contextual factors than the number of new firms. 


2. Lopes (1992) remains the only author to have investigated rap music diver- 
sity in light of market and industry structure, however, he relied on year-end 
charts of the top 100 singles and albums. 
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3. The charts used in this study were variously entitled "Hot Soul Singles" (1979- 
1982), "Hot Black Singles" (1982-1990), and "Hot R&B Singles" (1990-1995). 


4. "Hip-hop" is a more expansive term than "rap," including particular styles of 
language, music, dress, dance and occupation of physical space, however, 
“rap” is the preferred term here because the focus is on the music. 


5. There is a close relationship between timbre and rhythm in rap music: timbre 
is the quality of sound that allows a listener to distinguish sounds that have 
the same pitch and loudness (Krims 2000). It is determined by the harmonic 
content of a sound and its dynamic qualities (like vibrato or attack). 


6. http://www.ohhla.com. http://www.lyrics.com. 


7. Toasts, or long, profane vernacular narrative pieces performed orally, and their 
connection to rap lyrics is discussed by Kelley (1997) and Rose (19944). 


8. Kelley (1996:130) suggests that rappers parallel white-collar crime with street 
crime, thus "reversing dominant discourses legitimating entrepreneurship, 
hostile takeovers and a global corporate culture more concerned with profits 
than committed to a national community...gangsta rappers hold up the illicit 
economy as a mirror image of American capitalism." 


9. Kelley (1996) offers an alternative interpretation, arguing that these are mas- 
culinist narratives that use metaphorical and playful descriptions of violence, 
and also that they are "street ethnographies," functioning as an alternative 
vision of American life. This sentiment is echoed in other work, notably that 
of Tricia Rose (19944). 
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Volunteering in Middle and Later Life: 
Is Health a Benefit, Barrier or Both? 


Yunging Li, New Jersey Department of Health and Senior Services 
Kenneth F. Ferraro, Purdue University 


Abstract 

The positive association between volunteering and health has been widely interpreted 
as evidence of the salutary effect of volunteering during adulthood. Using three waves 
of data from a national survey, this study uses structural equation models to examine 
the relationships among volunteering, functional limitations, and depressive symptoms 
during middle and later adulthood. The findings reveal a salutary effect of volunteering in 
later life as well as a compensatory mechanism. By contrast, only a barrier mechanism was 
uncovered in middle age: Depressive symptoms decreased volunteer participation over time. 
The results demonstrate life course variation in the relationship between volunteering and 
health and suggest greater attention to selection processes in the study of social engagement 
and health. 


Voluntarism is an important component of social organization, and most modern 
societies rely on a wide variety of voluntary associations to serve communities 
in tangible ways. Engagement in voluntary associations contributes to a more 
civil society and reportedly improves volunteers' physical (e.g., Thoits and Hewitt 
2001) and mental health (e.g., Musick and Wilson 2003). Such benefits could be 
substantial over the life course, despite the fact that volunteer activity fluctuates 
during adulthood. 

Previous research reveals that voluntary association participation is generally 
greater in middle age as work and family responsibilities spur social engagement 
from union halls and fraternal organizations to support groups and soccer fields 
(Curtis, Baer and Grabb 2001; Smith and Shen 2002). Although late middle age and 
older ages are associated with more free time, poor health may reduce activity in 
later life, thereby reducing participation in voluntary associations (Li and Ferraro 
2005). Previous research clearly documents important differences in the levels of 
participation over the life course, especially due to role incumbency at various life 
stages. These differences have been used to account for the differential health 
benefits of volunteering in both younger and later life (Musick and Wilson 2003; 
Van Willigen 2000). 
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One suspects that health may be more intricately related to volunteering 
in later life, both as a potential benefit and barrier; volunteering may benefit 
health, but poor health may preclude sustained volunteering. Failure to consider 
both the benefit and selection processes, however, may result in misleading 
conclusions, most likely overestimating the beneficial effects of volunteering. 
Unfortunately, relatively few studies have systematically examined the benefit 
and barrier mechanisms between volunteering and health in middle and later 
life. The present research is designed to address this gap and uses three-wave 
longitudinal data from a national sample to test both mechanisms. It also seeks 
to explicate the unique life course factors that influence selection into volunteer 
participation in middle and older adulthood. 


Voluntary Association Participation during Adulthood 


Numerous studies reveal that social engagement varies over the adult life 
course. Most studies show that voluntary association participation peaks in 
middle age and declines slightly in later life (Knoke and Thomson 1977; Rotolo 
2000). Most often this is attributed to the family life cycle. Parents of school- 
aged children often engage in a variety of activities with schools, sports, and 
religious organizations as part of the socialization of children. Athletic, religious, 
and recreational organizations need parents to help the programs function and. 
fulfill their mission. There are other organizations designed to serve adults related 
to their occupation or political aims. The need for volunteers in many of these 
organizations converges in middle age, making it a busy time of life. 

Older adults may have more time for voluntary association participation, 
especially after retirement. Many older Americans ascribe to a busy ethic and 
desire to engage in socially meaningful activities (Ekerdt 1986). Thus, volunteer 
activities provide a viable arena for social contributions and engagement (Krause 
and Shaw 2000). Many types of organizations from civic, political, religious and 
social welfare support groups recognize the potential of older volunteers and 
promote their contributions to the community (Herzog et al. 1989). However, it 
may be difficult for older adults to continue voluntary association participation 
over a long time period. 

Health problems are probably the greatest threat to sustained voluntary 
participation in later life. Disease and the way in which disease compromises 
physical functioning may make volunteering difficult. Decline in sensory perception 
may also make some types of voluntary activity difficult. For instance, impaired 
vision may become an obstacle for some older adults to sustain activities that 
require night driving. It is also possible that mental health problems, especially 
depression, may reduce voluntary association activity (Thoits and Hewitt 2001). It is 
plausible that volunteer participation depends more on physical and mental health 
in later life, but depends more on work and family responsibilities in middle age. 

Social institutions such as work and family may exert differential influence on 
volunteer participation over the life course (Omoto, Snyder and Martino 2000). 
Major social roles that are present include marital, parental and employment 
status. Among the three, marital status is the only factor that has consistently 
promoted volunteerism across ages (e.g., Rotolo and Wilson 2004). Empirical 
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evidence is mixed regarding how employment status and parenthood influence 
volunteer participation. In later life, cessation of full-time employment has been 
shown to promote involvement in formal volunteer work for people who had not 
volunteered previously (Mutchler, Burr and Caro 2003). Children's leaving home 
is associated with a decline in voluntary association membership (Knoke and 
Thomson 1977). In mid-life, one study shows that being employed is negatively 
related to volunteer participation (Rotolo and Wilson 2004), but another study 
reveals that full-time employment increases women's organizational participation 
(Rotolo 2000). Research also indicates that having young children, which carries 
tremendous obligations, limits volunteer participation (Oesterle, Johnson and 
Mortimer 2004). Still, other research suggests that this might be true only 
for having pre-school-age children; having schoolage children most often 
encourages volunteering (Rotolo 2000, Wilson and Musick 1997). 


Volunteering and Health 


Drawing from social integration theory, one would expect volunteering to have 
positive health consequences. Social integration is often described as social 
embeddedness of individuals, typically due to multiple social roles (Durkheim 
[1897] 1951). Multiple social roles provide meaning and purpose in life and often 
protect people against despair and isolation during difficult times (Thoits 1986). 
Active engagement in social roles also promotes social support, social interactions 
and other psychosocial resources that enhance resilience in the face of disease 
(Mendes de Leon, Glass and Berkman 2003; Moen, Dempster-McClain and 
Williams 1989). A volunteer role is intrinsically rewarding because it leads to social 
recognition and a stronger sense of identity (Sieber 1974). The act of helping others 
often makes people feel happier and healthier (Wuthnow 1991). 

A large body of research has demonstrated the salubrious effect of 
volunteering on physical and mental health, often drawing from role-identity and 
role enhancement theoretical perspectives (Greenfield and Marks 2004, Lum 
and Lightfoot 2005, Morrow-Howell et al. 2003). Volunteering has been shown 
to enhance psychological well-being, net of the effect of social psychological 
resources such as self-esteem, formal social integration and informal social 
interaction (Musick and Wilson 2003). Volunteering has also been shown to 
improve different aspects of personal well-being over time, such as happiness, 
life-satisfaction, self-esteem, sense of control, self-rated health and depression 
(Thoits and Hewitt 2001, VanWilligen 2000). Research also reveals that participation 
in voluntary organizations promotes longevity (Moen et al. 1989) and reduces the 
risk of functional dependency (Morrow-Howell et al. 2003) and mortality (Musick, 
Herzog and House 1999; Oman, Thoresen and McMahon 1999). However, a recent 
study by Lum and Lightfoot (2005) found that although volunteering over age 70 
is associated with improved self-reported health, depression levels and functional 
levels, it does not seem to affect physician-diagnosed medical conditions and 
nursing home residence rate. Their findings suggest that volunteer activities do 
not influence the onset of a medical condition, but help one cope with medical 
conditions by providing social and psychological resources. 
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On the other hand, claims that volunteering is good for health may not always 
be realistic. High levels of volunteer activity may have diminishing returns 
or perhaps even become harmful to health. The paradox is that "voluntary" 
association activity may not be purely voluntary. Voluntarism may help initiate the 
course of action, but once linked to an organization, some degree of obligation 
may ensue (Cress, McPherson and Rotolo 1997). Especially as one considers the 
participation of middle-age adults, the activities of children may place important 
demands on their parents. Multiple roles may be good, but multiple roles may 
also lead to role strain, thereby leading to compromises in physical or mental 
health. In short, one should not assume that voluntary association participation is 
uniformly beneficial to health. 

Estimates of the benefits of volunteering on health may also be influenced 
by social selection (Thoits and Hewitt 2001), which may appear in either of the 
two forms: barrier or compensation. It is possible that health problems can be a 
barrier to continued voluntary association participation. The higher prevalence of 
compromised physical function in later life means that one might be more likely 
to observe this phenomenon among older adults. Indeed, some study shows that 
functional limitations reduce volunteering in later life (Herzog et al. 1989). Failure 
to account for a barrier effect may upwardly bias estimates of the impact of 
volunteering on health, leaving an impression of a stronger positive relationship 
between volunteering and well-being, which is partly augmented by the effect of- 
social selection. 

The second selection process is compensation. Compensation refers to 
behavioral and psychological efforts to maintain adequate functioning in the 
face of loss (Carstensen, Hanson and Freund 1995). Compensation by means of 
volunteer participation is likely when people with health problems seek meaningful 
social involvement to bolster their psychological functioning. In the face of loss, 
people would choose to engage in social roles that are most predictive and 
promote social support, including the feeling of social embeddedness (Lang 
and Carstensen 1994). In essence, this is selective optimization delineated 
in the socioemotional selectivity theory (Carstensen 1992, 1995). Selective 
optimization favors social interaction and involvements that link to self-concept 
and identity. It is particularly relevant in later life when older adults strive for 
maximizing emotional gratification by optimizing social and emotional capital and 
minimizing risks (Carstensen 1995). An analysis by Hendricks and Cutler (2004) 
has suggested that increasing volunteer activity with age reflects its importance 
in maintaining a stable socio-emotional climate in the aging process. Recent 
research shows that depression may spur a new season of volunteering in later 
life, providing evidence of a compensatory effect (Li and Ferraro 2005). “Helping 
others” (or giving instead of receiving) is one way to fight loneliness or depressed 
affect (Ekerdt 1986). In short, volunteering may have health benefits, but health 
problems can also determine the level of volunteer participation. 


Research Questions 


Although benefit mechanisms have been studied the most, there is evidence for 
both barrier and compensation mechanisms, and these mechanisms may exist 
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for physical and/or mental health. It is not clear whether physical functioning 
and depression have distinct influences on volunteering at various life stages. 
The current study is designed to extend the existing research by modeling the 
interrelationships among volunteering, depressive symptoms, and functional 
limitations during middle and late adulthood. We consider benefit, barrier and 
compensation mechanisms for both physical and mental health and give explicit 
attention to selection processes. In consideration of the unique circumstances 
over the life course, this study also pays attention to the potential differential 
influences of social roles on voluntary participation at the two stages in adulthood, 
middle and later adulthood. 

Our analysis is distinctive in several ways. First, the analysis uses three-wave 
longitudinal data to distinguish the three mechanisms. Second, we are unaware 
of any study that considers these mechanisms for both physical and mental 
health outcomes with longitudinal data. Third, this study adds to the existing 
research on the life course effects of social roles on volunteering by comparing 
adults in middle age and later life. Finally, the analytic approach systematically 
accounts for the non-random attrition inherent in any longitudinal study. 

Four research questions guide the analysis: 


1. Does volunteering reduce functional limitations and 
depressive symptoms (benefit mechanism)? 

2. Do functional limitations and depressive symptoms 
decrease(barriermechanism)orincrease(compensation 
mechanism) the likelihood of volunteering? 

3. Are the relationships among volunteering, functional 
limitations and depressive symptoms similar for middle- 
age and older adults? 

4. Do work and family roles promote volunteerism in 
middle and later adulthood? 


Methods 
Sample 


The data for this research come from three waves of the Americans' Changing 
Lives study (ACL; House 1995). It is a multistage stratified area probability sample 
of persons 24 years of age or older who lived in the continental United States.' 
The baseline data were collected in 1986 (N = 3,617) and include an oversample 
of black adults (n = 1,174) and persons 60 years of age or older (n = 1,669). Two 
follow-up interviews were conducted in 1989 (N = 2,867) and 1994 (N = 2,348), 
hereafter referred to as Wave 2 (W2) and Wave 3 (W3). All surveys were face-to- 
face interviews conducted in the home of the respondent. 

Since our research focuses on middle and later life, we analyzed data from two 
groups of people, those 40-59 years old (n — 875) and those 60 years and older 
(n = 1,669). We divided the middle- and older-age groups at age 60 because the 
study over-sampled people 60 years of age and over. Exclusion of the younger 
subjects was also warranted because we could not establish measurement 
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invariance in preliminary analyses that examined three age groups (< 40, 40-59 
and 60+), failing a basic requirement for initiating age-group analysis (Byrne, 
Shavelson and Muthén 1989). Characteristics of young people such as being 
single and raising pre-school age children are social roles that tend to hinder civic 
participation (Oesterle et al. 2004). Moreover, most young people are generally in 
good health. Thus, the ages of the respondents in the analyses which follow are: 
(a) middle-age, 40-59 at Wave 1 (W1), 43-62 at W2, and 48-67 at W3; (b) older- 
age, 60+ at W1, 63+ at W2, and 68+ at Ws. 

In 1989, follow-up interviews were successfully completed on 732 of the 
respondents who began the study when they were between ages 40 to 59 and 
on 1,279 of the respondents who began the study at 60 years of age or older. 
In 1994, approximately 7.5 years after the initial interview, a total of 683 middle- 
age respondents and 889 older respondents who completed interviews in both 
1986 and 1989 were surveyed. The living non-response rates are 14.1 percent 
at the second observation and 13.7 percent at the third observation for middle- 
age adults, and 14.5 percent at the second observation and 10.5 percent at the 
third observation for older adults. By the end of the third observation period, 7.1 
percent of the middle-age adults and 27.5 percent of the older adults had died. 


Endogenous Variables 


Volunteer Work 

Two measures of volunteer work are combined to form a volunteer latent 
construct. The volunteer index was constructed by summing five types of 
volunteer work the respondent did during the past 12 months: volunteering in 
(1) church, synagogue or other religious organization, (2) school or educational 
organization, (3) political group or labor union, (4) senior citizen group, and (5) 
other national or local organization. People were scored 0 if they did not volunteer 
at all and 5 if they volunteered for all types of organizations. Volunteer hours is a 
report of the number of hours the respondent spent doing volunteer work during 
the past year. Whereas 0 was assigned to those who did not participate, the 
other five responses include volunteering for (1) 1-19 hours, (2) 20-39 hours, (3) 
40-79 hours, (4) 80-159 hours, and (5) 160 hours or more. 


Depressive Symptoms 

A depressive symptoms scale was constructed by summing six items from the 
Center for Epidemiologic Studies Depression Scale (CES-D; Radloff 1977) in each 
wave. In ACL, depressive symptomatology consists of 11 items from the CES-D 
scale in each wave. We eliminated the items that reflect interpersonal relationship, 
positive affect and degree of loneliness before building our depressive symptoms 
latent construct. These five items were excluded because they were confounded 
with the social support system, which is volunteer work in this study (Krause, 
Herzog and Baker 1992). The six items used are "I felt depressed," "I felt sad," "I 
could not get going," "I did not feel like eating," "Everything was an effort," and 
"My sleep was restless." All six items have the same response categories. They 
are (1) hardly ever, (2) some of the time, and (3) most of the time. The range of the 
constructed scale is from 6 to 18. We obtained the reliability separately for the two 
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age groups. The coefficient alpha equals to .78 at W1, .77 at W2, and .76 at W3 for 
middle-age adults and .74 at W1, .73 at W2, and .72 at W3 for older adults. 


Functional Limitations 

Three indicators of physical health were summed to create this scale. The first 
indicator, self-rated activity limitation, asks "How much are your daily activities 
limited in any way by your health or health-related problems?” The responses 
range from (1) not at all to (5) a great deal. The other two indicators. were the 
amount of difficulty the respondent had walking blocks and the amount of 
difficulty the respondent had climbing stairs. The categories range from (1) none 
to (5) cannot do. We obtained the reliability separately for the two age groups. 
The coefficient alpha equals to .86, .83 and .85 for the middle-age adults at all 
three waves respectively and .86, .83 and .86 for older adults at all three waves 
respectively. 


Exogenous Variables Measure at W1 


Chronic Conditions 

Each respondent was asked whether he or she has experienced any of 10 health 
problems during the past 12 months. These health problems are arthritis, stroke, 
foot problems, heart attack or trouble, incontinence, diabetes, lung problems, 
hypertension, broken bones and cancer. The range of the variable is from 0 to 7. 


Formal Social Integration 

Two measures of formal social integration were included. Forma/ meeting 
attendance was measured by asking the question: “How often do you attend 
meeting of groups, clubs, or organizations you belong to?” The six response 
categories were coded (1) never (2) less than once a month (3) about once a 
month (4) 2 or 3 times a month (5) once a week and (6) more than once a week. 
Religious service attendance was measured by the question: “How often do you 
usually attend religious services?” This variable has the same coding as formal 
meeting attendance. 


Informal Social Integration 

This is a standardized index constructed by taking the arithmetic means of two 
items: how often in a typical week the respondent (1) talks on the telephone with 
friends, neighbors or relatives; and (2) gets together with friends, relatives or 
neighbors. The responses for the first item range from (1) never to (6) more than 
once a day, and for the second item from (1) never to (6) more than once a week. 


Self-esteem 

To control the influence of self-esteem on depressive symptoms, a standardized 
index was constructed by taking the arithmetic means of the three items from 
Rosenberg's (1989) 10-item global self-esteem scale. The three items were, “| 
take a positive attitude toward myself," "At times I think | am no good at all," and 
"All in all, | am inclined to feel that | am a failure." The responses for these items 
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are from (1) strongly agree to (4) strongly disagree. When the index was created, 
the first item was reverse coded. High values on this index indicate a high level 
of self-esteem. The coefficient alpha of reliability is .58 (.60 for middle-age and 
53 for older adults). The modest reliability of self-esteem should be recognized 
as a limitation of the analysis. At the same time, controlling for self-esteem is 
important because higher self-esteem is likely to be associated with volunteer 
activity. 


Social Roles 

Marital status was coded 1 for those who were married or living together with a 
partner for six months or longer and 0 for all others. Parent was coded 1 for those 
who have children living in the household and 0 for all others.* Employed was 
coded 1 for those employed full- or part-time and O for those not employed. 


Status Characteristics 

Education represented the highest years of schooling completed and ranged from 

0 to 17. Total family income was measured with 10 categories ranging from (1) 

less than $5,000 to (10) over $80,000. Gender and race were two binary variables, 

with female and black each coded 1. Age was measured in years (40-96). The 

sample was stratified into middle-age group (people of age 40 to 59) and older- 

age group (people of age 60 to 96), and the interval measure of age was included . 
in the analyses to control for the variation of age within each age group. 


Missing Data 


We used both single imputation and multiple imputation with the EM algorithm 
available in PRELIS 2 to handle /tem missingness for depressive symptoms, 
which was the only endogenous variable with a considerable number of missing 
values (Jóreskog and Sórbom 1996b). Because our analyses of the data from the 
two imputation approaches yielded almost identical results, we presented the 
results from the single imputation approach. ? Multiple imputation methods may 
be preferable, however, when item-missing data are more extensive (Schafer and 
Graham 2002). For the depressive symptoms items across the three waves, there 
are 33 missing values for the total middle-age sample (n = 875) and 111 for the 
older sample (n = 1,669). After dealing with item-missing data, the functional 
sample size is 867 for middle-age sample and 1,642 for older sample. 

There was considerable nonrandom attrition over the 7.5 years of observation 
- unit missing data. The mean differences of the observed variables used in our 
analysis between the complete and incomplete data reveal the extent of the 
attrition (shown in Table 1). Regardless of age, people having incomplete data 
tend to have more depressive symptoms and functional limitations. People with 
incomplete data also have much lower levels of volunteer participation. They 
are more likely to be black, single, unemployed, poorly educated and with low 
income. People with incomplete data also have more chronic conditions and 
a lower level of both formal and informal social integration. Older adults with 
incomplete data show significantly lower self-esteem compared to older adults 
who completed all three survey waves. 

In order to account for nonrandom attrition, we applied multi-group structural 
equation models with complete and incomplete data (Allison 1987, Bollen 
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Table 1: Range, Means, and Standard Deviations of Variables for Middle-age and Older 
Adults by Pattern of Missingness 


Incomplete Data’ 


Complete Data’ W18W2 W1 Only 

Variables (Range) Middle Older Older Middle Older 

Endogenous Indicators 

W1 Depressive Symptoms (6-18) 8.83* 8.65 9.45*** 9:32" 9.41*** 
(2.47) (2.35) (2.72) (2.70) (2.74) 

W2 Depressive Symptoms (6-18) 8.62 8.46 9.37*** — -— 
(2.47) (2.29) (2.82) 

W3 Depressive Symptoms (6-18) 8.36 8.49 — — — 
(2.35) (2.33) 

W1 Volunteer Index (0-5) .82 WO Ewe rw ‘ola 
(1.07) (1.04) (.89) (.79) (.71) 

W2 Volunteer Index (0-5) .84 18 dr — — 
(1.15) (1.09) (.84) 

W3 Volunteer Index (0-5) .83 AAF — — — 
(1.09) (1.02) 

W1 Volunteer Hours (0-5) 1.27 1.21 E 1665 522 
(1.69) (1.68) (1.43) (1.29) (1.21) 

W2 Volunteer Hours (0-5) 1.27 1.28 .69*** — — 
(1.70) (1.76) (1.40) 

W3 Volunteer Hours (0-5) 1.11 1.05 — — — 
(1.47) (1.56) 

W1 Functional Limitations (3-15) 4.32 52004 OE 5076 iZ 
(2.59) (3.19) (3.92) (3.52) (4.27) 

W2 Functional Limitations (3-15) 4.36 52525 691” — — 
(2.47) (3.03) (3.80) 

W3 Functional Limitations (3-15) 4.71 5.84*** — — — 
(2.68) (3.34) 

Exogenous Indicators (W1) 

Age (40-96) 48.97 EI bü 72.44*** 49.34 72.42*** 
(6.07) (5.79) (7.72) (6.16) (8.16) 

Female (0-1) .63 A m DÉI 57 162% 

Black (0-1) .31 247 1960€ AG*** 3552 

Married (0-1) .69 dos era A9" oe 46: 

Employed (0-1) .78 Lore ES .68** OA 

Children in Household (0-1) .54 Wey 18 155 21 

Education (0-17) 12.13 14:19 9.39*** 11:08:52 9.55 
(3.00) (3.52) (3.68) (8.16) (3.77) 

Income (1-10) 5.50 4.10** 2:055 ABAE 3.027% 
(2.66) (2.49) (2.02) (2.58) (2.12) 

Chronic Conditions (1-7) 1.12 190 Dee 1.34* Zola 
(1.21) (1.33) (1.43) (1.48) (1.48) 

Self-esteem (-4.3-1.2) .002 MATT .02* -.07 -.06** 

(.99) (.97) (.99) (1.18) (1.02) 

Formal Meeting Attendance (1- 6) 2.88 3.01 2.85 2:901 2.42"* 
(1.81) (1.83) (1.91) (1.79) (1.81) 

Religious Service Attendance (1-6) 3.66 3.83 3.70 32020 341"* 
(1.78) (1.75) (1.87) (1.79) (1.82) 

Informal Social Integration (-3.1-1.3) -.11 .06** -.09** -.28* = 26155 
(1.01) (.99) (1.12) (1.10) (1.13) 

N 624 822 437 243 383 


Note: W1 = Wave 1; W2 = Wave 2; W3 = Wave 3. Unweighted sample. N = 1,642 (older-age 
group, 60 years of age and older) and N = 867 (middle-age group, between 40-59 years of age). 
a Significance tests for comparing groups with incomplete data to same age group with com- 
plete data. 

* Significance tests for comparing the older and the middle-age group. 

* Mean (SD). Standard deviations of dichotomous variables are omitted. 

zu 05 nc 0] Sop. < .001 (two-tailed) 
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1989, McArdle and Hamagami 1991). The model features multi-group analyses 
by both age and pattern of missingness, with missing data analysis nested 
within age-group analysis. The analysis uses all available information in the 
data by simultaneously estimating parameters that reflect different pattern of 
unit missingness while constraining certain parameters to be invariant across 
subsamples (Allison 1987). For the older sample, we created three subsamples. 
The complete data group includes older respondents who completed the survey 
in all three waves (n = 822). The two incomplete data groups consist of (1) those 
older respondents who have complete data only on the first two waves and are 
missing on the third wave (n = 437) and (2) the remaining older sample that 
answered the questions in the first wave only (n — 383). For the middle-age 
sample, the data were divided into two subsamples. The complete data group 
includes the respondents who completed the survey in all three waves (n — 624). 
The incomplete data group includes the remaining sample that either completed 
the first two waves (n = 107) or completed only the first wave (n = 136).* 

We estimated structural equation models using the fully constrained 
maximum-likelihood fitting function in LISREL 8.53 (Jóreskog and Sórbom 
1996a). The estimated matrix is an augmented moment matrix calculated from 
a covariance matrix and a vector of means. The general LISREL model that 
accounts for the incomplete data follows the procedures outlined by Allison 
(1987) and Bollen (1989). This entails "inputting pseudo-values for the missing 
sample moments and specifying fixed values for certain factor loadings and 
errors variances" to accommodate different patterns of missingness across 
subsamples (Allison 1987: 81).* 


Analytic Plan 


Our structural equation model is a recursive model that tested the lagged effects 
of the repeated measures of volunteering, depressive symptoms, and functional 
limitations on all three measures at the following waves. The model included the 
direct lagged effects from W1 repeated measures to W3 repeated measures. 
Figure 1 gives a simplified view of the structural relationships as it summarizes 
only the significant structural relationships among the endogenous variables 
for both middle-age and older-age groups. The tested model also included all 
exogenous variables listed in Table 1. No directional relationships were specified 
among endogenous variables in each wave, but the error covariances among the 
endogenous variables (‘¥) within each wave were allowed to correlate. The errors 
in the same indicators across time (0,) were also allowed to correlate to account 
for the autocovariance. 

In this hypothesized model, the negative lagged effects through the paths 
linking volunteer work in the previous waves to either functional limitations 
or depressive symptoms would reveal the benefit effects. The barrier effects 
would be manifest by negative lagged effects linking functional limitations and/or 
depressive symptoms in the previous waves to volunteer work. Positive lagged 
effects linking functional limitations and/or depressive symptoms in the previous 
waves to volunteering would provide evidence for compensation. 


Results 


Table 1 lists the ranges, means and standard deviations of all observed variables 
used in the analysis. Older adults have more functional limitations than middle- 
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Figure 1. Structural Relationships among Volunteering, Depressive Symptoms and Func- 
tional Limitations 
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Note: W1 = Wave 1; W2 = Wave 2; W3 = Wave 3. 

Error covariances among the endogenous variables (W) at each wave were omitted to simplify 
the presentation. 

Nine direct paths from W1 latent constructs to W3 latent constructs were included in the 
tested model 

The non-significant paths were omitted in the Figure. 

All exogenous variables listed in Table 1 were included in the tested model. 


——=> Path significant for both age groups at p « .05 
— —— Path significant for old-age group only at p < .05 
-----?* Path significant for middle-age group only at p < .05 


age adults across all three waves. The only age-group difference in volunteering 
was for the volunteer index at W3, with higher values for middle-age respondents. 
Among the control variables, all means except for means of the indicators for 
formal social integration are significantly different between the two age groups. 
While most age differences are expected, itis interesting to note that older people 
tend to have higher self-esteem and informal social integration than do middle- 
age people. The correlations between volunteer index and volunteer hours (not 
shown) are consistently strong across all three waves. The correlations among 
the endogenous latent variables (not shown) are generally similar across the two 
age groups." 

Our hypothesized model showed excellent model fit as indicated by chi-square 
statistic (x? [733] = 1899.23) and other measures of goodness of fit: root mean 
square error of approximation is .00, normed-fit index is .99, and comparative 
fit index is .99 (Kelloway 1998).8 Parameter estimates are presented in Table 
2 (endogenous relationships) and Table 3 (relationships between exogenous 
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covariates and endogenous volunteering variables). Findings from the final model 
regarding the endogenous relationships will be described first. Beside paths 
linking repeated measures over time, the paths depicting the effects of functional 
limitations at W1 on depression at W2 and W3 are the only two that are both 
positively significant for both age groups. Although depression at W2 predicts 
volunteering at W3 for both age groups, their effects are in exactly opposite 
direction (shown in Table 2). Figure 1 also reveals that more structural relations 
are significant for the older-age sample than for the middle-age sample. 

Columns 1 through 3 in Table2 reflectthelagged effects of baseline endogenous 
variables on these same variables at W2 while controlling for all the covariates. 
The strong positive coefficients for the repeated measures indicate greater 
likelihood of depressive symptoms, volunteering, and functional limitations over 
time. Within the older-age group, we observed significant beneficial effects of 
volunteering on both depressive symptoms and functional limitations. Neither 
depressive symptoms nor functional limitations at W1 had a significant effect on 
volunteering at W2, probably suggesting that social selection due to poor health 
is minimal. In addition, the model identified jointly positive effects between 
depressive symptoms and functional limitations over time. It suggests a cycle of 
decline — the repercussion of worsening mental health is the decline in functional 
health and vice versa. By contrast, parallel equations for the middle-age group 
revealed fewer significant structural relationships. Except for coefficients for the 
repeated measures, the only significant coefficient was the positive effect of 
functional limitations on depressive symptoms. 

Columns 4 to 6 of Table 2 display the equations for W3 latent dependent 
variables. In the panel for the older-age group, volunteering at the previous waves 
did not directly affect depressive symptoms. However, volunteering at W1 exerted 
a significant total effect on depressive symptoms at W3, providing evidence for 
the long-term benefit of volunteering on mental health. Volunteering at W2 also 
substantially reduced the risks of functional limitations among older adults at W3, 
reinforcing the benefits of volunteer participation on health outcomes. Depressive 
symptoms showed a positive effect on volunteering, suggesting a compensatory 
mechanism. Functional limitations at W1 had a long-term total effect on depressive 
symptoms at W3 whereas the reverse relationship did not exist. 

Although weaker structural relationships exist in the sample of middle-age 
adults, two significant patterns are noteworthy. First, depressive symptoms at 
W2 moderately decreased volunteer participation at W3. In contrast to being a 
means of compensation in older adulthood, depressive symptoms operated as a 
barrier to volunteering in the middle adulthood. Second, functional limitations led 
to an increase in depressive symptoms at all following waves. 

Table 3 presents parameter estimates for the relationships between the 
exogenous variables and volunteering latent constructs at all three waves (from 
the same structural equation model as reported in Table 2). W1 social roles (being 
employed, having children in household and being married) appeared to have 
limited impact on volunteer participation. Altogether, only 3 out of 18 coefficients 
are statistically significant. Being employed and being married helped promote 
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Table 2: Maximum Likelihood Estimates of the Relationships among Volunteering, Depres- 
sive Symptoms, and Functional Limitations 


Wave 2 Wave 3 


Variables 1 2 3 1 2 2 
EE ATT SA 


Wave 1 Measures 


1. Depressive Symptoms 3542-0006 .088** 227/0252 -.016 -.048 
(.028) (.009) (.034) (.033) (.012) (.046) 
—' —' — .961"*^  -.006 -.019 
2. Volunteering -.219* 073°” -.243* -.195 ALES .109 
(.098) (.032) (.118) (.148) (.054) (.204) 
— — — -.269* 502 200 

3. Functional Limitations .080*** .000 AT1"* .009 -.002 272 D 
(.020) (.007) (.025) (.026) (.009) (.035) 

— — — .051* ^ -.006 .431** 

Wave 2 Measures 

1. Depressive Symptoms — — — .245*** 1041535 -.010 
(.032) (011) (.044) 
2. Volunteering — -— — -.013 1041755 -.370* 


(435) (049) (.186) 


3. Functional Limitations — — — .047 -.017 1340552 
(.027) (.009) (.037) 
b b b 


Pseudo R? .306 .576 .466 .387 .588 .469 
Middle-age Group (N = 867) 
Wave 1 Measures 


1. Depressive Symptoms A45*** -.008 021 Os .014 -.051 
(.028) (.009) (.034) (.040) (.016) (.044) 
— — — Apu -.003 -.017 
2. Volunteering -.052 .664*** -.157 .003 296°* -.028 
(.098) (.032) (.118) (.015) (.060) (.167) 
— — — .007 AT2"* -.013 
3. Functional Limitations 1094 2 -.000 .490*** -.007 -.010 120955 
(.020) (.007) (.025) (.039) (.015) (.044) 
— — — 107525 -.003 .974^** 


Wave 2 Measures 
1. Depressive Symptoms — — — SS -.035* .063 
(.040) (.016) (.044) 
b b b 


2. Volunteering — — — .058 OE .107 
(.128) (.051) (.143) 
b b b 


3. Functional Limitations — — — Side .021 Enn 
(.044) (.017) (.049) 
b b b 


Pseudo R° .430 .527 .583 422 465 480 


Note: Variables controlled are age, sex, race, education, income, marital status, employment sta- 
tus, having children in household, chronic conditions, self-esteem, formal meeting attendance, 
church attendance and informal social integration measured at Wave 1. Direct and total effects 
for each of these are presented in Table 3. 

Model estimates: x2 = 1899.23; df = 733; root mean square error of approximation = 0.0; 
normed fit index = .99; comparative fit index = .99. 

* Direct effects: unstandardized coefficients, standard errors in parentheses. 

> Total effect omitted as it is the same as the direct effect. 

* Total effects. 

* p € .05 ** p «€ .01 *** p « .001 (two-tailed) 
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Table 3: Maximum Likelihood Estimates of the Relationships between Exogenous and 
Endogenous Volunteering Variables 


- Volunteering (Middle-aged) Volunteering (Olde) — 
Variables Wave 1 Wave 2 Wave 3 Wave 1 Wave 2 Wave 3 
Employed .057* -.093 .204** .043 -.028 -.034 
(.071) (.085) (.080) (.049) (.051) (.061) 
adi PE: 1002555 E AES c 
Child in Household .010 .022 .014 .042 .005 -.133* 
(.059) (.069) (.064) (.049) (.051) (.061) 
Married .086 .004 au .031 .032 -.05 
(.064) (.074) (.069) (.043) (.045) (.054) 
E —° .148** — — — 
Age -.010* -.008 Oa -.004 -.008** 0425 
(.005) (.006) (.005) (.003) (.003) (.004) 
= -.015*** — E -.011** -.019*** 
Female 017 021 102 -.013 -.000 -.017 
(.057) (.067) (.063) (042) (.044) (.053) 
— — MS — —° — 
Black -.082 -.060 .099 -.076 -.041 -.018 
(061) (.071) (.067) (.046) (.047) (.057) 
— -417* — — — — 
Education 050 -.001 .025* 047157 1022 "° 10235 
(.011) (.013) (.012) (.006) (.007) (.008) 
E 1032058 SH = .050*** 0397 
Income .018 .022 -.003 .030** -.005 .006 
(.014) (.016) (.015) (.011) (.011) (.013) 
mt .034** pev ip ad afe 
Chronic Conditions .021 -.025 -.016 .007 -.006 -.024 
(.023) (.030) (.029) (.014) (.016) (.020) 
Self-esteem .076** .055 .005 .048* -.012 .011 
(.026) (.033) (.031) (.019) (.021) (.026) 
= igs .066* E — => 
Formal Meeting mee .034 .029 .166*** .040** E0322 
(.017) (.021) (.020) (.012) (.013) (.016) 
= EAS ors I ka lee 
Church Attendance SITO .050* .015 076" .038** -.002 
(.017) (.020) (.019) (.012) (.012) (.015) 
E! 23555 0825 = .090*** .059*** 
Informal Social Integration .056* -.035 .044 0557 .003 -.041 
(.027) (.031) (.030) (.019) (.020) (.024) 
— — .056* E => s= 


Note: Exogenous variables were measured at Wave 1. Coefficients and standard errors obtained 
from the same model as in Table 2. Model estimates: x? = 1899.23; df = 733; N = 1,642 (older- 
age group) and N=867 (middle-age group). 

* Direct effects: unstandardized coefficients, standard errors in parentheses. 

> Total effect omitted as it is the same as the direct effect. 

* Total effect omitted as it is not statistically significant. 

3 Total effects. 

*p«.05 **p<.01 **p<.001 (two-tailed) 
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long-term volunteer participation for middle-age adults. Children living in the 
household were associated with less volunteer participation among older adults. 

The analysis also revealed that education, church attendance, self-esteem and 
social integration are associated with greater volunteer participations for people 
of all ages. Uniquely in middle-age, black people were less likely to volunteer 
while women were more likely to volunteer over time. Age was associated with 
a decline in volunteering in the middle-age group at both W1 and W2 but an 
increase in volunteering by W3. It probably suggests that middle-age persons 
are decreasing the overcommitment common to this life stage, but seeking new 
associations as they become senior citizens. It was also clear that advanced age 
is associated with declines in volunteering. 


Discussion 


Volunteering is a critical part of the social organization of modern societies, but 
the involvement in and benefits from participation in voluntary associations are 
likely to change during adulthood. The objective of the present study is to better 
understand the barriers to and benefits from volunteering that change over the 
adult life course. The findings demonstrate important life course shifts in the 
barriers and benefits associated with volunteering. In the sample of older adults, 
benefit and compensation mechanisms characterized the structural relationships. 
In the sample of middle-age adults, depressive symptoms were shown to be a 
barrier to volunteer participation over time. 

This study provides crucial evidence forthe existence of differential relationship 
between volunteering and health outcomes over the adult life course (Musick 
and Wilson 2003, Van Willigen 2000). We found that volunteering contributes 
to better mental health in the older age group, and at the same time, reduces 
the pace of functional decline. The salubrious effects of volunteering underscore 
the benefits of continued social engagement in later life for both physical and 
mental health. In comparison, we did not find a benefit mechanism operant in 
the sample of middle-age adults. The lack of health benefits may be linked to 
the role context and volunteer motives in middle adulthood. Middle-age people 
occupy more extensive social roles than older adults, so volunteer work may not 
be as central a component among these multiple social roles. Volunteer work 
may even place a burden upon busy lives if it is added on top of family and work 
responsibilities. Middle-age adults are also more likely to volunteer for extrinsic 
causes that do not promote health (Omoto et al. 2000). As middle-age adults are 
generally in good health, we cannot rule out the possibility that the salubrious 
influence of volunteering may be too modest to be detected in these data. 

Our research has also demonstrated that mental health is a very important 
resource that influences social engagement over the life course. The finding 
that depressive symptoms are a health barrier in middle adulthood provides 
evidence for social selection (Wethington et al. 2000). Middle-age people who 
are less depressed are more likely to volunteer and thereby enjoy the benefits of 
being volunteers (Rietschlin 1998). The pattern does not hold in later life. Rather, 
depressive symptoms spark volunteering in later life as a means of compensation. 
Older adults are subject to greater risk of depression in the face of rapid functional 
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and cognitive decline, and role losses such as widowhood (Umberson, Wortman 
and Kessler 1992). Role losses and attenuated social relations may make volunteer 
activities an attractive means of compensation. The present research did not, 
however, uncover a compensatory effect of volunteering in middle age. The roles 
occupied in middle age may render volunteer activities less consequential to 
well-being than for older people who have undergone major role changes such 
as retirement and widowhood. 

The finding that functional limitations are not a barrier to volunteer participation 
should be interpreted with caution. First, middle-age people in the ACL study 
experienced far fewer functional limitations than older people (see Table 1), partly 
because we used the measures of the lower body function available in the data. 
Measures of functional limitations such as levels of difficulty walking blocks and 
difficulty climbing stairs are generally more appropriate indicators of physical 
functioning for older adults than for middle-age adults. Thus, it is possible that 
the analysis may have underestimated the effect of functional limitations on 
outcomes because of the relatively small proportion of middle-age persons with 
lower body limitations. Second, also shown in Table 1, the means of functional 
limitations for incomplete data at all three waves were substantially higher than 
thosefor complete data, suggesting that non-random attrition has struck unusually 
hard on the measures of the lower body function and may have attenuated the 
estimated effect of functional limitations on volunteering more than it should be. 
As a result, the analysis only showed slight evidence that functional limitations 
prevent older adults from volunteer participation over time (b = -.017, p < .10). 

We also found that social roles are more likely to promote volunteering in middle 
age than in later life. The parental role that has been considered a stimulant for 
volunteer activities appeared to be a barrier to volunteer participation in later life. 
Our findings suggest that social roles are related to different expectations and 
activities as people age. It is also plausible that social role absence rather than 
social role occupancy becomes a more important predictor of volunteer activities 
in later life as people experiencing major role-identity absence may take on a 
volunteer role as a means of social engagement (Greenfield and Marks 2004). 
Indeed, early research has shown that some forms of role loss (e.g., retirement) 
lead to an increase in volunteer work (Mutchler et al. 2003). 

The analyses presented herein demonstrate that the relationships between 
volunteering and health vary over the adult life course. Volunteering may be 
salubrious, but the results from the present analysis show that this is for older 
adults only. The fact that the barrier mechanism was identified for middle- 
age adults and compensatory mechanism for older adults, however, suggests 
more attention be given to social selection processes on the study of social 
engagement and health.? The analytic approach used herein also makes clear 
that both social and sample selection processes are important for this line of 
inquiry. We applied multi-group structural equation models by both age groups 
and patterns of missing data. By modeling both the complete and incomplete 
data, the analysis accounts for the considerable nonrandom attrition in ACL data 
over the nearly eight years of observation. 


The comparison of results from accounting forthe incomplete data patterns with 


r 
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those using only the complete sample (list-wise deletion of cases) was revealing. 
The results for the middle-age group were similar, but for the older-age group, 
significant beneficial effects of volunteering at W1 on depressive symptoms and 
functional limitations at W2 were identified after analyzing the incomplete data. 
Baseline physical limitations also significantly increased depressive symptoms 
at W2. Our results show that analyzing the biased sample (complete data over 
three waves) is likely to result in an underestimation of the true relationships 
(Berk 1983). The fact that we obtained varied results for middle-age sample and 
older sample after handling missing data suggests that nonrandom attrition may 
be particularly serious in the sample of older adults. Future research using panel 
data of older adults needs to account for nonrandom attrition in order to avoid 
drawing misleading conclusions. 

In our research, the use of lower body function to measure functional limitations 
is a limitation that needs to be considered. Another limitation is that the differential 
interrelationships among volunteering, depressive symptoms and functional 
limitations between the two age groups may reflect either age changes or cohort 
differences due to historical and social changes (Elder, Johnson and Crosnoe 
2003; Herzog et al. 1989).'° In our longitudinal sample, we could not isolate the 
age effect from historical influences on the two age cohorts. Therefore, these 
results must be interpreted with caution. " 

Finally, an important aspect of the life course perspective is how life contexts may 
affect attitudes, behaviors, resources and well-being (Elder 1985). The association 
between volunteer participation and health may change with age, but more drastic 
changes may occur with life transitions (George 1993). Those dynamic processes 
of role entry and role exit at different life stages influence social participation as well 
as health. The age-group comparison in this study is a first step toward explaining 
volunteering and health dynamics over the adult life course. We welcome others 
to confirm or refute the findings on the benefit, barrier and compensation effects 
observed here. It is clear that volunteering is a more important and consequential 
social activity in later life, but it is harder to maintain. 


Notes 


1. The ACL survey is based on a multistage area probability sample. Identifica- 
tion of the ACL cross-sectional sample respondents was conducted using 
a four stage sampling process - a primary stage sampling of U.S. Standard 
Metropolitan Statistical Areas (SMSAs) and counties, followed by a second 
stage sampling of area segments, a third stage sampling of housing units 
within sampled area segments and concluding with the random selection 
of a respondent from selected housing units. The sample were stratified 
primarily by SMSA and county, and secondarily by area segment, and finally 
by housing unit (House 1995). 


2. The age of children living in the household is unfortunately not available in the 
data set. With our focus on middle- and later-life, however, the vast majority 
of subjects who are parents are unlikely to have young children in the home. 
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di 


The single imputation method substitutes a case with a value that is ob- 
tained from another case which has a similar response pattern over a set of 
matching variables. PRELIS 2 imputes only missing values that have match- ` 
ing cases, and the variance ratio in the imputation is small. We did not im- 
pute missing values for the respondents who answered fewer than four of 
the six CES-D items in any of the three waves. We used a standardized score 
that was derived from taking the arithmetic mean of the 11 items as the 
matching variable (Radloff 1977). Thus, 27 cases were assigned values and 
restored to the middle-age sample and 90 cases to the older sample. 


We did not divide the incomplete data into two groups because if we did, 
the number of cases in each incomplete data group would be too small in 
relation to the number of estimated parameters. Also, when we divided the 
incomplete sample and tested mean differences, we found no significant 
differences between the two incomplete samples (p > .05). Within each age 
group, subsamples have common as well as distinct sets of variables. For 
the older-age sample, all variables are present in the complete data group. 
These variables measured at W3 do not exist for the data group that is miss- 
ing on the third wave. The same variables measured at both W2 and W3 
do not exist for the data group that is missing on both W2 and WS. As for 
the middle-age sample, all variables are present in the complete data group 
whereas same variables measured at both W2 and W3 do not exist for the 
incomplete data group. - 


Pseudo-values were assigned to the covariance matrices and the mean vec- 
tors for the incomplete data groups to make their sizes exactly the same as 
those for the complete data group within each age group. The estimate for a 
missing variance was assigned a value of one whereas the estimates for the 
respective missing covariances and means were assigned zero. All specified 
factor loadings in the A, matrix and specified error variances and covariances 
in the O, matrix were freely estimated in the complete data group. In the 
incomplete data groups, fixed values were assigned to the missing variables 
in the A, and 6, matrices. In the A, matrix, each element in row / was fixed 
at zero for a particular missing y, variable. In the 6, matrix, the variance of e, 
was fixed at one while the covariances associated with e/ were fixed at zero. 
The free parameters in the A, matrix for the incomplete data groups were 
constrained equal to the equivalent parameters in the complete data group. 
Other invariant parameters include parameter estimates (B) and errors (Y) 
across complete and incomplete groups (Bollen 1989). Between age groups, 
the only invariant parameters are factor loadings for the endogenous latent 
constructs in the A, matrix. The invariance of parameters in the @ matrix 
was tested by the likelihood ratio chi-square statistic of constrained model 
against a less restrained alternative. Parameters were allowed to vary to ac- 
count for group differences and to improve the model fit. 


The correlations between volunteer index and volunteer hours are .78 for the 
older age group and .76 for the middle age group across all three waves. The 
distributions of volunteering variables for both age groups show that nearly 
half of the volunteers in both age groups participate in more than one type of 


Volunteering in Middle and Later Life * 515 


organization. Volunteers who work for more organizations show higher mean 
volunteer hours than people who volunteer for only one type of organization. 


7. The correlations between depressive symptoms and volunteering range 
from -.13 to -.24 and are stronger at W3. The correlations between depres- 
sive symptoms and functional limitations are generally stronger and more 
consistent by age group and time of measurement, ranging from .43 to .46. 
The correlations between functional limitations and volunteering range from 
-.16 to -.26 and are strongest for older respondents at W3. 


8. All paths for the measurement structure of volunteering variables in all three 
waves (^) are significant (p « .001). The ^, values for each item are the same 
across time because equality constraints were placed upon repeated indica- 
tors to ensure consistent measurement for change. 


9. Supplementary analyses in which we omitted paths for selection processes 
(i.e., barrier and compensation mechanisms) resulted in models for which 
the values for goodness-of-fit were significantly poorer than the models pre- 
sented herein. 


10. Respondents were born before or during 1946 and represent a "long civic 
generation" in American history (Putnam 2000). Social change during the ACL 
follow-up period is also important. The survey waves covered three presi- 
dential administrations, and the rhetoric regarding the need for volunteerism 
varied considerably over this time. 


11. We also considered using an accelerated longitudinal design to separate age 
and cohort effects, and looked at the work of Duncan, Duncan and Hops 
(1994) and Miyazaki and Raudenbush (2000), but concluded that cohort-se- 
quential or accelerated longitudinal designs based on growth curve models 
may not be optimal for these data. First, growth curve analysis can be com- 
pleted with three waves of data, but the methods are ideally suited for more 
measurement occasions. Second, the cohort-sequential design requires 
that the number and spacing of assessments be the same for all individuals 
(Duncan et al. 1994: 589). Thus, we were unable to create birth cohorts for 
each age because of the unequal spacing across three waves of ACL data. 
To address the implications of some middle-aged respondents moving into 
the older age group (ages 60-67), we compared the means for volunteer 
index and volunteer hours for the subjects who had age overlap and did not 
find the two cohorts significantly different in the levels of volunteering in 
the same age range (between 60 to 67 years; p > .05). This result suggests 
minimal impact due to the movement of some middle age respondents into 
the older age group. Thus, we believe it is reasonable to interpret the results 
as principally due to an aging effect. 
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When Does Social Capital Matter? Non-Searching For Jobs 
Across the Life Course 


Steve McDonald, North Carolina State University 
Glen H. Elder, Jr., University of North Carolina-Chapel Hill 


Abstract 

Non-searchers — people who get their jobs without engaging in a job search — are often 
excluded from investigations of the role of personal relationships in job finding processes. 
This practice fails to capture the scope of informal job matching activity and underestimates 
the effectiveness of social capital. Moreover, studies typically obtain average estimates of 
social capital effectiveness across broad age ranges, obscuring variation across the life 
course. Analysis of early career and mid-career job matching shows that non-searching is 
associated with significant advantages over formal job searching. However, these benefits 
accrue only during mid-career and primarily among highly experienced male non- 
searchers. The results highlight the need to examine life course variations in social capital 
effectiveness and the role of non-searching as an important informal mechanism in the 
maintenance of gender inequality. 


Social capital theorists have long argued that personal relationships provide 
people with labor market opportunities. Conventional wisdom suggests that by 
relying on personal contacts with friends, relatives and acquaintances, workers 
are able to find employment that might not be readily accessible through more 
formal job search channels, such as reading the want ads or applying directly to 
employers. In this way, informal job searching affords certain advantages over 
formal job searching. Further, the benefits of relying on personal contacts are 
likely to be proportional to the quality of social resources available in people's 
social networks. Because people vary in the extent to which their social networks 
contain valuable social resources, these embedded resources constitute social 
capital (Lin 2001b). People are able to invest in their social capital by "networking" 
and inquiring about potential job openings in order to receive benefits in the labor 
market (Lin 2001b). People with the greatest social capital receive the greatest 
benefits from using personal contacts to get jobs (Lin 1999). 

The benefits of using personal contacts are less apparent when examining 
evidence from survey research. First, people in the most disadvantaged social 
groups are most likely to use personal contacts when job searching, despite the 
fact that they are least likely to benefit from them (Lin 1999). Second, using only 
personal contacts to search for jobs does not provide any benefits in terms of 
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wages or job prestige. In fact, personal contact users often receive lower income 
jobs than people who search for their jobs through more formal means (Elliott 
2000, Green et al. 1999, Smith 2000). These findings have led some to question 
whether social capital really matters for job outcomes (Mouw 2003). 

These findings are puzzling because the benefits of relying on personal 
contacts in the labor market appear obvious to most casual observers. The 
problem is that personal contact use does not fully capture the extent of informal 
job matching activity in the labor market. Most research in the social networks 
literature focuses exclusively on active job seekers and therefore fails to consider 
non-searchers — people who get their jobs while not actively looking for work. 
Non-searchers tend to receive unsolicited job information from personal and 
professional contacts through routine conversations (Granovetter 1995, Lin 
2000). As such, non-searchers are informally matched to their jobs and represent 
the "invisible hand of social capital." (Lin 2000) Because non-searchers tend 
to receive better jobs on average than both formal job seekers and personal 
contact users, the exclusion of non-searchers from analyses downwardly biases 
estimates of the effectiveness of social capital. 

Another problem with the current research in this area is that studies have 
yet to adequately account for variations in social capital effectiveness across 
the life course. Most examinations of informal job matching assess the average 
effectiveness of personal contact use across samples with broad age ranges, 
paying little attention to age and career stage heterogeneity within those samples. 
The occupational histories and personal biographies of workers influence both 
the likelihood and effectiveness of informal job matching. In addition to asking 
whether social capital matters, we should ask when social capital matters, how it 
matters, and under what conditions. 

Given these two limitations — the exclusion of non-searchers and the lack of 
attention to variation overtime-—a re-evaluation ofthe effectiveness of social capital 
on job outcomes is in order. In this study, we examine non-searchers alongside 
personal contact users to assess the extent to which people with the greatest 
social resources are more likely to get their jobs through informal job matching 
and whether this process leads to better job outcomes. The effectiveness of 
informal job matching is examined across two points in the work career (early in 
the career and during the middle phase) in order to reveal contingencies in the 
effectiveness of social capital across life stages. The findings show that social 
capital matters very little early in the work career. However, during mid-career, 
people with the greatest social resources are more likely to get their jobs without 
searching, and non-searchers tend to receive better jobs than formal job seekers. ' 
These payoffs mainly accrue to male non-searchers who have a great deal of 
work experience. The results highlight the necessity of examining variations in 
social capital effectiveness across people's lives, since informal job matching is 
embedded in a broader process of social capital accumulation over the entire life 
course. The results also suggest that non-searching may serve as an important 


informal mechanism in the maintenance and reproduction of gender stratification 
in the labor market. 
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Excluding Non-Searchers 


The social networks literature has tended to focus on job searching rather than 
job matching (Granovetter 1995, Hanson and Pratt 1991). As such, most research 
in this area excludes people who did not search for their jobs. Non-searchers 
constitute a substantial proportion of workers (—30 percent) and most receive 
their job information through personal and professional contacts. Between 73 
percent (Granovetter 1995) and 83 percent (Hanson and Pratt 1991) of non- 
searchers receive their job information from someone they know. These estimates 
are probably conservative because many non-searchers could be unknowingly 
linked to the people who provided the job information. In the end, non-searching 
may serve as "a proxy for finding jobs through personal contacts...since jobs 
that ‘fall into your lap’ are unlikely to do so without some personal intermediary.” 
(Granovetter 1995:145)' 

The emphasis on goal-directed, instrumental action in recent versions of 
social capital theory may explain why non-searchers have been left out of most 
empirical and theoretical analyses. Initial conceptualizations of social capital 
included all “actual or potential [social] resources.” (Bourdieu 1986:248) However, 
this formulation was criticized for applying the term “capital” to attributes that 
are acquired “automatically” (Baron and Hannan 1994) Financial capital, in 
the classical sense, does not refer to all available resources, but only to those 
resources that are invested in order to make a profit (Marx [1849]1978). Human 
capital is similarly defined as conscious investments in education, health and 
Skills with expectations for market returns (Becker 1993). Consequently, recent 
formulations of social capital theory (see Lin 20018, b) emphasize the role of 
instrumental action as a prerequisite for social capital. In other words, people 
must mobilize their social resources through a job search in order to activate 
their social capital. To do this, people engage in "networking" — attempting to 
make contact with those of higher status and authority in order to obtain useful 
job information (Lin 20013). 

By embedding social capital theory within the context of an investment model, 
Lin provides analogous conceptual language with other formulations of capital. 
But non-searching is problematic from this perspective. Because non-searchers 
are, by definition, not looking for work, it becomes difficult to characterize 
their gained access to employment information as the result of their purposive 
attempts to get a new job. But instrumental action is not impossible for non- 
searchers. Workers may engage in a variety of networking activities, such as 
attending conferences and professional meetings without actively looking for 
work, and then receive unsolicited job information as the direct or indirect result 
of these activities. 

Unfortunately, little is known about the extent to which non-searchers' receipt 
of job information results from purposive networking or from simply being in the 
right place at the right time. Still, it seems clear that, "information, unlike most 
other commodities, may often be acquired as the by-product of other activities." 
(Granovetter 1981:248) In other words, "investments" in social capital are not 
always made with instrumental goals in mind. Granovetter (2003) explains that 
the context of social interaction often includes an intersection of labor market 
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institutions and "expressive socialization routines." As he puts it, "information 
about jobs does pass among partygoers." (38) This intersection produces a 
mixture of economic and social motivations in market relationships that cannot - 
be described by self-interest alone. 

Regardless of the reason for the omission, excluding non-searchers from 
analyses of informal job matching processes results in sample selection bias 
(Granovetter 1995), as personal contact users only constitute part of the informal 
matching activity. This becomes consequential when considering the overall 
effectiveness of social capital on employment outcomes. Preliminary evidence 
on non-searchers suggests that they receive higher annual incomes (Granovetter 
1995) and higher wages (Elliott 2000) than active job seekers. Non-searchers are 
also more likely than active searchers to obtain managerial jobs as opposed to 
other jobs (Elliott 2000, Granovetter 1995). 

Lin (2000) has suggested that the labor market advantages of non-searching 
represent the invisible hand of social capital. A worker who maintains a resource- 
rich network of contacts is more likely to receive useful job information "through 
routine exchanges and without actively seeking such information." (192) In other 
words, job information and offers come to people with the best social networks, 
while others need to search for their jobs. Lin's thesis is corroborated by his 
recent study (Lin 2003), which finds that workers who reported receiving job 
information through routine personal exchanges during the past year had superior 
network resources over other workers. Workers who received job information ` 
through routine exchanges were also employed in jobs with greater occupational 
prestige, more supervisory power and higher incomes than other workers. 

As a result, when non-searchers are excluded from analyses, assessments of 
the effectiveness of social capital are downwardly biased. To illustrate, consider 
the weighted frequency of non-searching and personal contact use across 
different job class categories in the 1998 NLSY displayed in Figure 1. Personal 
contact use among active job seekers is most prevalent among general workers 
and least prevalent among managers, while non-searching is much more likely 
to lead to managerial jobs. Removing the non-searchers from the graph would 
present the illusion that informal job matching is negatively associated with 
job class. In reality, informal job matching more closely reflects a U-shaped 
distribution (Boxman et al. 1991, Lin 2000), with non-searchers filling the bulk of 
the most desirable positions in the labor market and non-searchers and personal 
contact users filling the least desirable jobs about evenly. 

This bias is reflected by findings in the research literature. Data from 
surveys of individual workers (MCSUI, NLSY) consistently show that the most 
disadvantaged workers (less educated, blue collar workers, African Americans, 
etc.) are the most likely to rely on personal contacts to search for jobs (Lin 1999), 
and the use of personal contacts is often associated with lower income jobs 
than those received through more formal job searches (Green et al. 1999, Smith 
2000). This evidence has been used to call into question the extent to which 
social capital matters for job outcomes (Mouw 2003). However, research on the 
hiring practices of individual firms (Fernandez and Weinberg 1997, Fernandez et 
al. 2000, Petersen et al. 2000) demonstrates that workers who are referred by 
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Figure 1. Informal Job Matching by Class of Current Job, Mid-Career (Age 33-41) 
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current employees are much more likely to be hired by the firm than people who 
were not referred. The inconsistent findings between the two types of studies 
can be explained by the fact that analyses of individual firms include all workers 
matched to their jobs, while studies of individual workers include only workers 
who searched for their jobs. 


Social Capital and the Life Course 


Another problem in assessing the effectiveness of social capital is the failure to 
adequately consider variation across the life course (McDonald 2005). Prevailing 
theories of social capital (e.g., Burt 1992, Lin 2001b) do not account for how 
social capital accumulates and transforms labor market opportunities over time. 
Social capital is recognized as a valued resource in society, one that is distributed 
hierarchically (Lin 2001b). As such, inequality in social capital has been a central 
theme (see Lin 2000) and much attention in the literature has focused on the 
distribution of social capital resources across various ascriptive characteristics 
and occupational hierarchies. However, few have considered the role of the life 
course in shaping social capital resources. 

Life course theory provides a framework that is essential for the study of social 
capital and employment outcomes (McDonald 2005). This perspective recognizes 
that lives are lived interdependently and have the potential for dramatic change 
over time (Elder 1998). The impact of any particular event is dependent on the 
life stage at which it is experienced (Elder 1994). For example, early entrance into 
military service can serve as a springboard to success in later life (Sampson and 
Laub 1996), but for people in their 30s, entrance into military service can disrupt 
family formation and career advancement and eventually lead to higher rates of 
marital dissolution, income loss, physical decline and even early death (Elder 
et al. 1994, 1997). Examples of the life stage principle are also apparent in the 
area of employment. The extent to which workers achieve earnings benefits by 
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changing jobs is contingent on the age at which that change occurs (Spilerman 
1977). In this way, personal biographies provide a context for jobschanges (Heinz 
2003). The meanings and consequences of work transitions depend on the timing 
of the change. 

Likewise, a person's lifetime experiences structure the character of their social 
networks (Fischer and Oliker 1983), which becomes consequential for the kinds 
of jobs they receive. Generally, increases in work experience lead to a broad set 
of work contacts (Bridges and Villemez 1986), expanding the set of labor market 
opportunities available to an individual. Consider the hypothetical difference 
between the social networks of younger and older workers. A young worker whose 
primary contacts consist of roommates who work at the local fast food restaurant 
will have a vastly different set of opportunities than a middle aged worker whose 
contacts are managers and professionals. As a result, relying on personal contacts 
can lead to different outcomes depending on a persons life stage. 

Social network resources are also structured by gendered work trajectories. 
Women are more likely than men to experience work career instability (Williams 
and Han 2003), leading to a deficit in their social resources (Campbell 1988, Ensel 
1979, Moore 1990). The friendship networks of women tend to shrink relative to 
men during marriage and parenthood, but then expand as familial responsibilities 
recede (Fischer and Oliker 1983). These differences have important implications 
for employment outcomes, as women tend to learn about job opportunities from 
other women and men from other men (Hanson and Pratt 1991). d 

Few studies have taken advantage of a life course perspective on job matching 
by modeling variation in social capital effectiveness over time. When assessing the 
effectiveness of informal job matching on employment outcomes, most studies 
have tended to aggregate survey responses from a wide age range and report 
average characteristics or outcomes (e.g., Aguilera and Massey 2003, Green 
et al. 1999, Smith 2000, Stoloff et al. 1999). While this research has generated 
valuable insights, a focus on the “overall” effectiveness of social capital may 
mask contingent associations across the life course. Such an approach may also 
imply that the effectiveness of social capital is independent of the biographies of 
the individual workers who are being studied. 

Of course, sociological research in other areas has examined social resources 
over the life course. Research on social support has examined the health 
outcomes of relationships across the life course (Ishii-Kuntz and Seccombe 1989, 
Tamir and Antonucci 1981) and change in networks over time (Morgan et al. 1997, 
Wellman et al. 1997). Likewise, Coleman's (1990) research on social capital and 
subsequent extensions focusing on adolescent attainment and well-being have 
(at times) incorporated the life course perspective into their analyses (Furstenberg 
and Hughes 1995, Hagan et al. 1996, Sampson et al. 1999). But these studies have 
rarely informed research on social networks and job outcomes. Within the social 
networks literature, some have invoked life course explanations for curious findings 
(Campbell and Rosenfeld 1985), while others have examined age-specific cohorts 
thereby focusing on a single life stage — primarily the beginning of the work career 
(Holzer 1987, Rosenbaum et al. 1999). Such studies address life course issues 
but do not build comparisons across the life course into their analytic models. 
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One exception has been qualitative research on immigrant networks which has at 
times contained an explicit life course component (Hagan 1998). 

Life course theory also recognizes the role of history and spatial variation in 
shaping people's lifetime experiences (Elder 1998). While this issue is only dealt 
with cursorily here, we note its potential importance for studies of social capital 
and job outcomes. The use of personal contacts and networking is likely to vary 
in frequency and effectiveness across historical time and changing labor markets. 
Evidence of an increase in the use of headhunters during the past decade (Finlay 
and Coverdill 2002) suggests that the prevalence of relying on networking to 
get jobs is on the rise. The effectiveness of informal job matching also appears 
to vary at different times during the business cycle (Osberg 1993). Research on 
social capital before and after particular historical events (Angelusz and Tardos 
2001, Portes and Jensen 1989) and during periods of disruption (Beggs et al. 
1996) provides additional opportunities to examine the intersection of history 
and social capital. 


Towards an Assessment of the Effectiveness of Social Capital 


The problems noted above have confounded efforts to determine the effectiveness 
of social capital. In order to make an accurate assessment of the impact of social 
capital on employment outcomes, research should include non-searching as an 
indicator of informal job matching and examine the effectiveness of social capital 
across the life course. The analysis presented here attempts to make such an 
assessment. Mouw (2003) has outlined the criteria necessary for determining 
the extent to which social capital has a causal impact on employment outcomes, 
claiming that (1) people with the best social capital should be the most likely 
to get their jobs through personal contacts and that (2) personal contact use 
should yield higher wages. We expand on this framework by examining the 
effectiveness of both non-searching and the use of personal contacts. We also 
explore the potential benefits to non-searching and personal contact use beyond 
wages, testing the relationship between informal job matching and occupational 
status, job autonomy, supervisory authority, job class and job satisfaction. 

Finally, we conduct our examination at two time points in order to capture 
potential variation in the effectiveness of social capital at different stages of the 
life course. The results from recent research provide reason to suspect variations 
in outcomes across the work career. Recent research has demonstrated that there 
is a great deal of heterogeneity among non-searchers (McDonald 2005), contrary 
to the prevailing view of this group as highly advantaged workers in the labor 
market (Lin 2000, Marsden and Gorman 2001). Early in the work career, "entry- 
level" non-searchers tend to be students who are entering their first jobs, often 
parttime. During mid-career, "reentry-level" non-searchers tend to be women 
with little work experience who are transitioning back into the labor market after 
periods of caring for family responsibilities. Also during the mid-career, elite non- 
searchers tend to be male, have a great deal of work experience, and often enter 
executive and administrative positions. 

These three typologies represent distinct non-search processes that are 
embedded within and are structured by the life stages that workers occupy 
(McDonald 2005). They are likely to yield a variety of outcomes. As such, our 
analysis is guided by the following research questions: 
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1. Are people with the greatest social capital resources 
more likely to get their jobs through non-search or through 
personal contact use as opposed to searching for a job 
through more formal means? r 
2. Do non-searchers or personal contact users receive 
significantly higher average wages than formal job 
seekers? 
3. Do non-searchers or personal contact users gain 
significant non-wage employment advantages over formal 
job seekers? 

4. Do these associations differ across the work career and 
life stages? 


Data and Methods 


Data from the 1979 cohort of the National Longitudinal Survey of Youth (NLSY) 
are used to answer these questions. The NLSY is one of the few datasets that 
contains a direct measure of non-searching. The data also allow for an examination 
of informal job matching at “different points” in the work career. We focus on two 
time points in the dataset that contain detailed information about job matching 
and outcomes. In 1982 the study members were beginning their work careers 
(17-25 years old) and in 1998 they were in the middle of their working careers - 
(33-41 years old). 

The analysis only includes respondents who were employed at the time they 
were interviewed. People who were working in a family business and those who 
were self-employed are also excluded from the analysis. To correct for potential 
selection bias based on these deletions, separate Heckman selection models 
were run for the 1982 and 1998 cross-sections. The results produced estimates 
of the inverse Mill's ratio for each individual, which serve as a measure of the 
probability of being selected into the analysis (Breen 1996). The inverse Mill's 
ratio is included as a control variable in all of the multivariate analyses presented 
here to account for any selection bias that might exist. We also apply year-specific 
weights to the data to account for race and income oversamples and for bias due 
to sample attrition.? 

Table 1 displays the weighted mean values and number of missing cases 
for the variables included in the analysis. The respondents' primary jobs were 
identified as the job at which the respondent was currently working and at which 
they worked the most hours per week. Non-searchers were identified as people 
who responded negatively to the question, "Were you looking for work when you 
were offered this [current] job?" The remaining active searchers were asked a 
series of questions about the job search methods that led to their being offered 
their current job. Personal contact users claimed to have contacted friends or 
relatives to help them get their jobs. All other active searchers used formal 
search methods only, which include answering job advertisements, contacting 
employment agencies and applying directly to employers. Early in the career, 
close to half of the workers got their jobs through personal contacts (47 percent), 
while the remainder were about evenly matched to jobs through non-search 
(26 percent) and formal search (27 percent). During the mid-career, 21 percent 
obtained jobs by contacting friends and relatives, 27 percent received their jobs 


Table 1: Descriptive Statistics 
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General, prior job 


1982 1998 
Age 17-25 Age 33-41 
(n = 6348) (n = 6072) 

Non-search .26 46 2m 95 
Contact .47 46 .21 95 
Formal search .27 46 .52 95 
Male .52 0 .54 0 
Female 48 0 Ap 0 
White IE 60 12 42 
Black 10 60 "3 42 
Hispanic .05 60 .05 42 
Other race .10 60 .10 42 
In school 29 0 
Education (years) 12.38 1 13.53 0 
Number of jobs ever worked 3.87 0 
Work experience (years) 15.93 0 
Tenure (weeks) 74.69 64 303.76 66 
Working when offered job .36 46 Mi 97 
Father's occupational status (SEI) 36.48 1652 35.61 1679 
Father's education (years) 11.99 739 11.84 830 
Mother's education (years) dir 337 11.61 336 
Number of siblings 3.20 6 3.31 8 
Occupational aspirations (SEI) 48.37 1181 48.12 1140 
Part-time job 44 15 21 24 
Union job E 336 .16 43 
Government job .10 0 .24 0 
Hourly wages (in dollars), current job 5.27 356 15.97 122 
Hourly wages (in dollars), prior job 4.28 3377 13.83 1797 
Occupational status (SEI), current job ^ 28.30 1 38.69 21 
Occupational status (SEI), prior job 290.12 3202 35.25 2174 
Autonomy 45 1494 a 51 
Supervisor .38 53 
Authority .08 93 
Managerial .05 1 .19 11 
Skilled .08 1 20 11 
General .87 1 .61 11 
Manager, prior job .03 3202 14 2161 
Skilled, prior job .05 3202 Jr 2161 

.92 3202 .69 2161 


High job satisfaction .38 5 .48 0 
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without searching, and half of the workers (52 percent) got their jobs through 
formal job searches. , I 

While most of the variables in the analysis are self explanatory, some require 
clarification. A measure of the occupational aspirations of respondents (in 1979) 
is coded in terms of the 1980 Duncan Socioeconomic Index (SEl). Part-time work 
refers to jobs where respondents worked less than 40 hours per week. Union 
jobs in 1982 represent those covered by a collective bargaining agreement. For 
1998, union jobs were considered jobs where respondents were members of a 
labor union or employee association. Also, outliers on the hourly wage measures 
were recoded so that values range from $1 to $125. 

We examined a number of non-wage job quality indicators. Occupational status 
is measured according to the 1980 SEI. Job autonomy is measured differently in 
the two cross-sections. In 1982, respondents were asked about the extent to 
which their jobs allowed for independent thought and action. Those who claimed 
to have "quite a lot" or the "maximum amount" were coded as having a great deal 
of job autonomy. Respondents in 1998 who were not supervised by anyone were 
coded as having a great deal of job autonomy. Measures of supervisory authority 
were available only in the 1998 survey. Respondents who supervise others are 
considered supervisors. Respondents who have full responsibility either for 
setting the wages of people whom they supervise or for promoting the people 
that they supervise are considered to have a great deal of authority. 

Class categories (managers, skilled and general workers) were constructed 
according to similar coding rules as those used by Elliott (2000), based on 
each respondent's occupation and level of job authority. Managers are those 
that supervise others and have much job authority. Skilled workers are those in 
professional/technical occupations (1980 Census occupation codes 43-242) that 
do not have such supervisory authority. General workers were employed in non- 
technical occupations (codes 243-905) that lack supervisory authority. Since the 
job authority measures were not available in the 1982 data, the class categories 
are based solely on the occupation codes. Finally, global job satisfaction is 
measured as a dummy variable for high satisfaction, with a value of 1 referring to 
workers that claimed to like their jobs "very much." 

Some of the variables in the analysis were constructed across multiple waves. 
In the 1998 survey, job search questions were asked only in reference to new 
jobs at the time of the interview. Therefore, data from the 1996 and 1994 surveys 
were used to fill-in the gaps in job search information for the respondents' current 
jobs in 1998. Prior job variables are based on the most recent job worked by the 
respondent so long as the start date was before the start date of the current 
1998 job. Information on prior job was obtained from the main waves (1982 
and 1998) as well as from previous waves (1979-1981 and 1992-1996). For prior 
jobs reported in the 1998 and 1996 files, the class categories were constructed 
similarly to those of the current jobs. Prior jobs from the 1992-1994 files lacked 
the authority measures and, therefore, only the occupational classifications were 
used to construct the class variables for these jobs. 

For the main job in each year of the NLSY, 1980 Census occupation codes 
are offered in the NLSY main file extract. For other jobs, only the 1970 codes 
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were available. When available, the 1980 codes were used. Otherwise, the 1970 
codes were converted to 1980 codes based on probabilistic recoding (Hirsch 
and Macpherson 1997). The 1980 occupation codes were converted to 1980 SEI 
Scores to obtain the occupational status measures for current and prior jobs. 

Embedded dummy variables are included in the regression models to account 
for a substantial number of missing values. In this procedure, zeros are used in 
place of missing values for the substantive dependent variables, while dummy 
variables noting the absence of this information are also included in the models 
(Hardy and Reynolds 2004). We note the use of embedded dummy variables in 
the regression tables, but the coefficients are not reported. 


Results 
Social Capital and Informal Job Matching 


The first thing to consider is whether or not workers who have the greatest social 
capital resources are more likely to get their jobs through non-search or through 
personal contact use as opposed to searching for a job through more formal 
means. Prior research has shown that high quality social network characteristics 
are not associated with the use of personal contacts to get jobs (Mouw 2003). 
In fact, some negative social network indicators (such as proportion of friends 
on welfare) are positively associated with using personal contacts. For non- 
searchers, the question is more difficult to answer. Survey questionnaires on 
job searching tend to screen out respondents who claim to not have searched 
for their jobs, asking few additional questions about social networks or the 
contacts that helped them get their jobs (Granovetter 1995). Consequently, no 
known dataset has information on the social network characteristics of non- 
searchers (Mouw 2003). However, Lin (2003) has demonstrated that workers 
with the greatest social network resources were most likely to have received 
job information through their routine personal exchanges during the last year, a 
hallmark of non-searching. 

In the absence of direct social capital measures, we draw on measures of prior 
job characteristics (occupational status, wages and job class) in order to gain a 
sense of the social capital resources of respondents. Because previous research 
has shown that job status is correlated with social resources (De Graaf and Flap 
1988, Marsden and Hurlbert 1988), these variables can serve as estimates of 
the social resources available to workers at the time that they changed jobs. 
However, the extent to which prior job characteristics are associated with social 
capital resources of the respondents is likely to vary across the life course. In 
other words, early job characteristics may be less reflective of social capital 
resources than later job characteristics, as social resources earlier in life may 
be shaped more by family background while occupation likely plays a larger role 
later in life. To account for this possibility, father's occupational status is included 
in the analysis as well. ? 

Multinomial logistic regression is used to examine the factors associated with 
the likelihood of informal job matching. Table 2 displays the odds of non-searching 
or using personal contacts versus relying on formal search methods. The results 
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from Model 1 show that the associations between informal job matching and 
prior job characteristics are inconsistent. Early career non-searchers are not 
significantly different from formal searchers in terms of their average prior wages 
or their father's occupational status. Workers with lower job status at their prior 
jobs are significantly less likely to get their jobs without searching, but former 
skilled workers are more likely than general workers to get their jobs without 
searching (marginally significant, p — .056). Early career personal contact users 
tend to move from jobs that pay about 28 percent less on average than the jobs 
that formal searchers come from, otherwise the prior occupation variables are 
not significantly related to the likelihood of personal contact use. Overall, these 
variables appear to be unrelated to informal job matching early in the career. 

For workers in mid-career (see Models 2 and 3), people with the best prior 
occupational characteristics are generally more likely to get jobs without searching. 
For example, in Model 2 father's occupational status and wages at prior job are 
positively associated with the likelihood of non-searching as opposed to formal 
searching. The association between prior occupational status and non-searching 
is positive but insignificant and former managers are 36 percent more likely than 
general workers to get their jobs without searching (marginally significant, p — 
.064). None of the social resource indicators are significantly associated with 
the likelihood of using personal contacts for the 33-41 year olds. Model 3 more 
accurately specifies the model by including an interaction term between gender . 
and work experience. The interaction effect shows that, during mid-career, the 
likelihood of non-searching for men is highest among workers with the most 
experience, while the likelihood of non-searching for women is highest among 
workers with little experience. The distinction between elite non-searchers (highly 
experienced male non-searchers) and reentry-level non-searchers (female non- 
searchers who lack work experience) will become more important later when 
considering the payoffs to non-searching. For now, it is only important to note 
that the relationship between social network resources and the likelihood of non- 
searching remains the same after the inclusion of the interaction term. y 

These results suggest an answer to the question of whether or not people with 
the greatest social capital resources are more likely to be informally matched to 
their jobs. Social resources seem unrelated to informal job matching early in the 
career. The same is true later in the career for personal contact users. However, 
during mid-career, workers from high wage, managerial jobs are more likely to 
get their jobs without searching than through more formal means, as are workers 
whose fathers come from advantaged occupational backgrounds. This suggests 
that, during mid-career, people with the greatest social resources are the most 


likely to receive (and act on) unsolicited job information from their personal and 
professional contacts. 


Informal Job Matching and Hourly Wages 


The next issue to consider is whether informally matched workers receive higher 
wages than formal job seekers. For this, we ran a series of linear regression models 
on the log of hourly wages at respondents’ current jobs. In Table 3, Models 1 
and 3 provide the baseline estimates, which include dummy variables for non- 
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Table 3: Unstandardized Coefficients for Linear Regression on the Log of Hourly Wages, 


Current Job 


Non-search 

Contact 

Female 

Black 

Hispanic 

Other race 

In school 

Education 

Number of jobs 

Work experience 

Work experience squared 

Job tenure 

Part time 

Government 

Occupational aspirations (SEI) 
Occupational status (SEI), prior job 
Constant 


N 
R-squared 


*p<.05 **p <.01; two-tailed test 


Early Career 
(age 17-25) 
Model1 — Model 2 
-.084** -.083** 
(.019) (.019) 
-.046** -.045** 
(.014) (.014) 
-.125* -.130** 
(.015) (.015) 
-.042** -.041** 
(.014) (.014) 
.023 .021 
(.017) (.017) 
.022 .022 
(.022) (.022) 
.011 .008 
(.023) (.023) 
.010 .008 
(.007) (.007) 
.003 .003 
(.004) (.004) 
.0012** .0012** 
(.0001) (.0001) 
-.129"* -.126** 
(.016) (.016) 
- 0538 -.052** 
(.019) (.019) 
.0005 .0005 
(.0004) (.0004) 
.003** 
(.001) 
1.844** Sons 
.105 .108 
5554 5554 
.244 .247 


` 


Mid Career 
(age 33-41) 
Model 3 Model 4 
.059** .048* 
(.019) (.019) 
-.014 -.015 
(.018) (.018) 
-.181** -.188** 
(.022) (.022) 
-.103** -.091** 
(.016) (.015) 
-.006 -.016 
(.024) (.024) 
-.044 -.050 
(.028) (.027) 
.096** .082** 
(.007) (.007) 
OST tn .040** 
(.008) (.008) 
-.0003 -.0005 
(.0003) (.0003) 
.0004** .0004** 
(.0000) (.0000) 
-.113** -.102** 
(.024) (.024) 
"43128 lao" 
(.020) (.019) 
.0017** .0013** 
(.0004) (.0004) 
.006** 
(.001) 
1007 .161** 
.135 ¿139 
5460 5460 
373 .390 


Included in the models are additional controls for the inverse Mill's ratio and for missing values 
on occupational aspirations and prior occupational status (only in models 2 and 4). 
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searching and personal contact use (referencing formal search) as well as controls 
for personal characteristics, human capital, and job characteristics. Models 2 and 
4 control for prior occupational status. Early in the career, both non-searchers and 
personal contact users receive significantly lower average wages than formal job 
seekers. However, at mid-career, non-searchers receive significantly higher wages 
than formal searchers. There is no significant difference between the wages 
of contact users and formal searchers. The positive association between non- 
searching and wages is modestly reduced after controlling for prior occupational 
status, suggesting that a portion of the wage advantage of non-searching can be 
explained by the advantaged labor market positions of non-searchers' prior jobs. 

The wage advantage of non-searchers over formal searchers during mid-career 
requires further scrutiny. The higher average wages of non-searchers could be 
due to the fact that "employed workers who are not actively searching for work 
are satisfied with their current job and simply set a high reservation wage for 
competing offers." (Mouw 2003:881-82) Reservation wages reflect the lowest 
wage that people are willing to accept in order to change jobs (Devine and Kiefer 
1991). It remains possible that non-searchers merely have higher standards than 
people who are actively searching for a job and are willing to hold out for better 
job offers. Two of the most important factors determining the level of reservation 
wages that people set are unemployment and prior wages (Devine and Kiefer 
1991). Unemployed workers set lower reservation wages than employed workers 
because the opportunity cost of holding out for a better offer is higher (Mortensen 
1986). Prior wages are also important for determining reservation wages, as past 
earnings provide a reference point for expected future earnings. 

The role of reservation wages in explaining the mid-career wage advantage of 
non-searching is examined in Table 4. Model 1 controls for whether respondents 
were working at the time that they were offered a job and for the log of wages 
at respondents' prior jobs. When controlling for these factors, non-searchers 
receive wages that are, on average, no different from formal job seekers. This 
suggests that the overall wage advantage of non-searchers over active job 
seekers can be explained primarily by the advantaged labor market positions of 
non-searchers and the high reservation wages that they are able to set. During 
the mid-career, non-searchers are more likely to be working at high wage jobs 
at the time that they are offered the new job (see Table 2). This allows them to 
be very selective when considering new job offers or information about new job 
openings. The result is that non-searchers receive wages that are significantly 
higher than formal job searchers. 

In order to account for heterogeneity among mid-career non-searchers, 
a pair of interaction terms are introduced in Models 2 and 3. The first is an 
interaction between non-searching and female, which shows that the "effect" 
of non-searching on wages is significantly different for men and women. That 
is, female non-searchers fail to receive the same wage advantage over formal 
job seekers that male non-searchers receive. The relative wage advantage of 
non-searching also varies across different levels of work experience, as shown in 
Model 3. The interaction between non-searching and years of work experience 
indicates that non-searchers with very little work experience receive significantly 
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Table 4: Unstandardized Coefficients for Linear Regression on the Log of Hourly Wages, 


Current Job Mid-Career (Age 33-41) -- 
ARRE Niet pS Sal | Modell ikea MOJO AS ea ASS 
Non-search .007 041 -.149* 
(.019) (.025) (.074) 
Contact -.023 -.022 -.023 
(.017) (.017) (.017) 
Female -,147** -.128** -.146** 
(.021) (.021) (.021) 
Black -.086** -.086** -.087** 
(.014) (.014) (.014) 
Hispanic -.011 -.012 -.012 
(.023) (.022) (.023) 
Other race -.048 -.050 -.049 
(.027) (.027) (.027) 
Education 079 .074* .073** 
(.006) (.006) (.006) 
Work experience .036** 036" .034** 
(.008) (.008) (.008) 
Work experience squared -.0007* -.0007* -.0007* 
(.0003) (.0003) (.0003) 
Job tenure .0004** .0004** .0004** 
(.0000) (.0000) (.0000) 
Parttime -.079** -.076** -.076** 
(.023) (.023) (.023) 
Government -,114** -.114** -.113** 
(.019) (.019) (.019) 
Occupational status (SEI), prior job .003** .003** .003** 
(.001) (.001) (.001) 
Occupational aspirations (SEI) .0014** .0013** .0014** 
(.0004) (.0004) (.0004) 
Working when offered job 420% 120% 421^ 
(.015) (.015) (.015) 
Log of hourly wages, prior job OT ota ot 
(.018) (.018) (.018) 
Non-search*female -.077* 
(.035) 
Non-search*work experience .010* 
(.004) 
Constant ort ot .405** 
129 129 128 
N 5456 5456 5456 
R-squared 452 453 .453 


*p<.05 **p < .01 (two-tailed test) 
Additional controls for the inverse Mill's ratio and for missing values on prior occupational 
status and occupational aspirations are included in the model. 
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lower wages than formal job seekers with similarly low work experience. Non- 
searchers with a great deal of work experience, however, receive significantly 
higher wages than formal job seekers, beginning at around 20 years of work 
experience. To provide a clearer picture of these relationships, we ran Model 3 
separately for men and women and plotted the predicted hourly wages for non- 
searchers and formal searchers across different levels of work experience. Figure 
2 shows that the relative wage advantage for non-searching (compared to formal 
searching) is apparent only among people with the most work experience. But 
this interaction is driven mainly by men, as the wages of female non-searchers 
are not significantly higher than formal searchers even among women with the 
greatest amount of work experience. 


Figure 2. Predicted Hourly Wages for Men and Women by Job Matching Method and Work 
Experience, Mid-Career (Age 33-41) 
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To summarize, informal job matching early in the work career is associated with 
lower wages than formal job searching. During the middle of the career, personal 
contact users receive wages that are not significantly different from formal job 
seekers. But non-searchers, overall, receive higher average wages than formal 
searchers in this life stage. This advantage can be explained primarily by the fact 
that non-searchers are more likely than formal job searchers to be working at 
high wage/status jobs just prior to their being offered their new jobs. This allows 
them to set relatively high reservation wages for new job offers. However, elite 
(highly experienced male) non-searchers maintain wage advantages over formal 
job searchers even after these factors are controlled. 
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Informal Job Matching and Non- Wage Outcomes 


` 


We also examine other employment outcomes of informal job matching, 
beginning with occupational status. Table 5 displays the coefficients and 
standard errors for linear regression on occupational status. The results in Model 
1 show that, net of controls, non-searching is unrelated to status early in the 
work career. The SEI scores of personal contact users are lower, on average, 
than people who got their jobs through formal search, although this relationship 
is only marginally significant (p — .058). At mid-career, the SEI score differences 
between the informally matched workers and the formal job seekers are not 
statistically significant (see Model 2). However, the inclusion of an interaction 
between non-searching and female in Model 3 demonstrates that the status 
payoff differs for male and female non-searchers. Male non-searchers receive 
jobs that are significantly higher (1.798 points) on the SEI than male formal 
searchers, whereas female non-searchers obtain jobs that are .525 points lower 
on the SEI than female formal searchers. This provides further evidence that 
only elite non-searchers gain labor market advantages over people who search 
through formal means. 

Table 6 summarizes the results from logistic regression analyses on a number 
of other employment outcomes: job autonomy, supervisor, job authority, job 
class and job satisfaction. The table reports only the effects for personal contacts 
and non-searching, but each of the models contains an extensive set of control ' 
variables. The regressions from the 1982 cross-section show no employment 
advantages for the informally matched workers early in their careers. Non- 
searching is unrelated to all job outcome variables early in the career and personal 
contact users tend to receive worse jobs on average than formal searchers during 

the early career. Contact users are more likely than formal job searchers to receive 
- general jobs and are less likely to be highly satisfied with their jobs. During the 
mid-career, personal contact users are about 18 percent less likely than formal job 
seekers to be supervisors. Relying on personal contacts is negatively associated 
with most of the other job outcome variables as well, although none of these other 
relationships are statistically significant. By contrast, non-searchers tend to receive 
better jobs than formal job seekers later in the career; that is, managerial jobs with 
a great deal of autonomy and supervisory authority. In terms of job satisfaction, 
though, non-searchers do not differ statistically from formal searchers, even after 
including a gender*non-search interaction term (not shown). 

As before, there is little difference between the jobs that non-searchers and 
formal job seekers receive early in their work careers. Personal contact use, 
however, is generally associated with less desirable job outcomes than formal 
job searching. During mid-career, there does not appear to be much difference 
between the jobs obtained through personal contacts and those obtained through 
formal search. Non-searchers, however, consistently receive greater non-wage 
employment advantages overall when compared to formal job seekers. 

It is important to note that the differences in non-search behavior and 
outcomes reported here could reflect changes in labor market structures over 
time rather than changes across the work career. However, evidence from other 
data sources on non-searchers suggests that the patterns revealed here are 
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Table 5: Unstandardized Coefficients for Linear Regression on Occupational Status, Current Job 


Early Career Mid-Career 
(age 17-25) (age 33-41) 
ri PO QU sil, sni Model PAED X Model2^-— 72: Model 3 6s 
Non-search -.197 .746 1.798* 
(.576) (.548) (.731) 
Contact -.914 -.971 -.946 
(.482) (.620) (.620) 
Female 2.470** 1.989** 2.574** 
(.479) (.678) (.729) 
Black =1.315"* -3.581** -3.582** 
(.508) (.548) (.549) 
Hispanic 1.588** -.091 -.083 
(.571) (.720) (.720) 
Other race -.661 -.704 -.766 
(.675) (.789) (.788) 
In school -1.461* 
(.694) 
Education 2.416** 3.296** a 
(.222) (.215) (.214) 
Number of jobs -.147 
(.106) 
Work experience .134* 127 
(.068) (.068) 
Working when offered job 1.186** .147 deo 
(.433) (.493) (.493) 
Job tenure .0008 .0042** .0042** 
(.0037) (.0012) (.0012) 
Occupational status (SEI), prior job .284** SON .310** 
(.035) (.020) (.020) 
Father's occupational status (SEI) .015 .032 .031 
(.015) (.019) (.019) 
Father's education .109 2159 155 
(.084) (.093) (.093) 
Mother's education .164 .062 .064 
(.094) (.110) (.110) 
Number of siblings -.080 -.101 -.104 
(.091) (.097) (.097) 
Occupational aspirations (SEI) 1055% .064** .064** 
(.012) (.014) (.014) 
Non-search*female -2.323* 
(1.039) 
Constant -11.620** -28.286** -28.711** 
3.524 3.889 3.880 
N 5556 5443 5443 
R-squared .256 .423 .424 


*p«.05 **p <.01 (two-tailed test) 
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more likely to reflect changes across the work career. First, the NLSY estimates 
of non-search prevalence are similar to those from 30 years earlier (Campbell 
and Rosenfeld 1985, Granovetter 1995). Second, data from the Multi-City Study 
of Urban Inequality (MCSUI) — a cross-sectional study conducted between 1992 
and 1994 that contains a relatively inferior measure of non-searching — display 
similar patterns as those presented here. For example, in the MCSUI data, non- 
searching is positively associated with hourly wages among 33-41 year olds but 
unrelated among 21-25 year olds (17-20 year olds were not surveyed). The use 
of personal contacts is negatively associated with wages among both cohorts. 
Taken together, these findings suggest that the differences between the 1982 
and 1998 samples presented here are primarily the result of changes across the 
work career. 


Conclusion 


Studies of job searching have found negative job consequences for relying on 
personal contacts. However, the impact of social capital has been underestimated 
through the practice of excluding non-searchers from analyses. The results 
presented here reveal that, during the middle of the work career, (1) people with 
the best social capital resources are more likely to get their new jobs without 
searching than through a formal job search and (2) non-searchers receive better 
Jobs on average than formal job seekers. Social capital clearly does matter for job 
outcomes, but its impact depends on when and how a person is matched to their 
job. Early in the career, personal contact use is generally associated with negative 
job outcomes when compared to formal job searching. Later in the career, the 
advantages of social capital only accrue to non-searchers and not to personal 
contact users. Elite non-searchers receive the lion's share of those benefits. 

George(2003)has emphasizedthe needforextendingthelife course perspective 
into a variety of research areas. The analysis presented here provides such a 
contribution by highlighting the usefulness of life course theory when examining 
social capital effectiveness. As people age, they engage in new experiences and 
acquire new information, expertise, and contacts. This transforms the character 
of their social capital which, in turn, alters their labor market opportunities. Greater 
attention needs to be focused on the dynamic processes by which social capital 
accumulates and translates into labor market advantages. Informal job matching 
methods are paramount in this respect, as they serve as the conduits through 
which many of the social capital benefits are realized. 

Social capital theorists can no longer afford to ignore the role of non-searchers 
in this process. The instrumental reductionist view of economic action, pervasive 
in the economic sociological literature, is too limited to understand labor market 
embeddedness (Granovetter 2003). Focusing solely on the purposive mobilization 
of social resources underestimates the utility of social capital (Lin 2005). Non- 
instrumental forms of action — such as non-searching - need to be incorporated 
into a theory of social capital (Granovetter 1985) in order to reflect the full extent 
to which social capital can provide benefits in the labor market. 

However, much remains unknown about non-searchers. Why is it that they 
are able to gain significant employment advantages over formal job seekers (at 
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least later in their careers), while personal contact users are unable to receive 
any advantages? This is most likely due to a lack of social capital available 
to personal contact users. People with the greatest social resources are the 
most likely to receive unsolicited job information through routine personal 
exchanges (Lin 2003). As Granovetter (1995) put it, “those who are searching 
[for a job] would not if they didn't have to, and wouldn't have to if they were 
good enough.” (36) But the advantages of non-searching do not span the entire 
work career or work force. Early in the career, people have not had much time 
to build up high quality work contacts. As a result, the non-searching at this 
career stage generally leads to jobs that are no better (and sometimes worse) 
than jobs received through formal search. Later in the career, non-searching is 
associated with many employment advantages, but these advantages apply 
mainly to highly experienced workers. Work experience generally leads to a 
broader range of work contacts (Bridges and Villemez 1986), which are likely to 
help people gain access to better job information. 

Many of the best jobs are obtained without an active search on the part 
of workers. Elite non-searchers are far more likely to work in executive and 
administrative occupations than in any other occupational classification (McDonald 
2004). Overall, the likelihood of elite non-searching is sorted according to job 
hierarchy, as the jobs with greater prestige, autonomy and supervisory authority 
are most often obtained without a search. These positions may not be advertised 
to the general public and instead are often filled with employer recruitment that ` 
targets specific individuals, limiting access to the best connected workers. In this 
way, non-searching may serve as an indicator of how closed a labor market is 
(Elliott 2000). This possibility should be considered more fully in the future. 

The findings presented here suggest that informal networks play an important 
role in the maintenance of gender stratification in the labor market. Women with 
a great deal of work experience are much less likely than men to get their jobs 
without searching. And, when these highly experienced women get their jobs 
without searching, they do not receive the same payoffs that men receive. It 
is possible that women receive job information at about the same rate as men 
but choose not to act on that information. However, women's deficit in social 
capital seems the more likely culprit. Women tend to have fewer social resources 
and fewer work contacts in their social networks than men (Campbell 1988, 
Fischer and Oliker 1983, Moore 1990), hindering their chances of receiving job 
information informally. 

Ibarra (1992) has noted that men maintain greater network centrality within 
organizations than women, making them more likely to receive valuable job 
information. This supports Kanter's (1977) notion of the informal isolation of 
women, such that women are excluded from informal socialization within 
organizations and are therefore slow to receive information. Similarly, the findings 
could result from the existence of "good ol' boy" networks that keep women from 
information about quality openings in management. Informal recruitment allows 
employers the discretion to select male employees to fill job openings (Reskin 
2003) and increases the likelihood of a man occupying a managerial position 
(Reskin and McBrier 2000). To the extent that informal recruiting is used to fill 
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higher level positions, non-searching may serve as an important mechanism in 
maintaining the "glass ceiling effect." (Cotter et al. 2001) 

Unfortunately, very little is known about non-searchers and the non-search 
process. Qualitative data on the history and context of non-searchers' receipt of 
job information are sorely needed, as are longitudinal data on the social network 
composition of non-searchers and active job seekers. With these kinds of data, 
social scientists and policy makers can achieve greater understanding of the 
informal mechanisms by which people attain success in the labor market and 
how social resources shape employment opportunities over the life course. 


Notes 


1. Not all non-searchers receive their job information through their personal 
contacts. People who get their jobs through executive search consultants or 
headhunters serve as a possible example (Finlay and Coverdill 2002). Head- 
hunting does, however, involve a great deal of informal information flows 
as it involves "cultivating networks, monitoring information flows, ...paying 
attention to the gossip and rumor that characterize all industries" (Finlay and 
Coverdill 1999: 16). 


2. The results from the unweighted analyses are substantially similar to those 
from the weighted analyses and reveal essentially the same story. 


3. We also tested mother's occupational status, father's education level and 
mother's education. None of these variables were significant in any of the 
models nor did they alter the interpretation of the other variables. 
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Family Instability and Children's Early Problem Behavior 


Shannon E. Cavanagh, University of Texas at Austin 
Aletha C. Huston, University of Texas at Austin 


Abstract 

This study investigated the association between family instability and children's problem 
behavior during the transition to first grade. In a sample (n — 1,015) drawn from the 
NICHD Study of Early Child Care and Youth Development, we found that a quarter 
of sample members experienced at least one family transition between birth and age 6. 
Instability was also related to family structure at birth: those born into cohabiting parent 
families experienced the most instability, followed by those born into single mother families 
and finally, those in two-biological married parent families. Children who experienced 
instability had higher teacher and observer reports of problem behaviors than those from 
stable family structures. Finally, differences in problem behavior associated with family 
instability varied by family structure at birth and the emotional, social and material 
resources in the family. 


The structure of American families has undergone dramatic change over the 
past 50 years. Increases in non-marital fertility, divorce and cohabitation coupled 
with declines in marriage and remarriage have translated into more complex 
living arrangements for American children (Casper and Bianchi 2002). Point 
estimates indicate that most children live with both biological parents, but life 
course estimates suggest that more than half of them will spend some time 
in an alternative family structure by their 18th birthday (Bumpass and Lu 2000, 
Teachman 2003). This growing diversity in the structure of families has fueled 
public debate and policy initiatives, including state and federal policies designed 
to encourage marriage (Ooms, Bouchet and Parke 2004). These initiatives have 
prioritized the need for understanding the implications of family transitions for 
child development. 

The social and behavioral sciences have accumulated a solid foundation of 
evidence that divorce and single parenthood pose risks for the behavioral and 
emotional development of children and youth (Amato 2005; Clarke-Stewart et 
al. 2000; Coleman 1998; DeLeire and Kalil 2002; Hetherington, Bridges, and 
Insabella 1998; McLanahan and Sandefur 1994). We know less, however, about 
the dynamic nature of children's living arrangements and its implications for 
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behavioral problems. This research addresses the issue in an effort to contribute 
to the body of theoretically grounded empirical research that delineates the 
circumstances under which family structure change and child development are 
linked over time. 

According to the life course paradigm, fresh insights into the link between 
family structure and child well-being can be gained by considering life trajectories, 
transitions and social contexts (Elder 1998, George 1993). In this study, we 
investigate the developmental implications of trajectories or histories of family 
structure change, which we refer to as family instability. Given the increasing 
fluidity of children’s living arrangements, examining parents’ marital histories 
— rather than their current marital statuses — better captures the circumstances in 
which children are raised (Hao and Xie 2002). Our first goal in this study, then, is to 
describe these histories. Next, we consider how these family circumstances affect 
children’s behavioral adjustment during a major transition in the early life course: 
the transition to elementary school. Specifically, the family instability that children 
experience before entering this new social institution is presumed to affect how 
well they adjust, both socially and emotionally, upon entry. Finally, recognizing that 
the experiences of family instability are not uniform across families, we examine 
how resources (and deficits) in the family context buffer (or exacerbate) the effects 
of family instability on children’s behavior during this transition. 

We pursue these three objectives using a valuable data set for research on . 
child development: the NICHD Study of Early Child Care and Youth Development 
(SECCYD). The quality and quantity of prospective, longitudinal data on family 
structure, family environment and child behaviors collected in this national (but 
not nationally representative) study is unmatched in larger studies to date: 
maternal reports on family composition were collected at least four times a year 
for the child's first six years of life, observer- and parent-reported data on the 
home environment were collected at multiple time points, and observer- and 
teacher-reported data on child behaviors were gathered after the transition to 
elementary school. The richness of these data maximize our ability to understand 
the effects of family instability and alternative family structures on children's 
socio-emotional adjustment at a time when family structure change is becoming 
an ubiquitous feature of American childhood. 


The Dynamic Structure of American Families 


Over the past half-century, the structure and stability of American families have 
changed dramatically, with changes in marriage, divorce, remarriage, non- 
marital fertility and cohabitation contributing to the fluidity of children's living 
arrangements. From the perspective of a child, family structure can include spells 
of parental marriage, divorce, single parenthood, cohabitation, remarriage or all of 
the above (Teachman 2003, Bumpass and Lu 2000, Raley and Wildsmith 2003). 
This increasing fluidity has complicated social and behavioral research on the 
developmental significance of family structure, but an emerging literature has 
begun to address the issue (Hao and Xie 2002; Hill, Yeung, and Duncan 2001; 
Teachman 2003; Wu and Martinson 1993; Wu 1996; Wu and Thompson 2001). 
Combining insights from stress theory with evidence of the negative effects of 
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residential mobility, the instability and change perspective posits that changes in 
a parent's marital or romantic histories (e.g., divorce, remarriage, cohabitation) 
constitute a major stressor in a child's life, one that cumulates with each transition. 
Family structure change represents the loss or addition of a parental figure, which 
may disrupt a child's sense of security and create ambiguity in household rules, 
family relationships and parental expectations about behavior (Amato 2000, 
Simons and Associates 1996, Wu and Martinson 1993, Wu and Thompson 2001, 
Teachman 2003). It may also create dramatic changes in family income and parents' 
employment patterns (McLanahan and Sandefur 1994, Amato 2000). Additionally, 
residential moves, which often accompany family structure change, can exacerbate 
the stress of family transitions by breaking neighborhood and school ties, extended 
family relationships and children's friendships (Astone and McLanahan 1994, 
McLanahan and Sandelur 1994, Speare and Goldscheider 1987). 

Beyond highlighting that family structure change is stressful, the instability 
and change perspective also emphasizes the potentially cumulative nature of 
family structure change. Although many children never experience a family 
structure change, those who experience one family transition are at a greater 
risk of experiencing subsequent transitions and the concomitant stresses and 
residential moves that they involve (Martinson and Wu 1992). Thus, young 
people who experience multiple family transitions are expected to experience 
more compromised well-being than those who experience no such transitions or 
only one (Teachman 2003). 

The instability and change perspective has been generally indifferent to the 
effects of different family structure statuses (Martinson and Wu 1992, Teachman 
2003). However, the family structure statuses from which and into which children 
are transitioning may have different implications for children's behavior. Theories 
that emphasize changing family resources predict that the effects of family 
instability will vary by initial family structure. For example, a single mother's 
transition to marriage may produce a family structure change that adds material, 
social and emotional resources to the household. Indeed, much of the current 
effort to promote marriage is based on the assumption that a two-parent family, 
even one formed after the child is born, will contribute to positive adjustment 
for children. The literature on child well-being in stepparent families, however, 
suggests that parental remarriage is not a panacea, with children in stepparent 
families doing about the same and sometimes worse than children in single mother 
families on a host of behavioral and emotional outcomes (see Coleman, Ganong 
and Fine 2000 for review). That is, remarriage may not alleviate the disadvantages 
associated with single parenthood even when it raises family income and parental 
monitoring (McLanahan and Sandefur 1994). Given the growing public interest 
in strengthening marriages and reducing family instability, the developmental 
implications of different family structure statuses at birth must be explored. 

Following the emphasis on trajectories in the life course paradigm in general 
and the instability and change perspective in particular, the first objective of this 
study is to determine the level of family instability in children’s lives. We do so by 
examining the changes in children’s family structure from birth through the start 
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of elementary school, using measures of family structure captured at more than 
20 time points in the SECCYD data. 


Children's Problem Behavior during the Transition to Elementary School 


The life course paradigm also calls special attention to transition points that direct 
children's social pathways (Elder 1998). The start of elementary school is one such 
transition. It marks children's movement from the family into the larger society, 
their introduction to a large-scale peer culture, and the start of an educational 
trajectory that will be a defining factor in their long-term prospects (Pianta and 
Cox 1999). How well children make this transition — not just academically but also 
behaviorally and emotionally — goes a long way toward predicting their futures 
(Entwisle and Alexander 1993, 1999). 

One key dimension of this transition involves children's problem behavior 
in school, including their disruptive behaviors with teachers and peers in the 
classroom. These behaviors can interfere with children's ability to learn and impair 
their formation of positive relationships with both teachers and peers (Entwisle 
and Alexander 1999). Importantly, evidence from past family structure research 
indicates that such behavior can be a product of family instability (Doherty and 
Needle 1991, Simons and Associates 1996). 

Fortunately, the NICHD study provides an excellent resource for studying . 
problem behaviors during the transition to elementary school. Children's behaviors 
are assessed by teacher reports and by direct observations in the classroom. 
Teachers complete a report of behavior problems, including externalizing 
behaviors such as disobedience, noncompliance with requests, and lying and 
cheating, which provides both an evaluation of the children's adjustment and an 
assessment of how the children are living up to expectations of school authority 
figures (Entwisle and Alexander 1999; Ehrenberg, Goldhaber, and Brewer 1995; 
Rimm-Kauffman et al. 2000). Trained observers with no prior knowledge of the 
children also rate them in the same classrooms on negative, disruptive behaviors 
with peers and negative, disruptive behaviors with the teacher. These observations 
complement teacher reports and avoid some of their potential biases, providing a 
reasonably objective measure of problem behavior in the classroom. With these 
rich data, the second objective of this study is to link histories of family instability 
with. children's socio-emotional adjustment in the classroom. 


The Importance of the Family Environment 


The link between family structure histories and children's problem behavior 
during the transition to elementary school cannot be understood independently 
of the other circumstances that define the families undergoing these changes. 
Indeed, as emphasized by the life course paradigm, context matters. The 
material and emotional resources in the family may buffer or exacerbate the 
stresses and benefits induced by family transitions, so children's experiences of 
family instability are likely to vary depending on the overall family environment in 
which they live (Booth and Amato 2001, Jekielek 1998, Wheaton 1990). The third 
objective of this study, then, is to consider how family circumstances moderate 
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the association between family instability and children's problem behavior at the 
start of formal schooling. 

One important circumstance is family structure at birth (i.e., marriage, single 
parenthood or cohabiting union). For example, we expect that the children born into 
two-biological married parent families would be less affected by subsequent family 
transitions than those born into other family types if the greater social, economic 
and emotional resources that often mark the former offer lasting benefits. For the 
same reason, we expect children born into single mother and cohabiting families 
would be more negatively affected by subsequent family transitions. 

Other important family circumstances include maternal depression, maternal 
sensitivity, material and emotional resources in the home environment, and 
the family's income-to-needs. Past research has demonstrated that these four 
family circumstances mediate the link between family instability and child well- 
being (Amato 2000, Hetherington 1999, McLanahan and Sandefur 1994). In 
the present study, we consider whether these factors condition or moderate 
the developmental significance of family instability. In general, we expect that 
the negative consequences of family instability will be less pronounced in 
family environments that support children's socio-emotional adjustment and 
more pronounced in less supportive family environments. Because maternal 
depression, parenting behavior, the home environment and income status 
change as family composition changes, we use unconditional linear growth 
curve models to compute individual intercepts and slopes for each indicator 
across early childhood. Then, we investigate how both the levels (intercepts) 
and changes (slopes) in these factors moderate the relations between family 
instability and child well-being. 

Overall, we expect that if higher levels of positive family factors (e.g., maternal 
sensitivity, home resources, income) weaken the link between family instability 
and children's problem behavior at the start of elementary school, the mean 
levels of these factors protect children in unstable families. Similarly, if increases 
in these positive factors across early childhood weaken the link between family 
instability and children's problem behavior, these increases serve protective 
functions in families undergoing change. Alternatively, if higher levels of or 
increases in negative factors (i.e., maternal depression) strengthen this linkage, 
we infer that these factors compound the risks of family instability. 


Overview of the Study 


In this study, we use the life course paradigm and the instability and change 
perspective to (a) describe the histories of family instability during the preschool 
years for children born into different family structures; (b) examine how family 
instability from birth through early childhood affect children's problem behavior 
during the transition into formal education; and (c) explore how the developmental 
significance of family instability is exacerbated or reduced by initial family 
structure as well as the material, social and emotional resources of the home, in 
order to identify circumstances in which children are most and least vulnerable 


to family instability. 
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Research Design and Methods 


` 


The sample was drawn from the NICHD SECCYD, a comprehensive longitudinal 
study following children from birth through elementary school (see NICHD ECRN 
2005, public.rti.org/secc for more details). The families who participated in this 
study were recruited from hospitals located in or near Little Rock, AR; Irvine, 
CA; Lawrence, KS; Boston, MA; Philadelphia, PA; Pittsburgh, PA; Charlottesville, 
VA; Morganton, NC; Seattle, WA; and Madison, WI. The families lived in urban, 
suburban and rural communities near these hospitals. During selected 24-hour 
sampling periods in 1991, 8,986 women were visited in the hospital shortly after 
giving birth. Of these women, 5,265 met the eligibility criteria for the study and 
agreed to be contacted after their return home from the hospital. The mother 
had to be at least 18 years of age and conversant in English, the infant had to be 
a singleton and healthy, and the family could not be planning to move within the 
following year. When infants were 1 month old, 1,364 families (58 percent of those 
contacted) were enrolled in the study. Although the sample was not nationally 
representative, it was diverse: 24 percent ethnic minority children, 11 percent 
mothers without a high school education, and 14 percent unmarried mothers. 

The original purpose of SECCYD was to understand the developmental 
significance of early child care, however, the breadth and depth of data collected 
makes it a valuable resource for studying child development in general. The study . 
is useful for this project because it contains prospective information on family 
structure, parenting quality, the home environment, maternal psychological well- 
being, family income, and child outcomes from birth through middle childhood. 

The analytic sample used in this study includes 1,015 children (74 percent 
of the original sample). Attrition analyses comparing the analytic sample to the 
original sample (n — 1,364) indicate more sample loss and missing data for 
children who were more likely to experience family instability and to exhibit 
externalizing behaviors or engage in disruptive behavior with peers or teachers 
(see Table 1). For example, children in the analytic sample were less likely to be 
born into single parent families, to be African American or to have mothers with 
less than a high school degree. These differences, however modest, must be 
kept in mind when interpreting results. 


Measures 


Three indicators of child problem behavior were examined. First, first-grade 
teacher reports of externalizing behaviors were assessed with the age-appropriate 
version of the Teacher Report Form (TRF), which lists 120 child behaviors. The 
externalizing problem behavior scale listed 34 items, including being explosive, 
demanding, disruptive to other pupils and disobedient at school. For each item, 
the teacher was asked how well it described the child in the past two months 
(0 = Not True; 1 = Somewhat or Sometimes True; 2 = Very or Often True). 
Raw scores were totaled and converted into age-standardized T-scores based on 
national norms (Achenbach 19971). 

The other two indicators of problem behavior were derived from classroom 
observations conducted during the winter or early spring of first grade using the 
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Table 1: Comparison of Full and Analytic Sample 


——— ÁN 
Full Sample Analytic Sample 
(n = 1364) (n = 1015) 
Percent M Percent M 
Family structure at birth 


Married 77.4 80.1 
Single 13.6* 11.8 
Cohabiting 8.9 8.1 
Mother's education 
Less than high school 10.3* 8.1 
High school graduation 21.0 19.9 
Some college 33.4 32.9 
At least college graduation 95.3 39.1 
Mother's age 26.8 28.6 
Child's gender (Female) 48.3 49.6 
Child's race 
White 80.4 82.6 
African American 12.9% 11.0 
Other 6.7 6.4 


*p«.05 “p<.01 **p<.001 


Classroom Observation System for First Grade (COS-1, NICHD ECCRN 2002). All 
observations began with the official start of the school day and continued for 
approximately three hours. 

For the COS-1 observation, two 44-minute cycles were completed. In each 
cycle, trained observers made time-sampled recordings for three 10-minute 
periods of 30-second observe and 30-second record intervals. Thus, there were 
30 different minutes in which discrete behaviors were sampled in each cycle, 
for a total of 60 different minutes (i.e., 60 intervals). The observers recorded the 
frequency of 44 key behaviors including classroom activity, content, teacher 
behavior, child behavior toward teacher, child behavior with peers and child 
engagement. All observers had to pass a videotaped reliability test involving six 
44-minute cycles for both behavioral coding and qualitative ratings. Criteria for 
passing were at least a 60 percent match with a master coder on time-sampled 
codes (see NICHD ECRN 2001 for details). 

For this study, we chose two sets of observed behaviors. The count of negative 
or disruptive behavior with peers was the number of times a child was observed 
being physically aggressive with peers or displaying other negative behaviors 
toward them. The measure ranged from 0 to 8, with most children displaying 
very few of these negative behaviors (M = .29, SD = .85). The count of negative 
or disruptive behaviors with teachers reflected the number of times a child was 
observed not complying with the teacher's request, behaving negatively with the 
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teacher, being disruptive with the teacher, or being inappropriate in the classroom. 
The measure ranged from 0 to 29, with children, on average, displaying just under 
2 negative or disruptive behaviors with the teacher (M = 1.68, SD 327r 

Family instability is indexed as the number of changes in parents' relationships 
with partners (i.e., marriage, singleness, cohabitation, divorce) living in the child's 
home from birth through the end of kindergarten, rather than as changes in 
household composition (e.g., movement of aunts, grandparents, family friends, 
etc. into/out of the home). In telephone interviews (at 3, 9, 12, 18, 21, 27, 30, 33, 
42. 46, 50, 60 and 66 months and the fall and spring of kindergarten) and home 
interviews (at 1, 6, 15, 24, 36 and 54 months), each mother completed a household 
roster in which she listed each household member and that person's relationship 
to her and to the study child. At each data point, family structure was coded into 
one of nine mutually exclusive categories: (1) two biological parents (married), (2) 
two biological parents (cohabiting), (3) biological mother and stepfather (married), 
(4) biological father and stepmother (married), (5) biological mom and cohabiting 
partner, (6) biological father and cohabiting partner, (7) biological mother-only, 
(8) biological father-only, or (9) all other family types. Next, we constructed a 21- 
variable array, with each variable reflecting a different data collection point and 
its value reflecting family structure composition at that time. From this array, we 
constructed a count of the number of family transitions from birth through the 
end of kindergarten (M = .50, S.D. = 1.05, range 0-6).* 

Given the skewness (2.5) of the distribution of this measure, with most sample 
members having fewer than three transitions, we constructed two additional 
measures of family instability (i.e., a set of dummy variables that indicated no 
transitions, one transition, two transitions and three or more transitions and a 
quasi-count variable where 0 = no transitions, 1 = 1 transition, 2 = 2 transitions, 
. 9 — 8 or more transitions). We used these measures as alternatives to estimate 
the baseline model (described below) and compared model fit for each with that 
of the original continuous variable. We relied on the BIC (Bayesian Information 
Criterion) statistic, a model fit statistic less influenced by sample size or the 
number of variables in the model, to identify the most parsimonious model that 
best fits the data (Rafferty 1995). For each outcome, the quasi-count variable best 
fit these data, so it was used in subsequent analyses. 

We considered five factors that might moderate the linkage between family 
instability and children's problem behavior during the transition to elementary school. 
The first was family structure at birth, defined as family structure at month 1, drawn 
from the family structure array described above. The proportions were: married 
couple (80 percent), single mother (12 percent), and cohabiting couple (8 percent). 
The last category was composed mostly of two biological parents but six children 
were born into families that included their biological mother and her partner. 

Second, maternal depressive symptoms were assessed at 6, 15, 24, 36 and 54 
months, using the Center for Epidemiological Studies Depression Scale (CES-D), 
a self-report measure that assesses depressive symptomatology in the general 
population (Radloff 1977). Cronbach's alpha coefficients ranged from .88 to .91 
in the present sample. Depression scores were correlated over time, with cross- 
time correlation coefficients ranging from .52 to .58. The individual intercept, 
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centered at the child's median age in early childhood (27 months), and the linear 
slope were included as factors in the current analyses. 

Third, maternal sensitivity was measured using semi-structured 15-minute 
videotaped observations at 6, 15, 24, 36 and 54 months. The observed tasks 
provided a context for assessing age-appropriate qualities of maternal behavior. 
At the six-month visit, mothers were asked to play with their infants and were 
told that they could use any toy or object available in the home or none at all for 
the first seven minutes. For the remaining eight minutes, mothers were given a 
standard set of toys they could use in play. At 15, 24 and 36 months, mothers 
were asked to show their children age-appropriate toys in a set order and have 
their children play with the toys (Vandell 1979). 

Videotapes of the mother-child interactions were shipped to a central location 
for coding by raters who were blind to other information about the families. 
Inter-coder reliability was determined by assigning two coders to 20 percent of 
the tapes randomly drawn at each assessment period. Coders were unaware of 
which tapes were assigned to double coding, and reliability assessments were 
made throughout the period of coding. Inter-coder reliability was calculated as 
the intra-class correlation coefficient. Reliability for the composite scores used in 
the current report exceeded .83 at every age. 

At 6, 15 and 24 months, composite maternal sensitivity scores were created 
from the sum of three four-point ratings (maternal sensitivity to child nondistress, 
intrusiveness [reversed] and positive regard). At 36 and 54 months, composite 
maternal sensitivity scores were created from the sum of three seven-point ratings 
(supportive presence, hostility [reversed] and respect for autonomy). Cronbach's 
alphas exceeded .70 at every age. These scores were standardized at each age. 
As expected, maternal sensitivity scores were correlated over time (cross-time 
correlation coefficients ranged from .39 to .52) (see NICHD ECRN 1999, 2003). 
The intercept, centered at 27 months, and the linear slope of maternal sensitivity 
were included in these analyses. 

The third factor was the Home Observation for Measurement ofthe Environment 
(HOME) administered during home visits at 6, 15, 36 and 54 months. The focus 
of this instrument is on the child /n the environment, child as a recipient of inputs 
from objects, events, and transactions occurring in connection with the family 
surroundings (Caldwell and Bradley 1984). The Infant/Toddler version of the 
inventory (ITHOME) was used at the 6- and 15-month home visits. Its 45 items are 
clustered into six subscales: (1) parental responsivity, (2) acceptance of child, (3) 
organization of the environment, (4) learning materials, (5) parental involvement, 
and (6) variety in experience. At the 36- and 54-month home visits, the Early 
Childhood version of the inventory (EC-HOME) was used. Its 55 items are clustered 
into eight subscales: (1) learning materials, (2) language stimulation, (3) physical 
environment, (4) parental responsivity, (5) academic stimulation, (6) modeling, (7) 
variety of experience, and (8) acceptance of child. For both inventories, each item 
was scored in binary fashion (yes/no). Information was scored during the course 
of each home visit through observation and semi-structured interviews. 

A centrally located system of training was used for data collectors at each age. 
Every four months, each observer coded videotaped visits, and the coding was 
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compared with gold-standard codes provided by expert coders. All observers were 
required to maintain a criterion of scoring like the master coder on 90 percent of 
the items. Cronbach's alphas for the total score at each age exceeded .77. 

Unlike maternal depression and maternal sensitivity no HOME score was 
collected during the 24-month home interview. To ensure comparability, the 
four HOME scores were standardized at each age and the mean of the 15- 
and 36-month HOME scores was used to impute the 24-month HOME score. 
As expected, HOME scores were correlated over time (cross-time correlation 
coefficients ranged from .57 to .69). The intercept, centered at 27 months, and 
the linear slope were included as factors in the current analyses. 

Finally, family income status was based on maternal reports of family income 
at 6, 15, 24, 36 and 54 months and included all sources of household income. 
An income-to-needs ratio was calculated for each family at each data collection 
time point by dividing the total family income by the poverty threshold for that 
family size (cross-time correlation coefficients ranged from .76 to .84). Again, the 
intercept, centered at 27 months, and the linear slope were included as factors 
in the current analyses. 

Child's gender (dummy variable for female) and race (dummy variable for 
African-American, with all others serving as the reference category) were based 
on mother reports collected at the month-1 home interview. Mother's age and 
educational attainment were also based on mother reports at the month-1 home 
interview. Age was presented as a continuous variable, and education was coded 
into three dummy variables: college graduation or more, some post high school 
education or less than high school graduation, with high school graduation as the 
reference category. 


. Analytic Techniques 


The first objective of this study was to create family structure histories that 
captured stability and change in family structure from birth through the end of 
kindergarten. From these histories, measures of family instability and family 
structure statuses were constructed. 

The second objective was to examine the linkage between family instability and 
children's problem behaviors during the transition to elementary school. Using 
ordinary least squared (OLS) regression, teacher-reported externalizing problem 
behavior was regressed on measures of family instability, family structure at birth, 
the family environment in early childhood and control variables. Similar models 
were estimated for observed negative behavior with peers and negative behavior 
with teachers, but Poisson regression models, not OLS, were estimated on these 
low frequency counts of problem behaviors. 

The third objective was to identify the conditions under which family instability 
was more or less important to children's behavior at the start of formal schooling. 
We considered family structure at birth and four aspects of the family environment 
as potential moderators of instability. Because maternal well-being, parenting 
behaviors, the home environment and income status change as family composition 
changes, we used unconditional linear growth curve models to estimate individual 
intercepts and slopes for each indicator to describe average levels and longitudinal 
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Table 2: Descriptive Statistics for Analysis Variables 


Percent M SD N 
Family Structure History 
Family Structure at Birth 
Married 80.1 - - 811 
Single 11.8 = - 119 
Cohabiting 8.1 - = 82 
Family Structure Changes .50 1.05 1012 
0 74.8 - - 757 
1 10.9 - - 110 
2 8.8 - - 89 
3+ 55 - - 56 
By end of kindergarten, ever lived in: 
Two-biological married parent family 82.7 - - 837 
Any alternative family 34.9 - - 353 
Two-biological cohabiting parent family 11.0 - - 111 
Married stepparent family 4.0 - - 40 
Cohabiting stepparent family 8.3 - - 84 
Single parent family 29.5 - - 299 
Sociodemographic Characteristics 
Mother's Education 
Less than high school 8.0 - - 81 
High school graduation 19.9 - - 201 
Some college 33.0 - - 334 
At least college graduation 39.1 - - 396 
Mother's Age - 28.57 9.09 1012 
Child's Gender (Female) 49.5 - - 501 
Child's Race 
White 82.6 - - 836 
African American 11.1 - - 112 
Other 6.3 - - 64 
Outcomes 
Teacher Reports 
Externalizing behaviors - 50.63 8.65 1002 
Classroom Observations 
Negative behavior with peers - .29 .85 961 


Negative behavior with teachers - 1.68 oU 961 


patterns of change. Building on the models described above, interactions between 
family instability and each moderator's slope and intercept were entered into 
separate regression models. The interaction between the intercept for maternal 
depression and family instability, for example, tested whether the mean level 
of maternal depression moderated the link between family instability and child 
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behavior. The interaction between the slope for maternal depression and family 
instability tested whether changes in maternal depression over time moderated 
this link. Only interaction models with significant interactions are presented. All 
analyses were estimated with Stata (StataCorp 2001). 

In the unconditional linear growth models estimated for this study, each of 
the individual intercepts and slopes was statistically significant*. For maternal 
depression, there were systematic individual differences in the intercept (z= 
20.9, p < .001) and linear change over time (z = 6.8, p < .001). The same was 
true for maternal sensitivity (z = 19.1, p < .001 for intercept and z = 3.6, p « .001 
for slope), the HOME score (z = 22.9, p < .001 for intercept and z = 10.8, p < 
.001 for slope), and the income-to-needs ratio (z — 23.9, p « .001 for intercept 
and z = 6.9, p < .001 for slope). 


Results 
Family Structure and Instability 


Most children (80 percent) were born to two-biological married parents; 12 
percent were born to single mothers; and 8 percent were born to cohabiting 
parents. Most children experienced no family structure change from birth until 
the end of kindergarten. A quarter of the sample, however, did experience the 
exit or entrance of a biological parent or a parent's romantic partner at least 
once: about 11 percent experienced one family structure transition, 9 percent 
experienced two, and 6 percent experienced three or more transitions in these 
first six years of life. 

Over the six-year window in which family structure was observed, about 83 
percent of the sample spent some time living with two-biological married parents, 
more than a third (35 percent), however, spent time living in some alternative 
family structure. The most common alternative family was a single-parent family 
— about 30 percent of children spent some time in this structure. About 11 percent 
spent some time living with their cohabiting biological parents, 8 percent spent 
time living with their biological mother and her partner, and 4 percent spent some 
time living with their biological mother and her husband (see Table 2). 

As expected, family instability was related to family structure at birth, with 
fewer than 20 percent of children born into two-biological married parent families 
experiencing any family instability compared to more than half of those born into 
single parent families and nearly two-thirds of those born in cohabiting families 
(see Table 3). Among those born into two-biological married parent families who 
experienced change, the most common alternative family structure was a single- 
parent family (16 percent). Many children born into single-parent families spent 
time residing with two parents - their cohabiting biological parents (24 percent) 
or their biological mother and her partner (23 percent). More than half of those 
born into cohabiting unions (60 percent) spent some time in single parent families 
during their first six years of life. 

Table 4 presents means in the measures of family environment and children's 
behavior for different levels of family instability. On the whole, these means 
suggest that maternal depression was higher and increased over time at higher 
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Table 3: Living Arrangements in Early Childhood by Family Instability and Family Structure at Birth 





Number of Family Transitions 0 7 2 J+ 
n=757 n=110 n - 89 n = 56 
Family Structure at Birth 


Married (n = 811) 83.4 9.0 4.8 2.8 
Single (n = 119) 42.9 21.9 Beak, 12.6 
Cohabiting (n = 82 36.6 13.4 28.1 22.0 
Family Structure at Birth Married Single Cohabiting 
n= 811 n= 119 n= 82 
Ever lived in: 
Two-biological married parent family 100 11.8 22.0 
Two-biological cohabiting parent family T 24.4 92.7 
Married stepparent family 25 10.1 9.8 
Cohabiting stepparent family 3.0 22: 354 
Single parent family 16.2 100 59.8 


rates for mothers in unstable families. The levels of maternal sensitivity, the home 
environment and the income-to-needs ratio were lower for mothers in unstable 
families than for those in stable families, but there was no apparent association 
between family instability and changes in these family characteristics over 
time. Finally, children in unstable families had generally higher levels of problem 
behavior than those in stable families. 

Overall, these patterns demonstrate that a substantial number of children 
experienced family instability in their first six years of life. Furthermore, family 
instability was associated with both the home environment and children's behavior 
in first grade, making clear the need to control for family environment in analyses 
examining the links between family instability, structure, and children's behavior. 
The next set of analyses explored these associations in a multivariate context. 


The Link between Family Instability and Children's Problem Behavior 


In the first set of multivariate analyses, we investigated the associations between 
family instability and the three measures of children's problem behavior in first 
grade (see Table 5). Family instability was modestly related with teacher reports 
of externalizing behaviors and observed negative behavior with peers and was 
significantly related to observed disruptive behavior with teachers. Children 
in less stable homes displayed somewhat more of all three types of problem 
behavior than did children from more stable homes, net of all other factors. 
Family structure at birth was also related to observed negative behavior with 
peers and with teachers. Those born into single parent families engaged in more 
disruptive and negative behaviors with peers and with teachers than those born 
into married families, net of other factors. Children born into cohabiting families 
were also more negative with teachers and tended to be more disruptive with 
peers than children born into married families. There were no associations 
between family structure at birth and teacher-rated externalizing behavior. 
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Table 5: Main Effects of Family Instability on Children's Problem Behavior at the Transition 
to Elementary School 


Externalizing Disruptive with Disruptive with 


Behaviors Peers Teachers 
b SE b SE b SE 

Family Structure History 
Family instability Bak" "GA 10+ (.06) AA iy 
Family structure at birth 

Cohabiting -30 (1.07) .39* (.21) 2207 (.09) 

Single 48 (99 ët 2 (29) 24" ^ (.08) 
Family Environment 
Maternal Depression 

Intercept d'Ge .01 (.01) -.01 (.01) 

Linear slope 69+  (.39) .19* (.08) .03 (.04) 
Maternal Sensitivity 

Intercept 4.16%! 4.55) -.19 (.12) SE (£05) 

Linear slope -26.5+ (142) -1.51 (3.02) -4.92*** (123) 
HOME Score 

Intercept -1.18*  (.61) -.06 (.13) -.05 (.05) 

Linear slope -65.3 (40.13)  -30.9"* (8.35) -5.65 . (3.58) 
Income-to-Needs 

Intercept "10. 19 12) .02 (.03) 03% (.01) 

Linear slope -12.7 (17.8) 6.774 . (3.94) 245. (1.83) 
Child Characteristics 
Site -10  (.10) .02 (.02) 01 (.01) 
Female 13M (93) -.09 (.12) -54"* LU 
African American 3.1180 (099) .02 (.18) .00 (.08) 
Maternal Characteristics 
Age -02  (.06) -.01 (.01) Ose 101) 
Education 

Less than high school JJ 21.15) -.10 (22) a US) 

Some college -22 — CU) .07 (.16) -.02 (.07) 

At least college degree -1.10 — (.89) -.06 (.21) es (.09) 
Intercept 50.21 (1.88) -1.45 (.43) 1.30 (.18) 

r 1 - - 
-2 log likelihood - -702 -2327 
n 1002 961 961 


TUI "pist05.! **p-« 01 o p'<:001 


The mean levels and changes in the four dimensions of family environment 
also predicted child behavior. First, although the average level of maternal 
depressive symptoms was not related to behavior, children whose mothers 
experienced increases in depression during early childhood exhibited more 
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disruptive behaviors with peers and somewhat more externalizing behaviors. 
Second, children whose mothers displayed lower average levels of maternal 
sensitivity and declining maternal sensitivity engaged in somewhat more 
externalizing behaviors and manifested more disruptive behaviors with teachers. 
Third, children in homes with lower average HOME scores were rated slightly 
higher on externalizing behaviors, and those who experienced declines in home 
quality displayed significantly more disruptive behaviors with peers. Finally, 
children from families with higher average levels of income displayed more 
disruptive behavior with teachers than did children from lower income families, 
and there was some tendency for those whose family incomes had increased to 
show more disruptive behavior with peers. 

Some control variables were also associated with these behaviors. Girls 
and children of older women were less likely to engage in negative behaviors 
with teachers. Teachers reported that African American children exhibited more 
externalizing behaviors than other children, net of other factors, but race was not 
related to observer reports of behavior. Mother's educational attainment was 
related to children's negative behaviors with teachers: children of mothers with 
the least education (less than a high school degree) and the most education 
(college graduate or greater) displayed fewer negative behaviors than those 
whose mothers had a high school degree. 


Moderators of the Link between Family Instability and Children's Problem Behavior 


The next set of analyses explored the conditioning influence of family structure at 
birth and other measures of the family environment on the association between 
family instability and children's problem behavior. The results of these analyses 
are presented in Table 6. 

Family structure at birth moderated the links between family instability and 
all three types of behavior. Among children born into married couple families, 
those with more family transitions were rated higher on externalizing behavior 
than were those with fewer transitions. No significant difference in behavior was 
detected among children born into single parent families with different amounts 
of instability, but children born into cohabiting parent families that were unstable 
displayed fewer externalizing behaviors than did those in stable cohabiting . 
families (see Figure 1). 

Family structure at birth also conditioned the association between family 
instability and disruptive behaviors with peers and with teachers. For both 
outcomes, these associations were partially consistent with the hypothesis 
that children born into alternative families would be more negatively affected by 
family instability than those born into two-biological married parent families. For 
children born to two-biological married parents, negative behaviors with peers 
did not differ as a function of the number of family transitions. Children born to 
single parents who experienced more instability, however, did displayed more 
negative behaviors with peers than those who experienced less instability. Those 
born to cohabiting parents that experienced more instability also displayed more 
negative behaviors compared to those in stable families, but these differences 
were not statistically significant (see Figure 2). A similar pattern occurred for 
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Figure 1. Teacher Reports of Externalizing Behaviors: Family Structure at Birth by Family Instability 


negative behaviors toward teachers. 

The patterns for maternal depression were contrary to those predicted. 
The differences in externalizing behaviors and negative behavior with teachers 
associated with family instability were greatest for children whose mothers 
experienced the smallest increase in depression during the preschool years. This 
pattern is partly due to the fact that mothers who experienced the lowest increase 
in depression also showed higher average levels of depression, suggesting 


570 » Social Forces Volume 85, Number 1 * September 2006 





























+ 
c 
Ty 
z 
= 
+ 
wD 
E 
= 
E 
= Im 
> E 
A Se 
£ w 
= 
* © 
Fa © 
3 E 
o c) 
= 
5 
o 
: "EE 
Ë o 
2 = 
> 2 
T = T 
E o| = 
& s| 2 
S EU f 
E — I 
A e = 
G 
= vu) BS 
z — o 
= eno 
t š 1 
S E 
5 
ea = 
° < 
Ë £ £ 
: ° 
5 © 
m 
a E: 
a S 
E = 
c 
$ i 
v 
Z 
S 
° 
we 
i 
° 
E: 
ES 
5 
š 
v 
F c 
- N S 
SERS = So c KR N o 
=< 
S S1Ə9Əd UJIA SlOIAPeU9g eAndnusiq 
= 


that persistently depressed mothers may be driving this interaction. Moreover, 
increasing depression during the preschool years predicted negative behaviors 
regardless of family instability. The patterns for negative behavior with teachers 
are shown in Figure 3. As family instability increased beyond one transition, 
children of mothers who experienced modest increases in depression exhibited 
more disruptive behaviors than did all others. 

Maternal sensitivity also moderated the effects of family instability on children's 
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Figure 3. Observer Reports of Negative or Disruptive Behaviors with Teachers: Linear Chan 


Instability 


negative behaviors with peers and with teachers, and the patterns were partly 
consistent with predictions. For both types of behavior, the differences associated 
with family instability were greatest for children in homes with low average levels 
of maternal sensitivity. As illustrated in Figure 4a, for children with more sensitive 
mothers, the number of negative behaviors with peers was similar regardless 


572 ° Social Forces Volume 85, Number 1 * September 2006 


of family instability. For children whose mothers had low levels of sensitivity, 
however, those in unstable families had higher levels of disruptivé behavior than 
those in stable families. A similar pattern occurred for negative behavior with 
teachers. 

Changes in maternal sensitivity over the preschool years also conditioned 
the association between family instability and children's negative behaviors 
with peers. For children whose mothers demonstrated the greatest increase in 
sensitivity, there were larger differences associated with family instability than for 
children whose mothers did not increase their sensitivity (see Figure 4b). Once 
again, this pattern should be placed in the context of the main effect of changing 
maternal sensitivity — children whose mothers increased in sensitivity had lower 
rates of disruptive behavior with peers, independent of family instability. 

The average level of the HOME score moderated the association between 
family instability and children's negative behaviors to teachers. For children in 
homes that offered less support and stimulation, those experiencing family 
instability exhibited significantly more negative behavior with teachers than 
did those whose families had remained stable. Conversely, for those in home 
environments that offered more support and stimulation, there were smaller 
differences associated with family instability (see Figure 5). Changes in the home 
environment did not moderate the association between family instability and 
child behavior. 

Finally, the income-to-needs ratio during early childhood moderated the effect 
of family instability on observed negative behaviors with peers. For children in 
homes with low income-to-needs ratios, negative behaviors with peers were more 
evident for those with unstable families than for those in more stable families. In 
higher-income families, however, these behaviors were at similar levels regardless 
of family instability. Changes in income-to-needs did not significantly moderate 
the association between family instability and child behavior. 


Conclusion 


The increasing fluidity of family structure across childhood has fueled scholarly 
investigations of family instability and its implications for child development. This 
study grew out of and contributes to this emerging literature. Integrating two 
theoretical perspectives with a longitudinal, multi-reporter data source on early 
childhood, we examined how families change over time, how children navigate 
their entry into the educational system, and how these two processes are linked. 
We also investigated the conditions under which family instability is more or less 
consequential to students' classroom behavior at the start of formal schooling. 
Three key themes emerge from this research. 

First, studying the dynamic nature of family structure offers a valuable lens 
through which to view the role of the family in child development. Because of 
the unique characteristics of the SECCYD data set, we can view family structure 
prospectively in a large sample of American children. Although these data are 
not nationally representative, the picture of family structure and instability that 
emerges is similar to (though more dramatic than) what we have learned from 
other studies that rely on retrospective life history reports. For instance, roughly 
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equal proportions of respondents in the analytic sample drawn from the 1995 
NSFG sample (Teachman 2003) and our sample ever lived in an alternative family. 
The difference, however, is that the NSFG sample included family transitions that 
occurred up through age 15 and the SECCYD data reflects family transitions in 
only the first six years of life. Although other factors distinguish these two data 
sets (e.g., they are different cohorts), the prospective nature of SECCYD, we 
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Figure 4b. Observer Reports of Disruptive Behaviors with Peers: Linear Change in Maternal Sensitivity by Family Instability 
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argue, provides a more accurate picture of family structure and a more in-depth 
examination of the conditions within the family. In other words, it better captures 
the lived experiences of children. Our findings also indicate that it is conceptually 


and empirically useful to disentangle a family's structure at a point in time from 
its stability over time. 
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Second, we have reported a direct link between family structure history and 
problem behavior at the start of formal schooling. Family instability is related to 
each dimension of children's problem behaviors, and family structure at birth 
is related to observed problem behaviors in the classroom. These associations 
reached statistical significance even after important dimensions of the family 
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environment were taken into account. Children's behavior, including problem 
behavior, begins to solidify during the transition to elementary school, justas their 
academic sense of self and their parents' and teachers' behavioral and academic 
expectations of them become established. Consequently, how children make 
this transition, both academically and socially, is predictive of later academic 
achievement and behavior (Coie and Dodge 1998, Entwisle and Alexander 1993). 
Although these family effects are modest (the effect sizes of family instability 
and family structure at birth, net of other factors, exceed .40), they can magnify 
and accumulate over time, compromising children's developmental trajectories 
and exacerbating inequality among children (McLanahan 2004). 

Third, and perhaps most interesting, the link between family instability and 
children's behavior is not uniform, but is conditioned by the family context in 
which it occurs. We find evidence that family structure at birth moderates the 
link between family instability and children's problem behavior. Other moderating 
factors include levels of and changes in maternal depression and sensitivity, 
support and stimulation in the home, and family income. 

Family structure at birth serves as a proxy for different socialization contexts 
into which a child is born. It also captures unmeasured selection factors that 
contribute to whether a child is born into a married, single or cohabiting family. 
The fact that family instability appears to work differently across different family 
structures suggests that both socialization and selection processes matter 
to children's responses to instability. The interactions uncovered in this study 
indicate that children born to single mothers, a group likely to experience multiple 
family transitions, are generally more negatively affected by family instability than 
are those born to married couples. This finding is particularly interesting given 
the fact that a single mother family that makes one family transition is adding 
(typically) a male figure who might be expected to add income and parental 
resources. Of course, subsequent changes may remove any such benefits. 
Nevertheless, children who live in stable single-mother homes manifest fewer 
problem behaviors with peers and teachers than do children whose mothers 
acquire (and sometimes un-acquire) partners. 

The results for children born into cohabiting families are less consistent. The 
effect of increasing instability on children's negative behaviors with peers and with 
teachers was largely equivalent for children born into cohabiting unions and into 
married couple families. Interestingly, this pattern is quite different for children's 
externalizing behaviors. Children born into married couple families showed 
more externalizing behaviors as family instability increased, but the externalizing 
behaviors of children born into cohabiting unions decreased with higher levels of 
instability. This intriguing finding deserves investigation in the future. 

As expected, material and emotional aspects of the home environment also 
conditioned the link between family instability and children's problem behavior 
at the transition to elementary school. On the one hand, homes with high levels 
of emotional and material resources (e.g., maternal sensitivity, stimulating home 
environments, income) buffered the negative effects of family instability on 
children's problem behaviors. These findings suggest that the disruptive effects 
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of family instability on child behavior are more pronounced in family environments 
that already pose some hazard to child development (Scarr 1992). 

On the other hand, changes in supportive home contexts (measured over the 
child's first six years of life) did not buffer against the risks of family instability. 
In fact, relatively small increases in maternal depression appear to increase the 
negative effects of family instability. This is probably a ceiling effect, explained 
by persistently depressed mothers. Overall, children whose mothers had low 
average levels and declining levels of maternal sensitivity over their preschool 
years manifested more problem behavior in the classroom, as one might expect. 
Although high average levels of maternal sensitivity appeared to buffer the effects 
of family instability, changes in sensitivity had the opposite effect, suggesting that 
declining sensitivity reduces the differences associated with instability. Possibly, 
large declines in the sensitivity of mothers who are already relatively insensitive 
increase children's problem behaviors regardless of family instability. Children 
whose mothers had smaller decreases in sensitivity may have already been more 
sensitized to the disruptive effects of instability. 

Together, these findings expand our understanding of the linkage between 
family instability and children's problem behavior during a crucial developmental 
period. We have gained unparalleled insight into the topic, thanks to the 
prospective, longitudinal measures of family structure that capture family 
change as it is happening, longitudinal measures of the family environment, 
and the multiple sources of information about children's problem behavior. Still, 
the study has its limitations. First, the sample is not nationally representative, 
which limits the generalizability of these findings. Second, although this analytic 
sample is about 18 percent non-white, it is less ethnically diverse than the U.S. 
child population, restricting our ability to understand the ways the processes 
investigated here differ by race and ethnicity. 

The composition of American families has undergone significant changes over 
the past 50 years, and current patterns of marriage, cohabitation, and fertility 
suggest that this fluidity will continue. The time has come to better understand 
the ways families change and the implications of these changes for child well- 
being. Moreover, we need to move away from judging the health of families 
based on a snapshot of who is in the home, to a more holistic conception of 
families, where its structure and level of stability are considered along with the 
emotional, social, and material resources that define it. These resources clearly 
can buffer or exacerbate the impacts of family structure and instability for young 
children. 


Notes 


1. Twelve sets of cohabiting biological parents later married but this transition 
was not included in the family instability measure. Subsequent transitions 
such as divorce were included. 


2. Quadratic terms were also estimated, but none of these individual terms was 
Statistically significant. 
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Science of Science and Reflexivity 

By Pierre Bourdieu 

University of Chicago Press, 2004 [French edition 2001], 138 pages. $55 
(cloth); $22.50 (paper) 


This book is based on Bourdieu's final lecture course at the Collége de France. 
It attempts to show how science produces transhistorical truths despite the 
fact that scientific production occurs in specific cultural and socio-historical 
conditions. The aim is inspired by a fear that science is losing its autonomy 
to political and economic interests, a development he believes makes science 
"dangerous." Bourdieu claims that "relativism" is largely responsible for current 
political and corporate designs on science, and while he claims to seek a third 
way between "logicism" and relativism, the latter is his critical target. The book 
concludes with a discussion of reflexivity, where he claims to apply the same 
critique to himself that he earlier applies to the "new sociology of science." In 
fact, he does not; he presents an introspective on his biography, a reflection on 
his professional achievements in historical context. 

Before addressing the substantive aim of the book, it is necessary to say a 
word .about his terminology. When he speaks of "logicism" he is referring to 
the epistemological tradition associated with the Vienna Circle in the 1930s, 
particularly that espoused by its exiles to America, such as Rudolf Carnap and 
Hans Reichenbach, and currently known as "analytic philosophy." Clarification is 
also necessary with respect to his use of the language of "the new sociology of 
science." The notion of a new sociology of science is broad in the sense that it 
includes a wide range of work published since Kuhn's Structure. Bourdieu's target 
is just one sub-section of this work, that known as the "sociology of scientific 
knowledge" (SSK), with a few non-SSK "postmodernists" (such as Foucault) 
thrown in for good measure. 

Applying his theory of "the habitus and the field" to science, Bourdieu claims 
to "have been able to resolve the problem of the relationship between reason 
and history or of the historicity of reason, a problem as old as philosophy and one 
which, especially in the 19th century, had haunted philosophers." (54) He suggests 
that the contextual perspectives brought to the field by individual scientists (by 
virtue of their habitus), are disciplined and corrected by the structuring force of the 
field. But how can the field as a whole achieve transhistorical truth? His answer 
lies in the concept of the "autonomy of the scientific field," an autonomy that 
permits science to escape the corrupting force of outside interests. Autonomy 
is achieved when science erects a barrier between itself and the outside world. 
The process is explained by mathematization. With Newton (and Leibniz), 
mathematical mastery became the "price of entry" into the scientific field, 
drawing a sharp distinction between professionals and amateurs. Second, the 
idea of explanation is removed from the empirical/temporal world and grounded 
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in the mathematical/universal abstractions of "logicism." This leads to what he 
calls “desubstantialization,” an effect of mathematization whereby'science breaks 
free of Aristotelian substances, substituting the “logic of the manipulation of 
symbols” for the empirical observation of appearances. (49) The cumulative effect 
of mathematization is simultaneous closure and autonomization of science. It is 
this “closure upon itself of the autonomous field [that] constitutes the historical 
principle of the genesis of reason and the exercise of its normativity." (54) 

So how does this teleological vignette of the scientific revolution in physics 
relate to the science of science when applied to sociology? The answer is difficult 
to fathom, except to the extent that the authority of the field, its "censorship," 
becomes a "liberating censorship, which leads one to dream of the censorship 
of an ideally constituted field that would free each of the participants from the 
biases linked to his or her position and dispositions." (114) He parenthetically 
notes that mathematization occurred first and primarily in physics, had different 
"effects" in different contexts, and that its impact on the social sciences has 
"not [been] certain." (48) Thus from the outset he reinscribes the old (and now 
debunked) hierarchy of the sciences, placing math-physics at the pinnacle and 
social science at the bottom. This seems to run counter to his argument that 
sociology is a science like any other, and leads him to do precisely what he 
condemns Foucault and Althusser for (i.e., treating social science as a lower form 
of knowledge with a doubtful claim on the title of science). (104) It is only one of 
many contradictions and inconsistencies in this book. 

Bourdieu's argument about the censoring power of the field is not particularly 
new. Kuhn's conceptualization of a paradigm as a "disciplinary matrix" (1962/1970) 
captures the idea precisely. Indeed, the way Bourdieu promotes the importance 
of his own work makes this book read like a priority of discovery argument. He 
. claims to have discovered, and discovered more properly, what others have 
claimed for themselves. Indeed, he implies that he discovered the social character 
of science before SSK: “I wrote my first text on the scientific field, in the early 
1970s, that is to say, at a time when the ‘new sociology of science’ had not yet 
emerged.” (96) His article was published in 1975, but key works by Barry Barnes 
and Harry Collins were published before that, and Michael Mulkay's foundational 
critique of Mertonian functionalism was published in 1969. The journal of SSK, 
Science Studies (Social Studies of Science) was first published in 1971, and David 
Bloor published his book on “the strong programme” in 1976. 

Unfortunately, this is only one of many inaccuracies and misrepresentations. 
Bourdieu egregiously conflates SSK with post structuralism, thus implicating 
SSK in what he calls a "cynical" and "nihilist" view of science. Nothing could 
be further from the truth. SSK was the first attempt at a "science of science," 
and with a staunchly empirical research agenda, it revealed how all belief, 
including scientific belief, is socially constructed in particular historical and 
cultural contexts. Yet SSK said nothing about the truth or falsity of such beliefs. 
AS far as SSK is concerned, it is for those credentialed in a field of knowledge to 
decide, with respect to their own fields, what is true and what is false, what is 
good method or not. Indeed, SSKers think it rather immodest for philosophers or 
sociologists to believe that they could instruct other scientists in how to do their 
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Science better. Rather, through sociological research, SSK revealed that science 
was a tremendous cultural achievement, and an achievement to be lauded. Thus 
it is a gross misrepresentation to suggest that SSK was nihilist about scientific 
knowledge, or that it sought "destroy" science. SSKers love science, and that 
is one of the reasons they are led to study it. But they love it as it is, not as one 
might wish it to be. 

Unlike Bourdieu, SSK does not begin with a commitment to the idea of 
universal truth and then teleologically explain its genesis. Yet this does not mean 
that SSKers are epistemological and/or ontological relativists. Rather they are 
methodological relativists. Nor are SSKers the philosophical sociologists for 
which Bourdieu expresses such distain. SSK simply would not have been possible 
without a confrontation with analytic philosophy. Bourdieu expresses a similar 
distain for what he calls the philosopher historians (e.g. Foucault). He explains 
that he "firmly" held to the title “sociologist,” eschewing the "double game" and 
"double profit" of interdisciplinarity that he found "profoundly antipathetic," "not 
least because ... (it) ... seemed to me to announce a lack of ethical and scientific 
rigour." (106) It is for this reason, he claims, that he "could not enter the debates 
on science as they were conducted in the 1970s." (106) 

Bourdieu's negative perception of SSK in the early 1970s, no matter how 
wrong-headed, can perhaps be forgiven. But for him to continue to hold to it 
in 2001, and then to exhibit such vitriol about a field he failed to understand or 
contribute to, and in addition to claim that he in fact discovered all that SSK lays 
claim to before that field was even formed, is an undeniable failure of ethical 
and scientific rigor. For this reviewer, who was enlivened by Bourdieu's work 
as a student, and who has never subsequently been hostile to it, this book is a 
profound disappointment. 


Reviewer: Patrick Carroll, University of California, Davis 


95 Theses on Politics, Culture, and Method 
By Anne Norton 
Yale University Press, 2004. 160 pages. $26 (cloth) 


Norton's forcefully argued manifesto will appeal to the many students of politics 
and society who are alienated from the relentless foundationalism, essentialism 
and positivism of variable-oriented political science. Weaving together 
philosophical critique with empirically grounded argument, she fights on behalf 
of the many heterodox students and scholars who have broken with a bankrupt 
research paradigm. Her heroic mission is to bolster those research programs 
which respect social life's multivocality, complexity and contingency and do not 
treat social reality as a collection of preexisting factums awaiting discovery by 
the unbiased observer. 

Yet Norton's polemics fail short because she refuses to engage her adversaries 
on their own territory and prefers to fire slings and arrows from the safe distance 
of philosophy and metatheory. Her preference for this kind of argumentation 
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means that she often slips into the very kind of "abstract, universalistic" theorizing 
which she decries in the work of her essentialist and positivist opponents. By 
recruiting Hegel, Nietzsche, Lacan, Foucault, Wittgenstein, Derrida, etc. as allies 
in her struggle against the essentialist-positivist fallacies of the mainstream, 
Norton effectively limits her readership to those students and scholars already 
conversant with and sympathetic to the concerns of these thinkers. 

She opens the book with a group of theses (1-23) dealing with culture and social 
inquiry. Here Norton enlists arguments from anthropologists and philosophers in 
order to dispose of essentialist and positivist straw men. First on the chopping 
block is the unsupportable proposition that "Culture" and "cultures" both exist 
independently of language, time, social practice and relations of domination. 
A number of equally essentialist generalizations about identity, power and 
institutions are then dispatched. But it remains unclear who would endorse such 
propositions besides the authors of the frequently mentioned book Designing 
Social Inquiry. 

The next block of theses (24-50) concern institutions, identities, politics and 
power. Here she veers between uncontestable propositions such as "living within 
a culture entails a complex of relations to its practices and institutions" and 
overreaching propositions such as "there is no culture without resistance and 
critique." These latter propositions have no place in a book that aims to combat 
the universalist and absolutist formulations of mainstream social science. Norton's . 
methodological commentary, by far the most promising section of the book, 
begins with thesis 51: Facts do not Speak for Themselves. Here she pursues her 
constructionist agenda in a more fruitful way by unmasking the methodological 
myths of mainstream political science. Facts, she contends, actually issue 
from interpretative operations and “truth regimes” even if they are presented 
by mainstream political science as unconstructed datums. In the heart of this 
section, Thesis 64: , she questions the alleged rigor of variable-oriented research 
built around measurable properties and units of analysis. Here she rehearses 
familiar arguments about the rhetorical, performative and ritualistic dimensions 
of social science research, Norton's goals would have been better served had 
she gone after the fundamental premise of variable oriented research, namely 
that decontextualized abstractions can stand in as proxies for social action and 
interaction. 

Moreover, on closer inspection the propositions which appear the most 
subversive turn out to accord well with the prevailing methodological wisdom. 
The weaknesses of falsifiability, replicability and predictive success as criteria for 
theory testing are well-known within the community of variable-oriented social 
science practitioners. Her caveats about conflating causality and correlatedness, 
for example, would elicit no dissent from even the most blinkered partisan of 
"normal science." Moreover, for an empirically-minded reader, such weaknesses 
as those Norton diagnoses pale in comparison to the shortcomings of variable- 
oriented research analyzed by Ragin (1987) and Abbott (1997). 

The real strength and contribution of 95 Theses is apparent when one takes 
a step back from the particular arguments and considers the purpose of the 
book as a whole. Norton's multifaceted arguments and examples show how 
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mainstream political science appears to an observer standing at the margins of 
the discipline's positivist core. From her heterodox standpoint, she succeeds 
in baring the normally cloaked lines of power and authority which sustain the 
core's activities. Most importantly, she demonstrates that students of politics 
and society pay a steep price when they are unable to deal honestly and openly 
with fundamental issues like the role of power in the production of knowledge. 
Such silencing does no service to analysts interested in the multidimensionality, 
contingency and richness of social life. 


Reviewer: Jeremy Schulz, University of California, Berkeley 





Methods for Testing and Evaluating Survey Questionnaires 

Edited by Stanley Presser, Jennifer M. Rothgeb, Mick P. Couper, Judith T. 
Lessler, Elizabeth Martin, Jean Martin and Eleanor Singer 

John Wiley & Sons, 2004. 606 pages. $59.95 (cloth) 


Reliance on various methods to improve survey questions has grown steadily 
over the past two decades. Despite the increasing use of question testing and 
evaluation techniques among survey researchers, the theoretical and empirical 
literature on the topic remains relatively sparse. As the editors of Methods for 
Testing and Evaluating Survey Questionnaires note, existing texts universally cite 
the importance of conducting questionnaire pretests, but provide almost no 
guidance to researchers on how to do so. This monograph attempts to fill this 
gap in the survey methodology literature, and serves as a foundation for research 
on testing and evaluating survey questionnaires. The 24 papers included in this 
volume originated from the International Conference on Question Development, 
Evaluation, and Testing (ODET) held in November 2002. Following an introductory 
chapter by the editors, the monograph chapters cover all current areas of research 
in improving survey questions: cognitive interviewing, behavior coding, expert 
reviews, experimental designs, statistical modeling, focus groups and multi- 
method approaches. 

The next four chapters concentrate on cognitive interviewing techniques. In 
Chapter 2, Willis explores the possible theoretical underpinnings of cognitive 
interviewing, one of the few theoretical discussions in the monograph. What 
makes this chapter more remarkable is that, along with the next three chapters, 
it points out the generally-accepted usefulness of cognitive interviewing despite 
a strong theoretical basis. The last of these chapters, by DeMaio and Landreth, 
provides a useful comparison of various cognitive interviewing techniques and 
the kinds of results they produce. 

Chapters 6 through 8 provide examples of techniques to supplement 
conventional questionnaire pretesting, including analysis of behavior coding, 
response latency, and respondent debriefings. The chapter on van der Zouwen 
and Smit on behavior coding reminds researchers of the usefulness of this 
technique pioneered by Cannell and his colleagues at the University of Michigan 
more than 30 years ago. The authors show the value of analyzing behavior 
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patterns between respondents and interviewers during interviews for diagnosing 
problems with questions. 2 

The three papers comprising Part Ill address experimental approaches to 
evaluating and improving survey questions. Chapter 9 by Fowler makes the 
case for an experimental approach that has been around for some time, split 
sample designs. Through a series of experiments comparing pairs of original and 
alternative versions of questions administered to randomly split sample members, 
this research shows how such designs can indicate the magnitude of the effects 
of poorly designed questions on survey data. These examples also show how one 
evaluation technique, cognitive interviewing, is useful for identifying problems 
with questions, but the second technique, a split sample experiment, is needed 
to answer the question of how significant the problems are. 

The next section of the monograph includes three papers on an emerging area 
of research in question testing and evaluation, statistical modeling. In Chapter 
12, Biemer demonstrates how latent class analysis (LCA) can be used to identify 
survey questions that contribute significantly to measurement error in survey 
estimates. Such methods represent a relatively new approach to questionnaire 
testing and evaluation that hold the promise of predicting survey questions 
likely to produce unacceptable levels of measurement error. In this way, these 
techniques function as the statistical analog of qualitative techniques such as ` 
cognitive interviewing. 

Parts V and VI expand the usefulness of the monograph to survey practitioners 
by applying testing methods to a variety of common survey contexts. Chapters 
15 through 18 address various survey modes, including paper self-administered 
surveys, computer-assisted surveys and internet surveys. Chapters 19 through 
22 provide examples of question evolution with special populations such 
establishment surveys, child and adolescent surveys, cross-national surveys and 
surveys that require questionnaire translation. 

In contrast to nearly all of the preceding papers, the last three chapters 
illustrate multi-method approaches to questionnaire testing and evaluation. The 
final chapter by Forsyth, Rothgeb and Willis brings together several elements 
from the monograph by assessing a multiple-method, multi-stage approach to 
question assessment and improvement. This study was designed to answer two 
questions: whether pretesting results accurately predict problems in survey data 
collection and (2) whether efforts to revise questions based on pretest results 
actually improve data quality. The answer to the first question was clearly "yes;" 
the answer to the second question was "maybe not." Despite the mixed results, 
this paper points out one of the few important issues that the monograph does 
not directly address: what series of testing and evaluation methods provides 
the most useful information for a given survey context? Answering this question 
will require further research that evaluates alternative sets of procedures against 
one another in experimental conditions. lin Chapter 11. of the monograph, 
Tourangeau's discussion about the design features necessary for methodological 


Book Reviews * 589 


experiments in questionnaire testing suggests that survey methodologists are 
thinking seriously about how to move the research in a more scientific direction. 
To that end, this monograph represents a substantial step forward for research 
on improving survey questions. 


Reviewer: Douglas B. Currivan, Research Triangle Institute 
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The Chosen: The Hidden History of Admission and Exclusion at Harvard, 
Yale, and Princeton 

By Jerome Karabel 

Houghton Mifflin, 2005. 711 pages. $28 (cloth) 


In The Chosen, Jerome Karabel traces the emergence of the admissions process 
at the Big Three universities (Harvard, Yale and Princeton), and the evolving 
definitions of merit. He demonstrates that many contemporary practices, such 
as attempts at regional diversity and finding “well-rounded” students, emerged 
through the anti-Semitic and anti-intellectual stance of early 20th century college 
presidents and admissions deans. For example, facing a ‘Jewish problem” in the 
early 1920s, Harvard president Lawrence Lowell enacted limits on the size of 
incoming classes and a new emphasis on “aptitude and character” in admissions 
decisions. This standard, which became a predominant qualification for entry 
into these elite universities for several decades, was built on a white Anglo- 
Saxon protestant model of manliness. Using an ostensibly value-neutral yardstick 
allowed these universities to exclude many Jews Se well as women, blacks and 
Asians) from consideration. 

Admissions decisions are not simply the purview of admissions deans and 
college presidents. College alumni, faculty, private school administrators, students 
and the media play influential roles in the creation of admissions policies. In 
particular, college alumni and faculty place oppositional pressures on admissions 
departments, demanding consideration for less-gifted alumni children and 
more exclusive focus on academic qualifications, respectively. As 7he Chosen 
demonstrates, when admissions deans seek to balance these constituencies 
with the increasingly competitive college market, the goals of diversity and equal 
opportunity are alternately shunted and elevated depending on their potential 
to affect the status of the university. While the legitimacy of these institutions 
depends upon the appearance of diversity, their financial success often rests 
on exclusionary practices, such as legacy and early decision admissions. These 
competing forces of legitimacy and financial interests have complicated these 
universities' definitions of merit throughout the 20th century. 

The Chosen illustrates not only the past foibles of anti-Semitic and chauvinistic 
college presidents. Harvard, Princeton and Yale — along with elite colleges and 
universities throughout the country — currently espouse the ideals of regional, 
racial and talent diversity. Yet they miss incorporating class diversity in their 
student body. As the price of college attendance rises, and financial aid remains 
minimal in relation to what the average person would require to attend an elite 
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college, Karabel calls for attention to a remaining source of homogeneity in the 
student body - that of social class. While colleges and universities have become 
more visibly diverse in the past few decades, post-secondary education remains 
deeply stratified by class differences. As Karabel suggests, incorporation of 
social class background in a manner similar to current treatment of race within 
the admissions process might well be beneficial in light of the growing income 
and wealth inequality within American society. 

The Chosen is a tome befitting the shelves of anyone interested in the history 
of the elite, post-secondary education or the shifting and ever-present political 
battles over the contours of "merit." Though largely historical in nature, Karabel's 
work brings sociological analysis to bear on the subject of college admissions 
and convincingly demonstrates the weight of past policies on the present. The 
author uses historical documents, which are not often used in sociological 
research, such as internal communications, admissions handbooks and reports, 
student newspapers and yearbooks, and government reports to shed light on the 
changing nature of these three schools. In addition, he demonstrates the impact 
of historical events such as immigration in the early part of the century, World War 
Il, the space race and the race riots of the 1960s on college admissions policies, 
tying the narrow historical process of institutional policy with larger fluctuations 
in American ethos. 

The most important contribution of this book, however, is in illuminating ways 
that meritocratic standards may be constructed to systemically discriminate 
against particular social groups. This is not a new concept, but it bears repeating, 
as gatekeepers in various settings continue to use supposedly "objective" criteria 
to control access to elite spheres of power. 7he Chosen requires us to think 
critically about many of the standards that colleges and universities continue to 
employ, and whether they are truly objective or fair. 

Recent mistakes in SAT scoring have highlighted the stakes of the college 
application process for young people. The well-worn steps leading to college 
admission are taken by millions of young people every year. They take the SATs, 
gather recommendations, write heartfelt and strategic admissions essays, and 
enumerate their academic and extracurricular achievements. For some students ` 
and their parents, this process includes significant monetary investment in test 
preparation companies, tutors and highly-paid college admissions consultants. 
As college attendance increases each year nationwide, these strategies reflect 
the ratcheting up of competition for admittance to elite universities. Jerome 
Karabel's The Chosen does an excellent job of illustrating the political and social 
history behind this influential selection process. 


Reviewer: Jessica Halliday Hardie, University of North Carolina-Chapel Hill 
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Deserving and Entitled: Social Constructions and Public Policy 
Edited by Anne L. Schneider and Helen M. Ingram 
State University of New York Press, 2005. 372 pages. $89.50 (cloth); $29.95 (paper) 


In this collection of 13 essays, Anne L. Schneider and Helen M. Ingram bring 
together diverse examples of what they have, in earlier work, called “degenerative 
politics” toward “target populations.” Degenerative politics is “characterized by 
its exploitation of derogatory social constructions, manipulations of symbols or 
logic, and deceptive communication that masks the true purpose of policy." (p. 
11) Each chapter discusses a case of defining target populations, or the groups 
to which policies are explicitly directed, as deserving or undeserving of public aid, 
civil or political rights. Though this perspective is no longer as theoretically novel 
as it once was, it remains at underrepresented in studies of public policymaking. 
The historical and institutional sweep of the chapters in this book will, | hope, 
bring the analysis of social constructions in public policy closer to the core of 
policy studies. 

Two principal trends characterize this scholarship. The first sees policies 
as outcomes of political processes in which a particular group is identified as 
a problem, often by a moral entrepreneur, who then convinces opportunistic 
politicians of the benefits of scapegoating this group through legislation. 
This is the model advanced by Paul Nicholson-Crotty and Kenneth Meier's 
contribution to the volume. It appears in more or less subtle forms in 
contributions as diverse as Kay Schriner's fascinating account of the exclusion 
from suffrage of the mentally impaired, Stephanie DiAlto's systematic tracing 
of the passage of Japanese-Americans from internal threat to model minority, 
Dionne Bensonsmith's account of racial and gender constructions in welfare 
legislation, and Mara Sidney's subtle and revelatory dissection of the public 
construction of a "deserving" African-American middle class in the context of 
federal housing anti-discrimination legislation. 

The second analytical trend focuses on the implicit construction of target 
populations in policy implementation, as members of this population encounter 
the state and private policy actors in an effort to exercise their civil or political 
rights, or receive public aid. Joe Soss' comparison of recipients of Social Security 
Disability Insurance and AFDC shows that administrative practices based on 
entitlement and on suspicion, respectively, shape beneficiaries’ sense of social and 
political membership and efficacy. Michelle Camou's catalogue of stories told by 
neighborhood-based social service-providers in Baltimore shows the ways in which 
"conduct, lifestyle, and street culture" are placed "at the center of...discourse 
about revitalization." (p. 213) Camou deftly links the kinds of stories organizational 
actors tell to the kinds of organizations (e.g., religious or secular, focused on jobs, 
housing, health, education, etc.) they are. Interestingly, she finds that the religious 
organizations in her study often employed the /east moralistic and judgmental 
stories to describe their work and their target populations. Nancy Jurik and Julie 
Cowgill's chapter on micro-enterprise lending programs tells a story of complex 
pressures that lead to “creaming” and negative, individualizing constructions of 
those women who do not complete or are screened out of the programs. 


592 * Social Forces Volume 85, Number 1 * September 2006 


Two contributions are less easily classifiable. One is a chapter (previously 
published) by Sanford F. Schram on efforts by welfare advocates to fight the 
welfare reforms of the mid-1990s, which addresses the difficulty of fighting 
degenerative politics. Another is Laura Jensen's account of Revolutionary War 
pensions. By arguing that welfare was not, as commonly believed, a "black" 
program - prior to reform, more whites received welfare than blacks — Schram 
argues that advocates unwittingly reinforced the idea that were welfare a "black" 
program, it would not be worthy of saving: "The difficulty is in introducing [the 
connection between race and welfare] in a culture that is predisposed to talk 
about such issues in the worst possible ways, which serves to further reinscribe 
the prejudices and racial barriers that create the racial injustice in the first place." 
Schram continues: "[A]s the welfare population becomes increasingly nonwhite, 
the dilemma intensifies and the situation becomes even more urgent." (p. 284) 
Jenson argues that by granting pensions to some Revolutionary War veterans, 
Congress not only constructed these veterans as deserving citizens, but also 
opened up the possibility that other veterans could also claim benefits. Moreover, 
Congress changed the very ways in which petitions for public benefits would 
henceforth be made, by opening itself up to claims for group, or categorical 
deservingness, rather than case-by-case petitions for help. Like Soss and DiAlto, 
she is interested in the ways in which targets or potential targets of policy 
try to shape politics in non-degenerative ways. Like Schriner, Jenson raises 
key connections between the mechanisms of political representation and the 
representations of social groups. 

This would be an ideal book to anchor and balance general public policy 
courses at both undergraduate and graduate levels, but | cannot use it. The page 
proofs were never edited. | have never seen a book so clearly and insultingly 
. bereft of editing. Several chapters are rife with misspellings, misstatements of 
fact and awkward writing. The otherwise fine preface by Deborah Stone contains 
instructions to the non-existent copyeditor to check the page numbers of internal 
citations. What example does that set for my students? The authors and their 
readers deserve and are entitled to better. 


Reviewer: John Krinsky, City College of New York 





Identifying Talent, Institutionalizing Diversity: Race and Philanthropy in 
Post-Civil Rights America 


By Jiannbin Lee Shiao 
Duke University Press, 2005. 295 pages. $84.95 (cloth); $23.95 (paper) 


In /dentifying Talent, Institutionalizing Diversity, Jiannbin Lee Shiao, explores 
the formation of race policy in elite private institutions in the post-Civil Rights 
era. Citing Omi and Winnart (1986), he asserts that "race must be understood 
as an unstable and 'decentered' complex of social meanings constantly being 
transformed by political struggle," and that foundation diversity policy offers 
unique insight into how "private institutions have negotiated the inherent 
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complexities and contradictions of attempting to institutionalize a revolution in 
race relations." 

Shiao argues that in the 1980s foundation thinking about race policy and 
grant making underwent an evolution from a black-centered, radical perspective 
focused on "good causes" to a non-white-centered approach that emphasizes 
good strategy and "inclusive expertise." The new paradigm is to move beyond 
confrontation to achieving advancement through professional channels in 
mainstream institutions. 

What prompted the shift? Shiao's answer rests on an analysis of the leadership 
role of The Ford Foundation. Though the new wave of immigration begins with 
passage of the family reunification legislation in 1965, it is not until the Reagan 
presidency that foundations, influenced by Ford, craft a "post-black" discourse 
to limit the effects of conservative political forces. This thesis rests on an analytic 
reading of the main field publication, Foundation News & Commentary (from 
1961-1988), and a comparative examination of the development of race-based 
initiatives at the Cleveland and San Francisco foundations. 

His history concludes that Ford is politically “activist” but “elitist,” and grantor 
funds have a positive effect but ultimately serve to de-radicalize movements. 
Translating to the diversity paradigm he cites Allen (1969) who argues that white 
business support of black protest groups can be attributed to the larger economic 
system's ability to transform “expressions of conscience into instruments of 
control.” Does Shiao buy this neo-colonial view? No, his approach is more nuanced. 
He writes that Ford's ultimate motivation is to develop particular relationships. In 
the 1960s the purpose of the relationships was to develop demonstration projects 
for adoption by governmental agencies, and in the conservative 1980s, to secure 
institutional stature in the aftermath of "losing significant political capital," as 
Ford searched for new allies in its efforts to further racial equality. 

The culmination of Shiao's work is the argument that racial formation in the 
post-Civil Rights era occurs in an institutionally segmented context. Interviews 
with foundation trustees, program officers, local political informants and 
nonprofit heads in Cleveland and San Francisco give rise to a nuanced framework 
for understanding how racial identification affects professional performance and 
institutional culture. Shiao's well taken finding is that diversity policy provides a 
limited account of how philanthropic actors view foundation participation in race 
relations. Race has secondary saliency, dependent on the individual's position 
(trustee or staff) in the institution. 

The heart of his argument is that race relations are not limited to issues 
between individuals and the state, but also occur and are shaped in various 
institutional domains. Institutional segmentation offers a theory of how a 
legislative agenda - arising out of a set of historical circumstance and affected 
by changing political, economic and demographic developments - is shaped by 
private elite organizations. The focus of this study is foundations. However, the 
analysis lends itself to other hierarchical institutions, and the use of the theory to 
understand the business sector's diversity practices is not lost on the author. 

His conclusion is simultaneously heartening and foreboding. “What 
distinguishes the post-Civil Rights era may not be a new monolithic hierarchy 
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but a crazy quilt of institutionally differentiated racial policies," and "we can no 
longer speak of race relations in singular terms; instead we must türn to studying 
its relative coherence across [and within] institutional settings." This well may be 
the case, but remember that the impetus behind the Civil Rights movement was 
to impose a universal legislative solution to what was an oppressive blanket of 
private, personalistic and coporativistic attitudes that defined race relations. 

The volume is a welcome contribution to the field. It incorporates much of the 
relevant literature, and is rightly positioned among the small number of foundation 
histories that embed foundation activity in the larger historical, demographic and 
political contexts. There are constraints. Shiao bases a great deal on his reading 
of Foundation News & Commentary, which may be better suited to reflect various 
intellectual currents than offer a key to foundation policy. Though Ford is viewed 
as the dominant institutional player, its archive is not consulted. And finally, aside 
from a selected racial and gender counting of foundation personnel, the study 
lacks an empirical basis. Interviews offer perspectives, but can not evaluate the 
effectiveness of foundation policies and projects. We are left with the same 
nagging imprecision on assessing the impact of foundations on the communities 
in which they operate. Alas, this is less a critique of this worthwhile volume, than 
of the field itself. 


Reviewer: Eugene D. Miller, City University of New York 





The Art of Surrender 
By Robin Wagner-Pacifici 
University of Chicago Press. 210 pages. $49 (cloth); $19 (paper) 


The Art of Surrender explores the cultural process through which the end of 
conflict is accomplished when one side capitulates to another after a period of 
armed struggle. Using textual materials and visual representations of surrender, 
Robin Wagner-Pacifici demonstrates the centrality of surprisingly complex 
Systems of meaning. Through in-depth analysis of the handing over of the 
city of Breda during the Thirty Years War (1625), the end of the American Civil 
War at Appomattox (1865) and the Japanese submission in Tokyo Bay (1945) 
she shows that the surrender is about far more than just "giving up" or even 
saying “we give up” in a recognizable and diplomatically acceptable way. This 
is an event in which signals are sent about identity and dignity, sovereignty and 
autonomy, beginnings and endings, forgiveness and endurance, witnessing 
and suffering. Such communicative gestures can be intended or unintended, 
contemporaneous with the military defeat or a subsequent reflection upon it. 
Indeed Wagner-Pacifici's book weights the second term in each of these binaries. 
Those meanings that are accidental, unplanned, spontaneous or "given off" in 
Goffman's sense seem to be the most telling for sociological analysis. So are 
later representations in art, literature and the collective memory. These speak to 
the surrender as a moment of problematic cultural reordering rather than simply 
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military conquest and defeat. As such it is an episode requiring deft performative 
skills, one often marked by coded ambivalences and complicities that can be 
traced through the careful reading of political documents, the works of artists 
and photographers, autobiographies and letters. 

Most obviously there is a thematic fit with Wagner-Pacifici's long running 
interest in violent confrontations and especially the ambivalent and polyvalent 
ways that these run their course and leave a stain on the collective imagination. 
Whether she is looking at the kidnapping of Aldo Moro, the Vietnam Veterans 
Memorial or the MOVE siege in Philadelphia, the story has always been one of 
traumatic events that refuse any clean cultural resolution or interpretation. In her 
analyses these episodes are shown to be haunted by the ghosts of unfulfilled 
possibility, incompleteness and existential anxiety. The closure never comes. It 
should also be noted that 7he Art of Surrender fits into a more general theoretical 
movement in Wagner-Pacifici's work away from a structuralist and institutional 
analysis of culture, and towards hermeneutics and humanism. Hence we 
encounter an exquisite and refined sensibility at work in this volume, one that 
seems at times almost melancholic in orientation. This is most evident when the 
author picks up on the little things that would simply pass by most observers, 
seen but not noticed. The enigmatic folded paper blown in the wind at the bottom 
of a Velazquez; the curious repetitions in Al Gore's concession speech to George 
W. Bush after the 2000 election; the distracted meanderings of General Grant's 
conversation with General Lee as they meet face to face to agree the terms of 
surrender; the peculiarly casual demeanor of the sailors crowding the decks of 
the USS Missouri at a turning point in history. These appear in 7he Art of Surrender 
more as accidental and poignant tokens of the human condition, less as dedicated 
signifiers of anything in particular. They are markers of our irreversible movement 
through time. Finally we should note that this book is about a particular kind of 
situation or encounter, and in this sense it continues the tradition established 
in the author's earlier study of "the standoff." This is an important move which 
enables Wagner-Pacifici to shift cultural sociology towards the territory of symbolic 
interactionism without reverting to methodological individualism or defining "a 
situation" restrictively in terms of the face-to-face encounter. This is a relational 
sociology not an essentialist one, and moreover it shows that the deep structure 
of the situation is one underpinned by a symbolic order, even if this is fragile and 
provisional. Situations are like texts. They might be populated by active bodies 
and agile minds but she insists they must be primarily understood in terms of the 
overarching cultural scripts that inform action and identity. 

The Art of Surrender is an often brilliant book from one of the foremost 
interpreters in American social science. Subtle, insightful, erudite, astute, 
inquisitive, profound: these are the appropriate descriptors that come to mind. 
This is a work from the tradition of Simmel's essay on flirting or Geertz's on the 
cockfight which propel the argument that social life is never quite what it seems, 
that things are always just a little more complex, ambivalent or emergent than we 
had previously imagined. The cost for such sensitivity has long been that readers 
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have to work just a little bit harder to make connections with more general or 
generalizable social and cultural theory. In this case the price is worth paying. 


Reviewer: Philip Smith, Yale University 





In the House of the Hangman: The Agonies of German Defeat, 1943-1949 
By Jeffrey K. Olick 
University of Chicago Press, 2005. 380 pages. $29 (cloth) 


How does a nation make sense of war crimes committed in its name? In what ways 
do the victors' assessment of guilt shape policies for post-war reconstruction? 
Jeffrey Olick addresses these questions through an intriguing historical analysis 
of post-war Germany from 1943 to1949. In this study of collective memory and 
political culture, Olick acknowledges parallels that have been drawn between 
German reconstruction debates and recent debates concerning “state-building” 
in post-war Iraq. Although these two cases exhibit many differences, the value 
of Olick's work lies in his concern with issues of collective responsibility and 
transitional justice, which speak to the human condition today as loudly as they 
did 60 years ago. 

Drawing on both primary and secondary sources, some written in German, 
Olick details the debates that constituted the "official memory" of American and 
allied military and political officials and the German public about the "Nazi past 
and German future." A dialogic sociology of memory guides the reader through 
this contested terrain as changes in collective memory are identified over time in 
dialogues with earlier images of the past. Chapter 1 poses the central question 
running throughout the book: Was the "stain of guilt" for the atrocities of World 
War Il on the German people and culture, or limited to the Nazi Party? 

The first half of the book, Chapters 2 through 6, consists of debates about 
allied occupation policies during and after the war, which Olick argues created a 
"powerful reference point" that shaped much subsequent German reaction. The 
reader reviews competing narratives of "unconditional surrender" and polemics 
. regarding whether Germany was "liberated" or "defeated," as well as the myths 
that emerged to explain occupation policies, especially the vilification of the 
Jewish-American Secretary Treasurer Morgenthau. Olickalso examines competing 
proposals of American state actors for a "soft peace" or a "hard peace," with 
President Roosevelt supporting the latter due to his refusal to separate regime 
from vo/k and his disdain for "Prussian militarism." Olick looks at theories built 
upon the presumptions of collective guilt and social pathology along with related 
policies for the re-education and de-Nazification of the German people. Allied 
officials proposed reorganizing the universities and changing school curriculum 
as solutions, and the Nuremberg trials were designed to educate Germany in 
moral lessons of responsibility. An examination of how allied occupiers were to 


deal with lower-level Nazis, mediated by questions of blame and responsibility, 
concludes the first section. | 
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The second half of the book, Chapters 7 through 13, considers the polemics 
that constituted the historical memory of the German people, examining carefully 
the construction of German history and national identity by historians, social 
scientists and philosophers, including Thomas Mann, Carl Jung and Theodore 
Adorno. The Protestant and Catholic churches' relationship to the Nazis, and 
their views on responsibility for war atrocities are reviewed as are discourses 
produced by post-war political parties, foremost the re-emergent SPD and the 
newly organized Christian Democratic Union. 

Olick employ's Ruth Benedict's framework of guilt and shame to analyze 
the German public's discourses and silences in response to war atrocities. 
Guilt requires that individuals' account to themselves for their actions and to 
transform. The writings of Carl Jaspers among others were representative of the 
"public discourse" of guilt. Shame sees confession as the betrayal of self; Martin 
Heidegger's philosophy was representative of this cultural trend and the "private 
silences" among post-war Germans. Olick concludes by pointing out that it is not 
his intention to answer questions about collective guilt but to ask them, although 
he does identify consistencies in answers presented by the German writers. One 
consistency in public discourse was the projection of Germans into the status 
of the Jews, as the occupiers had turned them into a "a pariah people." Other 
continuities were a commitment to German national identity and a preoccupation 
with refuting accusations of collective guilt. 

This book illustrates the insights gained through a sociology of memory that 
recognizes "reconstruction," the mixing of the old and new in a process of critical 
appropriation by social actors. The book concludes with a discussion of the value 
of dialogical methods and what empirically constitutes historical memory, which 
would have been more helpful in the introduction. That said, Olick's work presents 
a complex and engaging integration of moral questions with political history 
and current events; it will be of value to students of German history, historical 
memory, historical methods, ethics and philosophy. As Americans grapple with 
causes and blame for the death and destruction of the Iraq war, this book alerts 
us to pathways to the acceptance and denial of guilt for war atrocities, as well as 
to our potential for social transformation. 


Reviewer: Victoria Johnson, University of Missouri-Columbia 





The Demography of South Africa 
Edited by Tukufu Zuberi, Amson Sibanda and Eric Udjo 
M.E. Sharpe, 2005. 310 pages. $121.95 (cloth) 


The Demography of South Afríca is a valuable reference book because it provides 
insights regarding quality of demographic data from censuses and vital registration 
at various points in South Africa's history. The stated purpose for the volume is 
"to outline the demographic contours of South Africa," but as part of providing 
this general demography of South Africa, many of the authors delve into issues 
of data quality, representativeness and comparability. Given how the political 
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history of the country shaped the collection of demographic data, addressing 
such questions is of course necessary. For example, the first chapter by Khalfani 
et al. gives a history of racial classification across South African censuses. It has 
a detailed section on the last census to take place under apartheid (1991), and 
describes the failure of that census to successfully enumerate the population. ! 
can easily imagine anyone seeking to analyze change over time needing to refer 
back to the detailed information on sources of undercount. 

This level of technical detail is maintained throughout the book. Perhaps this 
is not surprising since the book resulted from collaboration between the African 
Census Analysis Project and Statistics South Africa: both groups have a vested 
interest in making the available data as useful to researchers as possible. As 
such, strengths and limitations of the data are consistently documented, even 
in chapters that have more substantive aims. To again give an example, Udjo's 
fertility chapter seeks not only to document fertility levels, differentials and 
trends, but also to assess whether apartheid era projections of the future size of 
the African population were inflated by overestimation of fertility. 

My description of this volume as a valuable reference work is not entirely 
kind because | found some of the analyses fairly boring to read. Bah's technical 
appraisal of official South African life tables and how their methodologies 
changed over time in keeping with advances in demography again gives 
information vital for further time trend analysis, but holds no sociological interest. 
As a further example, Udjo documents "peculiar" patterns in mortality that are 
left unexplained. These data anomalies are good to know about when using the 
data in future work, but alone they just seem peculiar. Bah also has a chapter that 
estimates the size of the effect of HIV on mortality three different ways, only to 
conclude that although current data do not permit accurate assessment of cause 
of death, HIV/AIDS is a major health problem. 

The final two chapters of the book showed a very high quality of writing; | 
also found them straightforward and simply more interesting than most of the 
rest. Sibanda and Lehloenya provided a descriptive analysis of gender, race and 
regional gaps in education that seemed to me exactly what | should expect from 
a book purporting to provide a general demography. They showed little gender 
inequality in education, but much according to race and region of residence. 
Zuberi and Sibanda provided an analysis of migrant status and employment 
probabilities with special attention to whether migration had occurred after the 
collapse of apartheid. They compared internal migrants, those from within the 
South African Development Community, and other international migrants. In so 
doing, they documented labor force returns to migration within the context of 
political history. 

Two other interesting chapters on household structure - Noumbissi et al. and 
Zulu and Sibanda - seemed a little detailed for a general demography. In both of 
these chapters as well as elsewhere in the book, it seemed like part of the point 
was to do as much as possible with census data. On the one hand, demonstrating 
that census data can address questions of sociological import has the potential 
to stimulate further research. On the other hand, the rigor the authors' brought 
to the analysis also highlighted much that cannot be addressed with census 


Book Reviews ° 599 


data, and therefore might have unintentionally cast doubt on this as a valuable 
data source. For example, Noumbissi et al. show differences between surviving 
children reported by mothers and surviving mothers reported by children that are 
not consistent across racial groups; they discuss why the gap is smallest for the 
white population, but these explanations cast doubt on the utility of census data 
for further analysis of child fosterage. 

While a number of the conclusions presented were necessarily tentative 
because of data issues, the book nonetheless valuably provides plausible ranges 
of demographic levels and racial differentials. Sibanda and Zuberi find racial 
differences in age at first birth that persist even with socioeconomic and regional 
controls. Likewise, Udjo's mortality trend analysis suggests that factors besides 
HIV prevalence are necessary to explain the increases in infant mortality in recent 
years. These are the kinds of findings from a general demography that could 
spark additional fruitful work, especially because South Africa is a fascinating 
country for demographic inquiry. 


Reviewer: Laurie DeRose, University of Maryland 





America and the Challenges of Religious Diversity 
By Robert Wuthnow 
Princeton University Press, 2005. 391 pages. $29.95 (cloth) 


| write this review as the political debate on immigration is in full fury (April 2006) 
— a good time to consider religious diversity, which is often the subtext of the 
rapidly burgeoning literature on immigrant religion in the United States. And 
yet, one of the things that makes this book so appealing, at least to me, is that 
Robert Wuthnow does not confine himself to a simple equation of immigrants 
— diversity. While immigrants bring new religions to this country now as in the 
past, non-Christian immigration is not the sole source of contemporary religious 
diversity. The individuation and privatization of religion — also an historical reality 
— has increased in the United States since the social and cultural changes of 
the 1960s. Thus, religious experimentation, or "spiritual shopping," may involve 
engagement with Buddhism, Islam or Sikhs, but it need not. Plenty of Americans 
are constructing their own bricolage of faith from Judeo-Christian sources, and 
this diversity, Wuthnow argues, has significant cultural implications as well. 

Wuthnow studies the sources and consequences of religious diversity in the 
contemporary United States. He examines where diversity comes from, who is 
attracted to it (or not), how it affects American culture — from national identity 
formulations of a "Christian nation" to the ability of people to manage religiously- 
mixed marriages, and how we might best go from the demographic facts of 
"diversity" to the cultural valuing of "pluralism." 

In what | found to be a bit of a strange decision, the book is organized so that 
the opening chapters focus on the consequences and significance of religious 
diversity, then come chapters on Americans’ relative openness to diversity, then 
a chapter on the social sources of exposure to and acceptance of diversity, then 


600 * Social Forces Volume 85, Number 1 * September 2006 


chapters on two key sites for managing diversity, then a conclusion on producing 
a "reflective pluralism." While Wuthnow writes in an accessible'style, and the 
consignment of many scholarly details to endnotes shows that the press hopes 
fora certain amount of trade sales, | found that the organization kept the argument 
from flowing as smoothly as it might have. 

Also, while Wuthnow uses a new data set (Religion and Diversity Survey) to 
explore the causes and consequences of religious diversity, the presentation of 
the data was not as systematic as | would have liked. In much of the first half of 
the book we get data in the form of interview quotes or respondents' personal 
stories and anecdotes; there are occasional sections from a (presumably 
pseudonymous) respondent that narrates his or her religious biography. Not until 
Chapter 7 is there a systematic presentation of the data in a way that provides 
the reader with more generalizable information on relative exposure to diversity, 
assessments of it, religious participation and beliefs, etc. 

Nonetheless, the argument is solid, if at times unsurprising. Religious diversity 
matters, claims Wuthnow, because we have a national mythos as a "Christian 
nation." At the same time we are proud of our public civic language of tolerance 
and respect for differences. Historically, there has been in the United States a 
"theology of exclusivism" vs. a "civic code of pluralism." (10) On top of this, 
"religion is something people care enough about to fight over." (302) To the extent 
that we take any two of these elements seriously, the third one is problematic. 
Too often, balancing this has often resulted in a either or both a thin commitment 
to pluralism (usually an avoidance of truly confronting differences) or to religious 
commitment (often a non-reflective "all religions are basically the same"). The 
modern liberal solution of keeping the public open and officially pluralist, while 
religious commitment remains private, is foundering on the increasing range of 
religious diversity, and the fact that many forms of religious identity must be 
public to be meaningful. 

Wuthnow offers a typology of Americans' response to religious diversity. 
There are "spiritual shoppers," who experiment, explore and individualize their 
religion. Not surprisingly, they are comfortable with and often intrigued by 
diversity. "Christian inclusivists" consider themselves decidedly Christian, but 
welcome other faiths and innovative versions of Christianity, even if they do not 
engage them very seriously. Finally, "Christian exclusivists" are convinced of 
the superiority and even the unique truth of Christianity, and often worry about 
increasing diversity. Wuthnow notes that seldom do exclusivists actually express 
hostility to religious others or work specifically to convert them; for one thing, 
they are often too enmeshed in their own homogeneous religious environments 
to encounter much diversity. Nonetheless, they form a substantial population 
that will resist too much true pluralism. 

Exploring the social correlates of these types shows that age, education, and 
region and place of residence have fairly predictable effects on exposure to, 
interest in, and acceptance of, religious diversity. The finding that continued to 
strike me is the religious homogeneity of the social worlds of those most worried 
about and resistant to pluralism. 
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Two chapters examine institutional management of diversity. One is on 
Christian congregations, although most "management" involves implicit 
strategies of avoiding the issue. This may de-escalate exclusivist rhetoric and 
promote a certain tolerance, but Wuthnow notes it has done little to challenge 
"Christian nation" assumptions. The other chapter examines religiously mixed 
marriages; here Wuthnow finds the discourses of interfaith marriage embedded 
in the civic cultural languages of tolerance, diversity, cultural sophistication, and 
the privatization and individuation of religious identity and conviction. 

At a time when public religious life and national identity are so debated and 
contested, this book is welcome. It is reasoned, carefully documented, and both 
sympathetic and critical. | hope it finds an audience beyond the academy. 


Reviewer: Rhys H. Williams, University of Cincinnati 





One Nation Uninsured: Why the U.S. Has No National Health Insurance 
By Jill Quadagno 
Oxford University Press, 2005. 274 pages. $28 (cloth); $15.95 (paper) 


In One Nation Uninsured, Jill Quadagno provides a sweeping and detailed 
account of the U.S. health insurance system's development during the 20th 
century. Drawing on extensive research of primary sources (including archived 
documents, written oral histories and interviews), Quadagno challenges the four 
political theories of the welfare state that are commonly used to explain the lack 
of national health insurance in the United States and offers compelling evidence 
for an alternative explanation. 

The first explanation for why the United States has no national health insurance is 
the existence of an antistatist political culture, which tends to lead towards private 
solutions to public problems. Quadagno argues that although antistatist values 
have emerged in debates about various national health insurance proposals, it is 
not clear how these values are translated into policy or why legislation has been 
enacted that contradicts these values. The second explanation is that the trade 
unions in United States are relatively weak, particularly compared labor-based 
political parties in other countries. Yet, as Quadagno notes, in several cases labor 
unions have mobilized to shape welfare policy. 

The third explanation focuses on racial politics. Quadagno provides evidence 
that during the first half of the 20th century southern Democrats colluded with 
conservative Republicans to undermine national health insurance proposals. 
However, she also demonstrates that the ability of southern leaders to shape 
national welfare policy was weakened during the 1960s. The fourth explanation 
attributes the lack of national health insurance in the United States to the 
decentralized structure of the American state. Such a system reifies existing policy 
and inhibits innovation by creating special interests that pressure politicians at 
various levels of the government to raise opposition to proposed changes. But 
Quadagno points out that this perspective does not explain how the interests of 
various groups are translated into particular political actions. 


602 * Social Forces Volume 85, Number 1 * September 2006 


Quadagno readily acknowledges that these four explanations can be used to 
understand why specific national health care insurance proposals were defeated. 
However, they do not provide a comprehensive understanding of national health 
care insurance debates over the course of the twentieth century. To do that, 
one has to examine the "persistent threads in the campaign against universal 
health care." (15) Quadagno reveals those "threads" in chapters 1 through 7 by 
unraveling the historical fabric that surrounds the ongoing struggle to establish 
national health care insurance. 

Chapter 1 provides a detailed account of how during the first half of the 20th 
century national health insurance was portrayed by key interest groups as a 
socialist-inspired plan to undermine American liberties. Chapter 2 documents how 
the activities of organized labor from the end of World War Il until the middle of 
the 1960s fostered the development of the private health insurance system and 
promoted disability insurance and Medicare. Chapter 3 details how the sovereignty 
of private hospitals was undermined during the 1950s and 1960s by court rulings 
and federal legislation. Chapter 4 focuses on Medicare implementation, which 
made health care profitable and led to skyrocketing health care costs. Chapter 5 
demonstrates how during the 1970s through the 1990s stakeholders were able 
to fend off national health insurance proposals and federal government attempts 
to contain costs. In the meantime, according to chapters 6 and 7, corporations 
used their purchasing power to promote managed care and the position of private 
insurers as the primary provider of health care insurance was solidified. 

Throughout these chapters, Quadagno builds a case for the stakeholder 
mobilization explanation of why the United States has no national health 
insurance. She demonstrates how proposals for universal health care coverage 
were effectively resisted by stakeholders including physicians, businesses and 
insurers. The power of particular stakeholder groups waxed and waned over 
time, but all of them were associated with organizations that had a federated 
structure, which is characterized by national leadership, state level offices and a 
local network, that enabled the development of grassroots campaigns. 

Therefore, any successful reform effort has to involve a coalition of national 
health insurance stakeholders that includes organizations with a federated 
structure that can facilitate effective lobbying efforts. Even then the proposals 
for change that are likely to be the most successful are those that do not 
pose a major threat to private health care stakeholders. Given this, Quadagno 
advocates relatively modest but realistic changes such as Medicaid expansion 
and modernization, a federal voucher to purchase health care, allowing Federal 
Employees Health Benefits Program buy-in, and establishing a stop-loss plan 
in which the federal government would reimburse health plans for the cost of 
catastrophic cases. 

One Nation Uninsured is a clearly written, informative book that would be 
of interest to academics studying health care, politics, policy or U.S. history, 
and members of the lay public who want a better understanding of why the 
United States does not have universal health insurance. It would also be 


accessible to students, particularly those in advanced undergraduate courses 
or graduate seminars. 


Reviewer: Janet Wilmoth, Syracuse University 
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Sourcebook of Labor Markets: Evolving Structures and Processes 
Edited by Evar Berg and Arne L. Kalleberg 
Kluwer, 2001. $155 (cloth) 


Voices from the sociological tradition are easily lost in wider discussions of labor 
markets and other economic fields of study. One of the most striking features 
of this book, therefore, is its concentration in one place of work on labor market 
questions by sociologists, rather than economists (a few welcome exceptions 
notwithstanding). And concentrated it is: the book is more than 700 pages, with 
27 chapters written under names familiar to most sociologists in this area. 

The breadth of the book is impressive, covering topics as diverse as economic 
inequality (Nielsen and Alderson), the future of the labor movement (Cornfield and 
Fletcher), the decline of internal labor markets (Cappelli), sex segregation (Jacobs) 
and the gender pay gap (England et al.), race segregation (Kaufman), social networks 
(Marsden and Gorman) and immigration (Bean et al.). On the whole, the chapters 
attempt to summarize theoretical perspectives and present empirical trends and to 
describe the state of the methodological art in the areas they cover; some present 
original analyses of their own. Most of these chapters are not ground-breaking but 
rather ground-covering. The result is a book that will be useful in many ways to 
many people, but is not likely to be read beginning to end. 

The editors, acknowledging their "commitments to our contributors' rights" 
report refraining from aggressively shaping the pieces they assembled. They 
save their own views for the introductory pieces and an afterword, in which they 
argue that "the transition to price competition has been the most fateful factor 
for labor markets of all those changes noted [in the introduction], and those noted 
by our contributors, whether or not they have acknowledged the fact." (10) After 
a postWorld War Il era dominated by oligopolistic industrial giants that were 
free to strike friendly deals with labor even as huge profits rolled in, the post- 
OPEC period has witnessed the violent return of large-scale competition and 
subsequent decline of manufacturing, with devastating effects on those least 
able to protect themselves from “instability,” “flexibility,” “accountability” and 
other euphemisms for a winner-take-all economic order. 

However, far from being the inevitable result of anonymous “market forces” (a 
term toward which the editors show no deference), the new order results from 
the active decisions taken by a generation of managers whose response to the 
new demands reflected not only their business myopia but also the reality of what 
they could get away with. The labor movement was weak for the same reasons 
the industrial CEOs had become lazy bureaucrats, and when the CEOs directed 
their “adjustments” downward toward their employees, they met little successful 
resistance. Rather, they found among workers an “apparently amiable equanimity 
... very likely reflecting consumption opportunities at retailers’ import counters” 
— as well as stock-owning rates more than 50 percent. In short, the victims of 
corporate restructuring had "little to gain and much to lose" by participating 
in organized opposition, leaving open an obvious path of least resistance for 
corporate actors (22-3). 
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The return of price competition may be the "most fateful factor" affecting 
labor markets and inequality, but it is not clear how this carr explain other 
important developments, such as the decline of occupational sex segregation 
and the gender pay gap over the same period (the 1970s and 1980s) - or why 
that gender progress slowed in the 1980s and stalled in the 1990s, as reported in 
the chapters by Jacobs and England et al. An explanation for the improvements 
might link the trend toward competitive markets with the growth of the service 
sector, which positively affects women's relative labor market position. Less 
clear is why this progress would stall in the 1990s, or why England et al. find 
the penalty for working in a female-dominated job (or the reward for working in 
a male-dominated job) seems to have increased over the 1980s. The "gender 
revolution" thus joined the labor movement in the doldrums, and will either "stall 
out" or "be reenergized by new initiatives and new mobilization." (561) 

Perhaps it is significant that political factors such as fortunes of the labor and 
women's movements have coincided with the development of new "flexible" 
workplaces in which employers produce customized goods on a mass scale, 
with ever-changing employee pools and production locations. As Wallace and 
Brady argue in their chapter, this leads to work that is severely standardized from 
the point of view of the worker, but also feeds the increasingly common practices 
of outsourcing, subcontracting and temporary work - innovations reminiscent 
of the “putting out” systems of the 19th century. As Marsden and Gorman 
point out in their comprehensive review of the role of social networks, another 
seemingly anachronistic practice — the reliance on informal networks to match 
work and workers — also flourishes in this arena, with uncertain consequences for 
processes generating inequality. 

These are just a few of the moments in this volume that struck me. Other 
readers will hone in on other aspects of the book. However, | believe the common 
perception will be that this collection unites the benefits of clear overviews with 
the insights into directions for future research that we would expect from such a 
constellation of sociological perspectives. 


Reviewer: Philip N. Cohen, University of North Carolina at Chapel Hill 
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Affect, Meaning and Quality of Life 
Michael Hughes, Virginia Polytechnic Institute and State University 


Research on quality of life in sociology is largely focused on a narrow 
range of dimensions including affect, happiness and satisfaction. It 
largely avoids a concern with the meanings that provide people with the 
Purpose, significance, validity and coherence that are a basis of social 
relationships and social integration. Evidence is presented that people 
experience lowered affect in some challenging and stressful situations 
that simultaneously generate meanings that enhance the quality of 
their lives, and that experiences of high affect along with impoverished 
meanings produce low quality of life. This strongly suggests that 
meaning may be a more fundamental dimension of life quality than 
is affect. Meanings may also enhance the quality of life by motivating 
people's involvement in activities that promote social integration and 
the quality of social relationships. Sociologists should work to further 
develop the theoretical and empirical basis of quality of life so that it 
more fully includes dimensions that reflect the meanings that provide 
purpose and significance in people's lives. 


Introduction 


Social research on quality of life over the past half-century has been dominated 
by a focus on subjective evaluations and emotional states that people experience 
in their lives. With some notable exceptions, researchers have ignored other 
meanings and purposes that animate the lives that people lead. My argument 
in this paper is that we need to reformulate our empirical understanding of 
quality of life to move beyond affect, emotions and subjective evaluations, the 
positive-negative dimension of subjective feeling, and include the meanings 
and purposes that people use to generate significance, validity and coherence 
in their lives. 

Many of the tools for doing this are lying about on the ground, waiting for 
someone to pick them up and consider their potential seriously. The reason 
few researchers attempt to do this is that doing so is difficult, and there is 
no obvious pay-off in terms of research efficiency, social policy or research 
funding. It is, however, a theoretically important and socially meaningful task. 


The author thanks Walter R. Gove and John Ryan for comments on an earlier draft. This 
paper was presented as the Presidential Address at the Annual Meeting of the Southern 
Sociological Society on April 14, 2005. Direct correspondence to Michael Hughes, 
Department of Sociology, Mail Code 0137, Virginia Polytechnic Institute and State 
University, Blacksburg, VA 24060. E-mail: mdh@vt.edu. 
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The Human Condition 


` 


Knowledge and truth are not the only goals of science. As made clear by the 
motto inscribed on the dome of the great hall in the National Academy of 
Sciences in Washington, D.C, the improvement of the human condition is also a 
primary concern: 


To science, pilot of industry, conqueror of disease, multiplier 
of the harvest, explorer of the universe, revealer of nature's 
laws, eternal guide to truth. (Quoted in Fuhrman 1999:92) 


This concern has also been a primary focus in sociology from the beginning, 
and was an obvious goal or assumption underlying the work of virtually all early 
sociologists (Martindale 1960; Ritzer and Goodman 2004). In addition, | think 
it is safe to say that sociology would not exist if it were not for the promise of 
sociology to improve the human condition. 

We see this concern in a number of key features of sociology: in research on 
material well-being, inequality, social order, social integration, community, kinship 
and family. It is also reflected in the recent movement toward public sociology 
(Burowoy 2005), but perhaps the most highly developed area in the field that 
reflects this concern is the body of work that has evolved over the past 50 years 
regarding social factors in health and psychological well-being. 


Quality of Life and the Social Indicators Movement 


The contemporary umbrella under which much of this research and theorizing is 
done is "Quality of Life." While some quality of life studies stress material quality 
of life, a large body of research is about subjective quality of life. This pursuit 
grew out of the social indicators movement that can be traced to early work by 
William F Ogburn and Howard W. Odom (Ferris 1988; 2000; Land 2000; Sirgy et 
al [in press]). This movement, which reached its height in the 1960s and ‘70s, 
was organized around the task of developing social indicators that could be used 
to monitor social well-being. In the late 1960s and early 1970s, a large number of 
social scientists worked on the problem of developing a theoretical and empirical 
basis for monitoring the social well-being of our society. (See, for example, the 
collection edited by Land and Spilerman [1975].) 

A key product of this movement was a study on psychological quality of life 
funded by the Russell Sage Foundation and directed by Angus Campbell and 
Phillip Converse published in 1976 as The Quality of American Life: Perceptions, 
Evaluations, and Satisfactions (Campbell, Converse and Rogers 1976). At about the 
same time, the National Science Foundation funded Frank Andrews and Stephen 
Withey to develop "social indicators of well-being" to tap life quality. Although 
the social indicators movement was based in thoughtful considerations of a 
number of dimensions of human existence, including material and value linked 
dimensions (Land and Spilerman 1975), the most durable surviving dimension 
from this period is subjective quality of life, and it is this dimension that has 
become a primary indicator through which the quality of life is assessed. 
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The 1970s' subjective quality of life movement had its origins not only in the 
social indicators movement, but in theoretical and empirical work by Norman 
Bradburn on happiness and psychological well-being in the 1960s (Bradburn and 
Caplovitz 1965; Bradburn 1969) and by Gerald Gurin and colleagues on mental 
health in the 1950s (Gurin, Veroff and Feld 1960). A main origin of this line of work, 
however, was in the definition of mental disorder that emerged in psychiatry after 
World War Il. 

Army psychiatrists, influenced by their experience of treating soldiers 
traumatized by combat, developed a nomenclature (which became the DSM- and 
then the DSM-II) that conceived of mental disorders as reactions to environmental 
stresses. Anyone could become symptomatic if exposed to enough stress, and 
people's reactions would be manifested along a continuum of severity, from 
mild to extremely severe (Wilson 1993). This conceptualization has now been 
replaced by a DSM (Ill and IV) that conceives of mental disorders as collections 
of symptoms that are manifest as a series of dichotomies — either ill or not ill 
(American Psychiatric Association 1987; 1994). 

But the earlier conceptualization encouraged social scientists to think about a 
continuous distribution of psychological ill-health in the population, and to think 
that this variation could be caused by variability in social conditions in family, 
neighborhood, community and economy. Of course the tendency to think this 
way was also promoted by the fact that in the 1950s and 1960s, NIMH was eager 
to fund mental health research on these issues (Wilson 1993). 

If symptoms of mental disorder were reactions to noxious external stimuli, 
ranging from traumatic experiences to community disorganization and family 
troubles, then it is easy to think about unhappiness and dissatisfaction in the 
same way. And so an industry was born to use government research funds to 
try to understand what was responsible for variability in this dimension in the 
population. Practitioners emphasized that they were not studying mental disorder 
or illness and that they focused on subjective well-being because, as Mirowsky 
and Ross (2003) argue, it is a universal and common yardstick for measuring 
quality of life across the wide spectrum from negative to positive. 


The Single Continuum of Affect 


Most contemporary researchers who study psychological distress reject 
the notion of a dichotomous understanding of disorder, and they emphasize 
the importance of examining a range of well-being, from high to low, that 
characterizes the experience of large numbers of people who cannot be 
understood as having mental disorders. Mirowsky and Ross (2003), for 
example, in their book, Social Causes of Psychological Distress, and in a series 
of articles, argue that distress is one end of a continuum that has happiness 
and the enjoyment of life at the other end. 


Well-being and distress are opposite poles on a single 
continuum: more well-being means less distress and more 
distress means less well-being... As well-being goes up, 
distress goes down. (P 26; my emphasis) 
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There are other views. Some see positive mental health as being distinct in its 
own right. Some of these see positive and negative mental health às distinct and 
unrelated and examine the balance between the two dimensions (e.g., Bradburn 
1969), and some find them to be distinct and negatively related (e.g., Keyes, 
Schmotkin and Ryff 2002). 

However, these approaches focus on a dimension that has a singular 
conceptualization. They are all devoted to understanding the range of affect 
from positive to negative. A huge number of measures are now used in research 
on affect, and thus quality of life. These include a variety of distress indices, 
positive and negative affect indices, measures of depression (such as the Center 
for Epidemiological Studies depression inventory [CES-D]), and measures of 
happiness (from happy to unhappy). Distinct but related dimensions include 
satisfaction (from satisfied to unsatisfied) and overall evaluations of one's life 
(from delighted to terrible). There is also a family of indicators of other dimensions, 
sometimes referred to as psychological resources, including measures of self- 
esteem, personal efficacy and mastery. In the typical theoretical account, these 
dimensions are important because they are mediators or moderators that lead to 
(or facilitate) affect or a related dimension of subjective well-being. 

The key point that | want to emphasize is that in the subjective quality of life 
movement, the primary outcome to be understood, and the ultimate standard for 
judging the quality of human life, is variability in a variety of positive feeling states, 
particularly affect. In this view, we need to understand what causes variability in 
this dimension because, as Mirowsky and Ross (2003) argue, affect is the universal 
dimension of life quality, and as noted by Angus Campbell (1981:19), "...people 
are primarily motivated to maximize their sense of well-being." Because of the 
dominance of this point of view, sociological research and theory on subjective 
quality of life largely focus on a single dimension of the human experience, the 

- range of affect from negative to positive. 

There is an irony in this situation. Sociology, a discipline that emphasizes social 
relationships, networks, families, intimate groups and communities, has nurtured 
a mostly psychologized and essentially one-dimensional approach to quality of 
life. It is in the work of psychologists and psychologically influenced sociologists, 
that we can see the slow evolution of a more explicitly multidimensional approach 
to life quality, an approach that does not confine social factors to the independent 
variable list. This approach assumes that in the case of life quality, social factors 
are to be engaged, facilitated and explained. 

Given the complexity of the human experience, the focus on positive life 
feelings as the primary dimensions of life quality is exceptionally narrow. At 
their most complex, these dimensions reflect psychological feeling states of 
individuals across a range from negative to positive. Are you happy? Do you feel 
good? Do you feel bad? Are things just terrible? For purposes of theoretical and 
empirical effectiveness, simplicity is a virtue, but the virtue is simultaneously a 
drawback, because, by definition, it leaves out a great deal. 


Meaning 


There is another dimension of life quality that is mostly ignored in quality of life 
research, and that dimension is meaning. In one sense, meanings are simply the 
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shared conceptualizations that make social life possible. Something is meaningful 
because the way people think about it is shared with others. But meanings 
in human social life are of more than practical significance. Some meanings 
stand out because they provide people with purpose, significance, validity and 
coherence for their actions and also provide a basis for social relationships and 
social integration. 

Of course, the idea that meaning, purpose, fulfillment, etc., are critically 
important in people's lives is hardly an original observation. However, because 
meaning has been mostly ignored in research on quality of life, its importance, 
though obvious, is not well understood, either empirically or theoretically. 

Researchers rarely include measures of meaning in studies of affect. Standard 
models of quality of life assume that affect is the ultimate dependent variable 
and, that when we promote affect, we are promoting the quality of life. If 
meaning is included, it is important because it may be one of the ways we get to 
a heightened sense of affect. 

| would argue that meaning is important in and of itself. At the risk of 
contradicting Angus Campbell, | would say that people are primarily motivated to 
maximize the meaningfulness of their lives, not their affect. Probably heightened 
affect is often, but not always, a by-product or consequence of attempts to 
maximize meaning. 

A series of examples that | present below support the idea that meaning is 
a more fundamental dimension of quality of life than is affect. | will argue that 
the inclusion of meaning in quality of life studies can help us redirect quality of 
life research toward an agenda in which subjective quality of life, whether affect 
or meaning, is not the ultimate dependent variable. Theoretical models can be 
built that consider meaning as the foundation of social relations, networks and 
community (something that should not be alien to sociologists). This would be a 
study of quality of life that starts and ends in social aggregates, with intervening 
processes that carry us through affect and meaning. 


The Importance of Meaning Independent of Affect 


So how do we know that meaning is important? Unfortunately, we do not have 
good evidence to answer this question because meaning has not been of central 
importance in quality of life studies. Most of the evidence is indirect, and some 
is only suggestive or even impressionistic. 


Living Alone 


My dissertation was on living alone and mental well-being (Hughes 1979). Walter 
Gove and | published a paper out if it in 1981 (Hughes and Gove 1981). The study 
showed that unmarried people who lived alone were in no worse, and sometimes 
were in better, mental health than unmarried people who lived with others. These 
findings were somewhat surprising because they challenged the idea that social 
integration was important for mental health. 

To be honest, | was exceptionally pleased with our findings because | had 
become suspicious of what | had concluded was facile reasoning in the literature 
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linking social relationships to better psychological well-being through social 
supports. It seemed obvious to me that many life problems Were caused by 
social relationships, and this emerging framework ignored the dark side of social 
integration. Our findings did not allow us to go as far as | would have liked, 
but we did end up arguing that social integration probably had both rewards 
and costs. We stressed the character of social relationships rather than their 
meaning. However, two unpublished findings that emerged later out of our 
research suggested that if we had pushed our interpretations further, it would 
have been a mistake. 

First, we did some analyses to try to explain and document why social 
integration had negative effects. To do this, we attempted to measure the 
negative dimensions and consequences of living with others that would be 
associated with bad mental health: loss of privacy, high demands and desire 
to withdraw from others. When we controlled these dimensions, along with 
respondent's income (as a measure of positive selection into living alone), we 
found that living alone was no longer related to being in good mental health and, 
in fact, we found some significant associations between living alone and being 
in poor mental health (Hughes and Gove 1989). These findings suggested that 
some unmarried people live with others even though these living arrangements 
constitute a negative structure for mental health, and even though they could 
afford to live on their own. The most obvious, though speculative, explanation 
of these overall findings is that these people live with others because of the 
meanings associated with these social relationships, and, after we account for 
the negative impact these relationships can have, persons living alone show the 
theoretically expected evidence of lacking social integration, and thus meaning, 
in their lives. 

In a second analysis, we again examined the impact of living alone on the 
well-being of the unmarried, but we divided those living with others into those 
who lived with relatives and those who had "roommates." We found that it was 
living with relatives that was related to diminished well-being, not living with 
roommates. Again, we controlled for respondent's income and other variables 
associated with dependency. Thus, these analyses suggest that unmarried 
people live with relatives even though these relationships reduce positive affect 
and increase negative affect. Two lines from Robert Frost's (1915) poem, "Death 
of the Hired Man," provide a hint of why: 


Home is the place where, when you have to go there, 
They have to take you in. 


| realize that in the poem, the hired man is taken in by his former employers, 
not his relatives. Still, these two lines make the point. Important among the 
meanings of home is the sense of obligation that people have for one another. 
| hypothesize that people who could live alone or with roommates instead live 
with relatives even though this decreases their affect. Why do this? | further 
hypothesize that they do so because of the meanings that they attach to 
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their relationships with relatives, prominent among which is the complex and 
variegated sense of obligation. 

Two somewhat more obvious examples are the impact of children and care 
giving on well-being. 


Parenthood 


In most societies people associate parenthood with happiness and well-being. 
Many people have a very strong desire to be parents, work hard to get pregnant, 
spend enormous sums of money to overcome fertility problems, and celebrate 
both their pregnancies and the births of their children, often referred to as "little 
bundles of joy." 

But parenthood does not confer mental health benefits (Gove and Geerken 
1977; McLanahan and Adams 1987; 1989; Evanson and Simon 2005). In fact, 
most research shows that having children in the home is related to worse mental 
health. Effects are greater on single parents, but most studies find that there 
are negative effects of parenthood on people in intact marriages, and effects 
are found for both men and women. In addition, for the married, having children 
reduces marital happiness. These decrements to well-being are not large, but 
they are found in most studies. 

Furthermore, having children is not a randomly occurring burdensome 
transformation of people's lives. In 2002, 60 percent of the births that occurred in 
the United States were to women who had already given birth at least once, and 
of mothers aged 40 to 44, 78 percent had had more than one child (Downs 2003). 
In the same year, more than 98 percent of sexually experienced women of child- 
bearing age reported using contraception at some time in their lives (Mosher et 
al. 2004). Apparently, after having one child, and even though they can sense its 
effect on their marriages, their well-being, and/or their lives overall, many parents 
decide to have more children. Why does this happen? 

An interesting exception to the pattern of children being related to decreased 
quality of life was presented in a study by Umberson and Gove (1989). While they 
did find that parents of children in the home were less happy and more agitated 
than non-parents, they also found that parents reported a higher sense of life- 
meaning and higher self-esteem than non-parents. A reasonable explanation of 
this pattern is that many people find parenthood to be a meaningful role. Though 
it is challenging and burdensome, it also provides a purpose and coherence to 
one's life, and the contextual values of parenthood provide a strong sense of 
validity. Through parenthood, one can know what to do and why one is doing it, 
and that it is important. More importantly, it provides powerful and meaningful 
connections to others, those who are nurtured and those who share the 
responsibilities of nurturing. 


Care Giving 


Research on being a caregiver for a disabled relative or friend presents a similar 
pattern. Overall, research shows that such care giving is associated with higher 
distress and a greater sense of burden. But some studies, for example, a recent 
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study by Marks, Lambert and Choi (2002), finds that despite this, some contexts 
of care giving were associated with greater purpose in life amd a heightened 
sense of personal growth. 


Race and Ethnicity 


Research over the past 50 years shows that African Americans experience 
more unhappiness, dissatisfaction and alienation than whites, and for at least 
the last 30 years, these race differences have been stable (Thomas and Hughes 
1986; Hughes and Thomas 1998). However, recent research by Carol Ryff, Corey 
Keyes and Diane Hughes (2003) shows that the level of eudaimonic well-being 
(that includes the sense of purpose in life, personal growth, positive relations 
with others, environmental mastery, etc.) is higher among African Americans 
and Mexican Americans than among whites. This study does not show that 
discrimination improves eudaimonic well-being; in fact, it does the opposite. But 
including a measure of (non-racially specific) discrimination in the models indicates 
that eudaimonic well-being among these minority groups would be even higher 
without discrimination. These findings suggest that challenges other than simple 
discrimination and the intra-ethnic processes for dealing with these challenges 
generate greater eudaimonic well-being. This may underlie what Marian Wright 
Edelman (1999:xiii; quoted in Keyes [2005]) said in a recent memoir: 


As a black girl growing up in a small segregated southern 
town, | could never take anything for granted and never for 
a moment lacked a purpose worth fighting, living, and dying 
for, or an opportunity to make a difference if | wanted to. 


Other Examples 


A series of impressionistic examples also make the case for the priority of 
meaning over affect. These are mostly cases of stressful, challenging and often 
painful activities that people willingly pursue, presumably because of the way 
they enhance life's meaning in one way or another. 

To an audience of professional academics, | would probably only need to say 
the words “graduate school” to make my point. To a graduate student preparing 
for prelims or writing his or her dissertation, the question “are you happy?” is 
the wrong question to determine quality of life. If the person answered “no,” 
this would not mean that his life lacked quality — but why not? Because there 
is a range of complex meanings and purposes that people attach to organized 
purposeful actions such as graduate school, and this vast array of subjective 
meanings is not captured by the notion of affect. 

For the same reason, if you ask a mountain climber hanging on to a rock ledge 
in a blizzard if he is happy, it is again the wrong question. For more on this, see 
Walter Gove's 1994 Southern Sociological Society Presidential Address (Gove 
1994) and consult an account of one of his climbs (Gove 1980). 
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For a parent sitting up with a sick child, or even just driving a child around 
to a seemingly endless array of activities, the question, "are you happy?" is the 
wrong question. 

For a volunteer who fought in the Spanish Civil War, a combatant in Iraq today, 
a soldier on either side of the American Civil War, the question “are you happy?” 
would be the wrong question. Commitment to this line of activity, that includes 
the most stressful experiences that humans can have, is grounded in powerful 
meanings that range from one’s devotion to one’s comrades and fellow soldiers 
to devotion to the moral and/or political ends for which one is presumably fighting 
(e.g., McPherson 1997). 

For the members of the audience who attended the talk on which this article is 
based (the Presidential Address at the annual meeting of the Southern Sociological 
Society), “are you happy?” is the wrong question. We attend these talks each 
year because they are supposed to be interesting; they give us Some new ways 
of thinking and something to talk about — something that connects us. 

For the editor of an academic journal, “are you happy?” is the wrong question. 
Being editor is sometimes lonely and sometimes alienating, and it is worrisome. It 
also makes a lot of others feel unhappy (I rejected about 85 percent of the papers 
submitted when | edited the Journal of Health and Social Behavior from 2001 
through 2004). But overall, | think most editors would see their jobs as important, 
and would work to accept papers that move the field forward and stimulate 
theoretical progress. They hope not to reject any good papers or accept any bad 
ones. They do a duty to the profession and serve the literature and knowledge 
base. Happiness and affect do not really enter into it. But the rewards in terms of 
meaning, purpose and fulfillment can be very great. 

In each of these cases, we can discern a situation in which affect may be low 
but meaning is high, and heightened meaning appears to be a key reason the 
activities are pursued in spite of their potential to lower affect. 


Cases of the Opposite: High Affect and Low Meaning 


Think about a society based entirely on affect — a society in which feeling good 
is the core and dominate value, and other meanings do not provide purpose 
and coherence to people's lives. | do not believe that this would be the sort of 
a society that many people would want to live in. In fact, because virtually all 
human experience produces meaning, it is difficult to find or even imagine social 
circumstances that are without meaning. It is even difficult to find examples of 
social situations in which affect is high, but meaning is low. However, we can 
observe situations in which people pursue short-term intense positive affect but 
experience low, weak, or ephemeral meanings and do not appear to be able to 
sustain a high level of life quality. Three obvious examples that strongly suggest 
the importance of the content of meanings are drugs, sex and materialism. 


Drugs and Alcohol 


Some psychoactive drugs produce very positive feeling states. Many drug users 
report that they have never felt as good as they do when they are on these drugs. 
But the overall impact of drug involvement on well-being and quality of life is also 
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well known. Except for using alcohol in moderation, using drugs and alcohol is 
associated with poor mental health (Gove and Hughes 1979) and with suicidal 
behavior (Borges, Walters and Kessler 2000). There is also strong comorbidity 
between alcohol dependence and most mental disorders (Kessler et al. 1997). In 
short, there is strong evidence that chronic and heavy use of pleasure-producing 
substances has primarily negative effects on people that are destructive not only 
of meaning, but of life itself. 


Sexual Activity 


Similarly, evidence from a number of surveys that include well-being measures 
and indicators of sexual activity do not show that sexual activity is associated 
with being in good mental health (Hughes and Gove 1993). In fact, among the 
unmarried males and females, more sexual activity, particularly more partners, 
and particularly among younger persons, there is a marked association with poor 
mental health (Hughes and Umberson 2004). The exception is in marriage or 
cohabitation where sexual activity within the relationship is associated with better 
mental health, but the association is explained by relationship quality (Hughes 
and Gove 1993). | would hypothesize that it is not the positive feeling state, or 
affective dimension, of sexual activity that has the primary potential to enhance 
the quality of people's lives; it is rather the meanings imbedded in the social 
relationships that one has as part of sexual activity that offer this potential. 


Materialism 


Some people acquire possessions as a way to generate positive affect and reduce 
negative affect. But psychologist Tim Kasser's (2002) work on the consequences 
of materialism shows that reaching materialistic goals is associated with well- 
being only for those who also report reaching non-materialistic goals such as 
personal growth, close relationships and contributing to the community. Kasser's 
research on materialistic values follows a similar pattern and shows that people 
high on materialism are consistently less satisfied than those who are non- 
materialistic. That materialism is devoid of meanings that provide a high quality 
of life is reflected in much contemporary advertising that stresses the meanings 
that are advanced if one only uses the product. Auto and tobacco advertising is 
full of this, but the most obvious example is the series of MasterCard commercials 
that stress the pricelessness of the experiences one gets while buying things 
with a MasterCard. 


Quality of Life Research that Includes Meaning 


As can be seen from my examples, quality of life research has not completely 
ignored meaning. As noted above, the Umberson and Gove (1989) paper on 
parenthood included a measure of meaningfulness of life. An important paper 
by Russell PD. Burton (1998) used the same data as Umberson and Gove and 
presented findings on what he termed global integrative meaning, showing that 
the positive effects of some central family roles on mental health are mediated 
by the sense of meaning that they provide. Another important paper by Taylor 
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and Turner (2001) showed that the meanings that people attach to themselves, 
that they matter to others, is an important factor in depression, particularly for 
women, for whom mattering is protective against depression over time. The 
psychologist Carol D. Ryff and one of her collaborators, sociologist Corey L.M. 
Keyes, have developed a new orientation on positive mental health that includes 
the sense of purpose in life and other dimensions reflective of enhanced meaning 
(Ryff 1989; Ryff and Keyes 1995; Keyes and Ryff 2002). 

But | think we need to do more. There are four sorts of problems that | think 
we can identify in various research approaches to meaning thus far. 

First, many researchers seem to tacitly acknowledge that meaning is important, 
but for the purposes of the research, it is not. For example, in his book, The 
Structure of Psychological Well-Being, Norman Bradburn (1969) argued that in 
thinking about the experiences that are associated with well-being, "the content 
of experience is not important." (p. 54) The problem is that there are too many 
ways to be "pleased," "proud," "depressed" or "restless." This is similar to the 
approach of other pioneers in quality of life research, such as Angus Campbell 
(Campbell, Converse and Rogers 1976) and Andrews and Withey (1976), who 
simply asked people about their life "as a whole," and recorded answers on a 
continuum of satisfaction or on a range between delighted and terrible. The 
result of this is measures of feeling states that are presumably connected to 
meaningful experiences, but we never get a sense of the nature and meaning of 
these experiences. We only get the feelings. 

Second, some researchers hypothesize that meaning is of critical theoretical 
importance, but include no measures of it. Sometimes this is unavoidable, as 
in my examples above of situations of high affect and low meaning. In many 
research situations, measures of meaning dimensions are not available. This is an 
obvious problem in the well-known approach of Durkheim (1897 [1951]; see also 
Pope [1976]) in his study of suicide. Social structure influences sentiments that 
are protective against suicide. But in this kind of work, there is no direct measure 
of sentiments or meaning. If we have no measure of meaning it is difficult to be 
sure that we are observing its outcome. Also, without measures of meaning, it 
is unclear how meaning is important, and as a result our theories are devoid of 
empirically supported content about processes involving meaning. 

Third, some researchers attempt to measure meaning, but hypothesize it as 
merely an independent variable or as a mediator between circumstances and 
affect, not as having significant theoretical stature in and of itself. These studies 
are important and useful, but they are based on the assumption that meaning 
is important because it makes us feel good, and our journey is over when we 
understand what causes positive and negative feelings. 

Finally, some researchers attempt to measure meaning, but the items are often 
very similar to measures of affect and are highly correlated with measures of 
affect, and it is unclear if what is measured is something different from affect. 


How to Think toward a Solution 


Meanings are a part of culture. So, one would think that the sociology of culture 
would provide us with a way to think about this problem. But the sociology of 
culture has mostly assumed the importance of meanings and has not engaged 
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them. The sociology of culture has mostly been about culture as legitimation, 
culture as resistance, and about the production of culture (that ts, how cultural 
organizations and industries produce culture). In the sociology of culture, culture 
is primarily a handmaiden to social structure or to some structural process. 

As a result, the sociology of culture has not been much about meanings 
and has largely avoided meanings in the same way that the sociology of well- 
being has avoided meanings - and probably for the same reason. Meanings 
are elusive; they are hard to grasp, hard to measure, and vary across time and 
circumstance. : 

Over the past 15 or so years, this has begun to change. In a recent article, 
Jeffrey Alexander (2004; see also Alexander 2003) argues that cultural meanings 
can effectively emerge out of social performances whose elements (actor(s), 
context, audience, means of symbolic production, social powers and background 
collective representations) are fused. However, Alexander's empirical approach 
has been mostly to try to understand macro-cultural processes and meaning- 
making (for example, meanings that emerged out of the Watergate crisis and out 
of the Holocaust) - not how meanings are made in the micro structures of family, 
friendship, work and leisure. Macro-cultural meaning systems are important 
contexts for the more mundane meaning-making that occurs in people's everyday 
lives. But | think to understand the meanings that are important to people's 
quality of life, we must focus directly on meaning-making that occurs within 
micro structures. 

As sociologists, we are familiar with Goffman's (1959) well-known argument 
that the self is a dramatic effect that emerges out of the whole scene in which it 
is presented. If we can see this, | think we can also see how a variety of complex 
meanings other than those of the self emerge in everyday life. 


What Sorts of Elements are Important to Life Meaning? 


We will also need to develop a conceptualization of the dimensions of meaning 
that are important. We probably already know the dimensions, we just need to 
take them seriously, and perhaps think of them in a slightly new way. 

For example, in his 1991 book Meanings of Life, and elaborated in a 2005 book, 
The Cultural Animal, psychologist Roy F. Baumeister has developed a way to think 
about what people appear to need in order to formulate a meaningful life. In an 
unself-consciously functional analysis, he argues that our culture encourages us 
to seek meaning because without it culture and society would be impossible. 
Though he is making a macro level cultural argument, | think the reasoning works 
very well at the micro level. 

Baumeister's argument yields four dimensions of meaning that are not new. 
You will recognize them. They are already in use in one way or another. What 
would be new would be using them as ways to probe the actual meanings that 
provide quality of life. 


Self-Worth 


Self-worth is not important to the meaning of life simply because it makes 
people feel good, and research does not show that it is important because it 
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is a powerful predictor of better performance outcomes at work, at school or in 
normative behavior (Emler 2001; Baumeister et al. 2003). Instead, as psychologist 
Mark Leary and his colleagues (Leary et al. 1995; Leary and Baumeister 2000; 
Leary 2005) recently showed, self-esteem functions as an internal indicator, a 
subjective indicator, of a person's interpersonal success and failure. Put another 
way, self-esteem is the result of the quality of one's relationships, not its cause, 
and it indicates how well a person is integrated — | would say morally integrated 
- into family, friendship, work groups and so on. This is the sociometer theory 
of self-esteem, and it is perfectly understandable to a sociologist who is used to 
thinking of self-esteem in terms of reflected appraisals. Monitoring self-esteem 
may be a way of monitoring the quality of relationships that a person has. 

Research does not support the idea that boosting self-esteem will improve 
the quality of people's relationships. In fact, there some evidence to the contrary 
(Baumeister et al. 2003). However, because people are powerfully motivated 
to maximize self-esteem, it may be more than simply an indicator of how well 
one is integrated into social relationships. It may be a factor in shaping behavior 
and managing the contributions that people make to the social groupings to 
which they belong. If people's contributions to the group are rewarded by higher 
self-esteem, it may provide further motivation for people to contribute to and 
to promote the overall well-being of the collective. Individually experienced 
meaning, in this case, self-worth, may be an important mediator between social 
structure at time 1 and social structure at time 2. This is the sort of model of for 
quality of life that we should be building for the future. 


Purpose 


Most people require purpose to provide coherence and focus to their lives. For 
most people, having no purpose is to assume that the present is, in itself, a 
solution to life's problems. It is also to be disoriented and possibly discontented. 
Purpose carries people to a future state of fulfillment that may be personal growth, 
enlightenment, salvation or the maintenance of important social relationships 
with people who have similar or compatible goals and purposes. A focus on 
these larger purposes of life creates long range contexts that connect the past 
with the future and take us way beyond the experience of our present selves. If 
religion is the context, the time frame can be eternity. 

Purposes are important not only because they are motivating, but because 
of the way they can connect us to other people, integrate our social groups, 
and fuel social processes that sustain social lives, modify structures and create 
new meanings. 


Personal Control (or Efficacy) 


The importance of efficacy, like that of self-worth and purposes, is nothing new. 
In fact, there is a substantial literature on the importance of personal control as 
a factor to reduce distress and increase positive affect. Efficacy is the sense of 
being able to overcome obstacles, deal with life's problems, and achieve goals. 
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An important reason people seek this is because without it, purpose would be 
empty and values would be difficult to realize. y 

However, seeking efficacy has another important consequence. People 
develop a sense of efficacy by experiencing themselves as efficacious (Gecas and 
Schwalbe 1983), and the sense of efficacy reaches its peak when the achievement 
of goals is challenging and difficult (Gove 1994). At the extreme, we can see that 
the drive for efficacy, and thus meaning, can push people to pursue activities 
that may be unpleasant, dysphoric and even painful (e.g., Gove 1980). Some 
of these activities may not be instrumental or beneficial for family, friendship 
circles or community, but if we assume that these elements of meaning work 
together, | think it is reasonable to further assume that the drive for efficacy as a 
life meaning can motivate many people to pursue difficult and arduous socially 
meaningful goals and activities. 


Values and Justification 


Values are the basis for the justifications people make for their actions. They are 
the moral underpinnings that give people a sense of confidence in what they do 
and imply that their actions enjoy wide social support. Values also provide people 
with a way to see themselves in positive or negative terms. 

"Values" is also an important topic in the second chapter of the introductory 
sociology textbook. And except for sometimes acknowledging that values ` 
are theoretically important and using the concept as an element in various 
interpretations we sometimes make of our research findings, "values" appear 
in empirical research in sociology a lot less than one might expect. In research 
on affect and well-being, "values" hardly appear at all. Many researchers simply 
assume that values are important, but they often do not measure the content 
` of values or attempt to assess variability in basic values and differences in how 
values function in different social contexts. 


A New Model of Life Quality 


This scheme of self-worth, purposes, efficacy and values is a tentative foundation 
for research into the elements that make life meaningful. Other schemes should 
be developed. Research is needed to understand how specific meanings are 
made, how they are linked to the background macro-cultural and macro-structural 
context, how they emerge in the micro structure, what their nature and content 
is, and how they are linked to individual affect. Finally, and most importantly, we 
need to examine how meaning-driven actions have an impact on the quality of life 
made possible by the social structures of family, friendship, work and leisure. 


Conclusion 


There is another dimension of quality of life, and it is not the master dimension 
of affect. It is meaning. People gravitate toward some activities and structures 
because they provide a strong sense of meaning, and they experience their lives 
as better as a result. This may or may not be linked to affect. 
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Integrating the concept of meaning in studies of quality of life will require 
considerable conceptual development, a program of qualitative studies, 
and a focus on measurement that can eventually enable large scale survey 
research on meaning. 

As long as we focus entirely on affect, we will miss the fundamental dynamics 
of human beings pursuing quality of life through enhanced meaning and purpose. 
Adding meaning to our focus on quality of life will provide a more fundamental 
understanding of the human condition and how to improve it. It will also help 
us to more profoundly understand what it is about our condition that makes it 
fundamentally human. 
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Negotiating Social Membership in the Contemporary World 
Jacqueline Hagan, University of North Carolina, Chapel Hill 


One of the defining characteristics of the late 20th and early 21st centuries is 
the increasing importance of international migration, an epoch Castles and Miller 
term the “age of migration.” (2003) The precise size of the international migrant 
population is unknown. Much of this movement — such as unauthorized and other 
irregular flows — is not recorded in official statistics. Nonetheless, by all accounts 
international movements have soared in recent decades, especially since the 
1970s, with the acceleration of cross-border flows of trade, investment, ideas 
and people — key features of globalization (Castles and Miller 2003). Worldwide, 
between 1965 and 2000, the number of migrants is estimated to have doubled, 
from 75 to 150 million (International Organization for Migration 2000). 

As in the past, many of today’s international migrants move toward areas of 
economic opportunity. From the mid-19th century until World War Il, large flows 
left Europe for one of four settlement areas: North and South America, Australia, 
New Zealand and South Africa. After World War Il, and especially since the end 
of the Cold War in the 1980s, the origin and direction of migration flows have 
expanded considerably and migration has become an increasingly important 
issue for sending and receiving countries. Today's migrants hail increasingly from 
developing regions and move toward multiple destinations in the developed 
world in search of economic opportunities, not only to the countries that have 
traditionally attracted large numbers of immigrants, such as the United States, 
Canada and Australia, but also in significant numbers to countries in Western 
Europe, the Persian Gulf, Japan and the fast-growing East Asian economies 
(Castles and Miller 2003). ! 

The articles that comprise this special section of immigrant incorporation 
focus on how integration opportunities and constraints shape social membership 
among newcomer immigrant groups across a number of advanced industrial 
nations, including European countries, the United States and Canada. Social 
membership here refers to a set of basic social rights conferred on members 
of a society, including, for example, the right to work, the right to participate in 
political life, or the right to education. Social membership, however, is not always 
defined by the state and its institutions and can be expressed in informal but 
organized ways. The 2006 immigrant-led protests that swept across the United 
States exemplify how immigrants and the organizations and institutions who 
advocate on their behalf mobilize to demand rights — to claim social membership 
as a basis for legal status and inclusion.’ 

The integration of large numbers of newcomers coming from very different 
cultures poses enormous challenges to industrialized nations today, many of 
which are attempting to cope with their own domestic concerns, including the 
effects of liberalized trading policies on labor markets, increasing domestic 
inequalities, mounting racial and ethnic tensions, shrinking social safety nets and 
national security (Papademetriou 2003). Consequently, this increasing flow of 
immigrants has forced many countries to reevaluate their policies of admission 
and removal, as well as their allocation of rights and responsibilities to residents 
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and citizens alike. In this context, immigration issues have become the basis for 
popular expressions of frustration with domestic and international problems and 
are often highly politicized in the halls of government as flashpoints for issues of 
national security, transfer payments and basic criteria for naturalization. 

Yet, the failure to integrate newcomers successfully can have enormous 
consequences that reach across generations. The 2005 civil unrest in poor 
first- and second-generation immigrant neighborhoods in France tragically 
illustrates how institutional characteristics, such as rigid labor market polices 
and discriminatory hiring practices can result in unsuccessful incorporation of 
immigrants and their children and disruption of the host society. ° At the same 
time, more flexible labor markets can offer opportunities for economic growth and 
social mobility, as evidenced in the United States by the continued high rates of 
immigrant entrepreneurship and the movement of immigrant families into middle- 
class suburbs or, as Frey terms them, "melting class suburbs." (Frey 2005) 


Historical Views on Immigrant Incorporation and Social Membership 


The pressing challenges that contemporary levels of international migration pose 
for receiving states have sparked a resurgent interest by academics and policy 
makers alike in the study of immigrant incorporation and social membership, and 
how immigrant communities and nation states manage diversity and settlement. By 
and large, the models that scholars and policy makers have adopted reflect different 
approaches to incorporation by U.S. and European scholars, as well as regional 
differences in the relative roles of governments, public institutions, and immigrants 
and their communities in influencing incorporation and social membership. 

Thelong-standing model of national incorporation and social membership used 
by most sociologists in the United States is the assimilation approach. According 
` to this model, immigrants are encouraged to shed their homeland cultures and 
integrate as completely as possible into the social, cultural and political life of 
the receiving society (Gordon 1964).* Integration outcomes are largely left up 
to the immigrants themselves and the role of the state is limited to selective 
immigration policy, which provides legal entry for some groups (Glazer 1998: 
Bloemraad 2002). By and large institutional effects and state settlement polices 
are ignored, with some exceptions (Portes and Rumbaut 1996; 2001; Hagan and 
Gonzalez Baker; Jones-Correa 1998; Bloemraad 2006). As Castles points out, 
most American sociologists continue to view assimilation as a one-way process 
and as “inevitable and necessary for permanent migrants.” (2005;287) 

More recently and in response to the assumption of one-way assimilation, a 
number of scholars — mostly American sociologists and anthropologists — have 
introduced transnationalism as a new model of belonging and incorporation 
among contemporary migrants (Glick Schiller, Basch and Blanc-Szanton1992: 
Levitt 2001). This model disputes the eventual severance of ties between home 
and receiving nations and argues that migrants, their families and communities 
maintain social, cultural, political and economic linkages with home communities 
for long periods, sometimes leading to mobilized efforts that give rise to new 
forms of belonging, such as dual citizenship (Guarnizo 1998). Transnationalism is 
strongly associated with globalization, which fuels the cross-national movement 
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of ideas, money, norms and people, and ultimately supersedes the boundaries 
of the nation state. A few scholars also link assimilation to transnationalism by 
demonstrating the multiple social spaces migrants occupy, some of which also 
transcend national boundaries (e.g., Kivisto 2005; Smith 2006; Castles 2005). 
Both the assimilation and transnational model paradigms focus more on the 
immigrants and how they incorporate than on the role of the state. 

Differential exclusion, an alternative mode of incorporation, emphasizes 
the ways in which immigration policy in conjunction with labor market policies 
have historically worked to restrict the assimilation of some groups (Castles 
and Miller 2003; Castles 2005). This model is associated with temporary guest 
worker programs, such as the 1942-1964 U.S. Bracero Program for Mexican farm 
workers, or with the multiple guest worker programs that state and employer 
organizations institutionalized in Germany and France in the 1970s (Castles and 
Miller 2003). Under this model, immigrants are recruited from adjacent countries 
and upon arrival are viewed as temporary workers, legally prohibited from settling, 
forming families or becoming citizens. In this sense they are often incorporated 
into some sectors, such as labor markets, but excluded from participating in 
others, such as political life. 

An additional approach to immigrant incorporation and social membership that 
gained purchase in Europe in the 1970s is multiculturalism, which stresses ways 
in which national policies and public institutions accommodate cultural diversity 
and integrate newcomers. This model of national incorporation is associated with 
a series of official policies that Canada, Australia, the Netherlands and Sweden 
engineered and introduced to accommodate cultural differences and protect 
immigrant and ethnic rights in a number of sectors, ranging from education, 
to health to welfare (Castles and Miller 2003). Multiculturalism as an official 
policy lost some ground in the 1990s, in part because of the budgetary costs 
of implementing official social policies geared towards cultural maintenance 
and ethnic rights and also because of its unpopularity among national elites and 
social conservatives who view such policies as either catering to special interest 
groups or as a threat to national identity (Castles 2005; Papademetriou 2003). 
Scholars who use differential exclusion and multicultural approaches do bring 
the role of the state into their analyses but have not done so as comprehensively 
as the newer, developing approaches to integration developed by Penninx and 
the authors of papers introduced in this section. 


New Directions in Immigrant Integration: The Role of the State, Public 
Institutions and Immigrant Communities 


An emergent model of national incorporation that is gaining popularity among 
policy makers and in academic circles, especially in Europe, avoids some of 
the ideological and divisive elements of both assimilation and multicultural 
approaches to incorporation (Papademetriou 2003) and brings to the forefront 
the interactive role of state immigrant policy, public institutions and immigrant 
organizations in influencing incorporation and social membership. At the heart of 
national models of integration is the interaction between immigrant communities 
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and governmental institutions (Papademetriou 2006; Spencer 2006; Penninx 
2006, 2000, 2003). Rinus Penninx's conception of immigrant integration, for 
example, emphasizes the distinctive roles of communities, institutions, and the 
state in influencing integration outcomes (Penninx 2003). Penninx conceives of 
integration as the long-term process by which "individuals and groups become 
accepted members of society." In his formulation, which he and Vermeulen 
have applied to the case of the Netherlands, integration is a negotiated process 
that occurs between two groups of actors and agents: immigrants and their 
organizations and host governments and their political, economic and social 
institutions (Vermeulen and Penninx 2000). Moreover, a nation's institutional 
responses to newcomers generally contribute more to integration outcomes 
than do the actions of immigrants and their collective communities. Because 
of the influential role of national institutional structures, Penninx highlights 
the importance of integration policies that reinforce the connection between 
community and state at multiple levels to facilitate successful integration. 

As the papers in this section show, the settlement experiences of contemporary 
migrants often reflect a distinct mixture of existing and emergent models. What 
ties these articles together, however, is that they all underscore the salience of 
government settlement and social policies and public institutions in shaping 
social membership and integration fortunes. Collectively, the articles focus on 
social membership opportunities and constraints among newcomer immigrant 
groups across a number of advanced industrial states. The papers highlight the ` 
ways in which contexts of reception, public institutions and state incorporation 
policies influence a variety of social membership opportunities and outcomes, 
from religious behavior to citizenship prospects to work opportunities. Yet, 
as several of the authors demonstrate, integration is not a one-way process 
that structural opportunities alone determine. Immigrant communities and the 
institutions to which they belong, such as religious institutions and community 
based-ethnic organizations, are important vehicles of agency in that they 
mobilize immigrant concerns and negotiate with public institutions to enhance 
social membership opportunities. Although each of the papers in this section 
focuses on different dimensions of social membership — political, economic 
and religious — within different local, national and supranational contexts, 
the contributions move well beyond analyses of incorporation that focus on 
individual or group attributes; rather, they collectively emphasize the strong arm 
of national integration policies and institutional characteristics in influencing 
integration processes and outcomes. 

The paper by Prema A. Kurien is a richly textured case study of how one 
religious group - Hindu Americans - negotiates a place in America's multicultural 
religious landscape, and how both national policy regarding religion as well as the 
organizational structure and practices of Hindu-American religion are transformed 
in the process. Kurien shows that although national multicultural policy in the 
United States accepts religious diversity, such tolerance is usually restricted to 
the private sphere and officially based on notion of Christian (largely Protestant) 
tenants, and, as a consequence, non-Christian groups are often excluded from 
qualifying for and receiving public goods distributed by the state. To qualify for 
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tax breaks, for example, Hindu Americans have to work vigorously at redefining 
their organizational structure and religious practices. Kurien also reminds us 
how powerful religion is as a source of community solidarity and activism. Her 
analysis describes in detail the multiple strategies — including the successful 
Jewish American model — that Hindu Americans have adopted in their quest to 
get Hinduism officially recognized as an American religion in the post-September 
11 context. Drawing on the example of the ongoing court cases between public 
schools and the Hindu community over textbook material, Kurien keenly shows 
how social membership and integration is a negotiated and long-term process, 
involving resistance, struggle and accommodation between ethnic community 
organizations and national public institutions. 

The papers by Irene Bloemraad and by Jennifer Van Hook, Susan Brown and 
Frank Bean address a second dimension of incorporation, the timely issue of 
political incorporation and legal citizenship. Naturalization and the acquisition of 
legal status is generally an important marker for long-term integration, providing 
new employment opportunities and expanded social networks (Hagan 1994; 
Bloemraad 2002; Bratsberg, Ragan and Nasir 2002). Following the traditional 
European approach, Bloemraad emphasizes the importance of the nation state 
and its political institutions and policies in fundamentally shaping the experiences 
of immigrant residents, an approach that contrasts with another group of scholars 
who focus on the dynamics of post-national and transnational forces that suggest 
the declining relevance of national citizenship (e.g., Baubóck 1994; Carens 1987; 
Jacobson 1996; Soysal 1994). 

Bloemraad makes a major contribution to existing theories by integrating 
two theoretical bodies, social movement theory and institutional analysis, into a 
model of structural mobilization to explain the recent divergence in naturalization 
among immigrants in two of the largest immigrant-receiving nations today, the 
United States and Canada. Bloemraad's analysis of the United States-Canada 
divergence is especially telling as it occurs during a period of North American 
economic convergence resulting from globalization forces, especially NAFTA. 
Bloemraad finds that the usual theoretical explanations for naturalization, based 
either on immigrant characteristics and resources or on state immigration and 
citizenship policies, do not help solve the puzzle of divergent citizenship levels 
between the two countries as both the United States and Canada are relatively 
similar along these lines. Rather, she finds that that the institutional cultures of 
the two countries — the ways that government agencies deal with, promote and 
support citizenship — differ and that may largely explain the dichotomy. Bloemraad 
demonstrates that political incorporation is best understood as a social process of 
mobilization by immigrants, their personal networks and community organizations 
that are firmly nested in an institutional context characterized by government 
policies of diversity and newcomer settlement. Bloemraad has developed a 
theoretical framework to study cross-national citizenship behavior; no doubt, her 
efforts will stimulate more cross-national studies of political participation and 
naturalization. 

The paper by Van Hook, Brown and Bean also focuses on variations in political 
incorporation via legal citizenship, but across U.S. states. Following Bloemraad's 
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approach, the authors reaffirm the importance of political jurisdiction in shaping 
political incorporation. Their analysis studies immigrant groups during and after 
passage of the 1996 Welfare Act, legislation that required naturalization for some 
federal benefits and delegated administration of these benefits to the states. As 
such, the authors were able to examine the relative influence of the size of welfare 
benefit and state attitudes towards immigrants in pursuit of naturalization. Using 
the General Social Survey (GSS) data and longitudinal data from the Survey of 
Program Dynamics (SPD) for 1988-2002, the authors find that social context is 
the most important explanation behind naturalization. As the authors note, their 
findings challenge the common claim that the pursuit of naturalization is motivated 
by the desire to participate in social-welfare and affirmative-action programs. 
Rather, their findings "reinforce the idea that favorable contexts of reception 
foster immigrant incorporation, not welfare dependency, and point to policies that 
facilitate immigrant naturalization as more effective instruments for enhancing 
immigrant socioeconomic status than those adjusting immigration flows." 

Van Hook, Brown and Bean raise an important question for studies of political 
incorporation in the United States: how much do states differ in their approach 
to citizenship and integration? This is a novel approach in American scholarship 
and one that will become increasingly important in future years as more and 
more states take immigration matters, especially the provision of social benefits, 
into their own hands. Although the federal government is clearly charged ` 
with regulating the entry and removal of different groups of immigrants, it is 
increasingly the states that define and administer immigrant policy and the host 
of social programs available to newcomers (Fix and Zimmerman 2001). Thus, 
just as the devolution of immigrant policy under the Welfare Act led to variability 
among states in coverage and access for immigrants (Hagan, Rodriguez, Capps 
and Kibiri 2002), so might we anticipate large variations in future citizenship 
patterns, as the authors suggest. 

The papers by Irena Kogan and Christel Kesler focus on the salient and timely 
topic of economic incorporation of newcomer immigrants in the European Union, 
a region in which the economic and social fortunes of an increasing number 
of young immigrants look bleak and where a rising number of immigrants 
and their children are excluded from participating in labor markets. Immigrant 
unemployment rates are high in the Europena Union, but there is great variability 
by country, ranging from roughly 40 percent in France and Finland to a low of 
about 10 percent in Ireland, figures that in most countries are double the rates 
for natives. In her analysis Kogan assesses the relative influence of individual 
and institutional factors to explain the variability in the labor market situation 
of immigrants in the original 14 European countries. She finds that in addition 
to individual attributes, labor market characteristics and regulatory practices are 
important institutional features explaining immigrant employment prospects. In 
countries characterized by a large share of unskilled and low-skilled jobs, recent 
immigrants are less advantaged than their native-born counterparts, confirming 
the strong influence that labor market demands have on immigrant economic 
incorporation. She also finds that economic integration fortunes vary by welfare 
regimes; male immigrants residing in liberal welfare states (e.g., the United 
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Kingdom and lreland) enjoy better employment prospects than those in social 
democratic welfare states (e.g., Sweden, Denmark, Finland) or conservative 
welfare regimes (e.g., France, Germany), where many are excluded from labor 
markets. 

The paper by Kesler also emphasizes destination effects on the economic 
fortunes of immigrant men and women. The analysis considers patterns of 
joblessness among immigrants from 33 nations living in Britain, Germany 
and Sweden. Kesler finds that immigrant/native-born joblessness differences 
are considerably larger in Sweden than in Britain and Germany, net of human 
capital characteristics and origin effects. Like Kogan, Kesler turns to institutional 
characterics — occupational structure, access to welfare and discrimination 
practices — to explain these variations in immigrant incorporation. Despite the 
larger differential between the native born and immigrants in Sweden, Kesler 
finds that immigrant women in Sweden actually fare better than their sisters 
in Germany and Britian, in large part because employment policies in Sweden 
that bolster the employment prospects of native-born and immigrant women 
alike, especially those of married women with children. As such, Kesler's analysis 
highlights the central role that social policy plays in immigrant incorporation 
outcomes. Like the other papers in this section, the papers by Kogan and Kesler 
underscore the structural and institutional conditions under which decisions 
are made by immigrants as they seek opportunities in host countries. As such, 
the authors' analyses suggest the need for more cross-national research on the 
institutional arrangements that influence labor market outcomes for immigrant 
men and women. 


Future Directions 


As we press forward into the 21st century, the integration of newcomer 
populations will continue to pose enormous challenges for advanced industrial 
nations. Integration and social membership are long-term processes that reach 
across multiple state institutions and across generations. As each of these 
papers demonstrates, integration does not, as often assumed in the literature, 
take place solely at the level of the individual, where it is assessed in terms of 
an immigrant's social adaptation to the host society. As the papers by Prema A. 
Kurien and Bloemraad indicate, integration is also a collective endeavor occurring 
at the group level, as immigrant organizations mobilize resources and take action. 
Yet, as Bloemraad reminds us, these mobilizations are largely embedded in state 
public institutions and state social polices that either promote or hinder inclusion. 
Similarly, Van Hook, Brown and Bean caution against immigration policy as an 
integration mechanism; their analysis shows that social policy toward immigrants 
is far more effective in ensuring citizenship and long-term incorporation. 

While these papers beg especially for more cross-national studies that examine 
institutional effects on integration, several also remind us that integration polices 
are developed and implemented in national, institutional and even local contexts. 
As Kogan shows, laws governing labor market practices that promote or hinder 
inclusion are largely shaped by the type of national welfare regime in which they 
are implemented. Yet, as Kesler reminds us, national employment policies are 
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often gendered; Consequently, the economic fortunes of immigrant men and 
women can vary substantially depending on the nature of gendered institutional 
configurations in destination countries. Similarly, Kurien demonstrates that the 
introduction of Hindu religious institutions has posed some problems for U.S. 
social policies, reflecting the fact that America's exposure to religious diversity 
has largely been restricted to Protestant denominations and Judaism. Still further, 
as the paper by Van Hook, Brown and Bean argue, local context also matters 
when studying immigration; public attitudes toward immigrants can vary sharply 
across states, reflecting their own exposure to and historical experiences with 
newcomer groups. Similarly, state contexts of reception may indeed be more 
important than national policy when it comes to the pursuit of citizenship. Future 
studies of integration need to be placed in context if we are to understand how 
public perception and social polices change with time, and at multiple levels of 
analysis, from the nation state to the local municipality. 

The final narrative that emerges from these papers is that place matters when 
it comes to integration. Immigrants live and work in specific countries, states, 
provinces and cities, and the specific institutions and social policies of these 
places influence their integration. Thus, the importance of national and local 
jurisdictions evidenced in these papers cautions against a singular adoption of 
transnational or postnational studies when examining incorporation processes. 
While globalization allows migrants to maintain relationships simultaneously | 
with both their home and host communities, these ties are nested in larger 
structures. In this sense, social membership is simultaneously defined by both 
the immigrants and the state. 


Notes 


1. International migration between developing regions of the world is also 
substantial and rising. Scholars forecast that inter-regional labor and refugee 
flows in Africa and Latin America will constitute some of the next major 
waves of international migration (Castles and Miller 2003). 


2. From February to May of 2006, hundreds of thousands of people across the 
United States took to the streets to protest proposed legislation H.R. 4437, 
also known as the Border Protection, Anti-Terrorism and Illegal Immigrant Act 
of 2005, which would impose penalties on illegal residents, their employers, 
and churches and other groups and persons that assist them. The largest 
protests took place on April 10 in cities throughout the country. The protests 
culminated on May Day, when opponents of the bill stayed home from 
work and school, and refused to shop in support of the “Great American 
Boycott.” (Bahadur 2006) The Catholic Church played a key role in organizing 
the protests by encouraging parishioners to protest and providing clergy as 
speakers (Kerwin 2006). 


3. The civil unrest in October and November of 2005 in France, which led to 
a series of riots and clashes between youth and police, was activated by 
the deaths of two teenagers in a poor eastern suburb of Paris. The riots 
involved second generation immigrant youth who resided in poor suburbs 
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that offered few economic opportunities. Some observers argued that the 
unrest was an expression of frustration with high youth unemployment, 
discriminatory hiring practices, segregated housing and police harassment. 
The social unrest, which extended to other urban areas throughout France, 
triggered debates about integration in France (Astier 2005). 


4. This approach to immigrant integration lost currency in the United States 
in the 1960s and 1970s, during the heyday of the Civil Rights movement, 
when scholars criticized it on ideological and political grounds and it was 
replaced in part with various versions of pluralism and ethnic persistence 
(Greeley 1974). Assimilation resurfaced in the early 1990s, as a number of 
American scholars of immigration and race and ethnicity began to reconsider 
the "utility and validity" of assimilation in the modern era (Kivisto 2005:17). 
Some scholars expanded its original theoretical formulations to account for 
the role of history and gender (Morawksa 1994) and social class (Portes and 
Zhou 1997) and agency-structure linkages (Alba and Nee 2003) in immigrant 
incorporation, while others challenged its straight line assumption (Suarez- 
Orozco 2002). 


References 


Alba, Richard, and Victor Nee. 2005. "Rethinking Assimilation Theory for a New Era 
of Immigration." Pp. 235-76. /ncorporating Diversity: Rethinking Assimilation 
in a Multicultural Age. Peter Kivisto, editor. Paradigm. 


Astier, Henri. 2005 "French Muslims Face Job Discrimination." BBC News. 
Accessed Dec. 1 at" http:/news.bbc.co.uk/1/hi/world/Europe/4399748.stm. 


Bahadur, Gaiutra. 2006. "Workers Step from Shadows." Feb. 15. Philadelphia Inquirer. 
Accessed March 1 at: http:/www.philly.com/mld/inquirer/13873696.htm. 


Bloemraad, Irene. 2006. Becoming a Citizen: Incorporating Immigrants and 
Refugees in the United States and Canada. University of California Press. 


. 2002. "The North American Naturalization Gap: An Institutional Approach 
to Citizenship Acquisition in the United States and Canada.” International 
Migration Review 36:193-228. 





Basch, Linda, Nina Glick Schiller and Cristina Blanc-Szanton . 1994. Nations 
Unbound: Transnational Projects, Postcolonial Predicaments, and 
Deterritorialized Nation-States. Gordon and Breach. 


Bauböck, Rainer. 1994. Transnational Citizenship: Membership and Rights in 
International Migration. Edward Elgar. 


Bratsberg, Bernt Jr., James F Ragan and Zafar M. Nasir. 2002. “The Effect of 
Naturalization on Wage Growth: A Panel Study of Young Male Immigrants.” 
Journal of Labor Economics 20:568-97. 


Carens, Joseph H. 1987. "Aliens and Citizens: The Case for Open Borders." 7he 
Review of Politics 49:251-73. 


640 * Social Forces Volume 85, Number 2 * December 2006 


Castles, Stephen, and Mark J. Miller. 2003. 7he Age of Migration: International 
Population Movement in the Modern World. Guilford Press. ` 


Castles, Stephen. 2005. "Migration and Community Formation under Conditions 
of Globalization." Pp. 277-98. /ncorporating Diversity: Rethinking Assimilation 
in a Multicultural Age. Peter Kivisto, editor. Paradigm. 


Frey, William H. 2005. Melting Pot Suburbs: A Census 2000 Study of Suburban 
Diversity. Washington DC: Center on Urban and Metropolitan Policy, 
Brookings Institute. 


Glick Schiller, Nina, L Basch and C. Blanc-Szanton. 1992. Towards a Transnational 
Perspective on Migration. New York: New York Academy of Sciences. 


Glazer, Nathan. 1998. “Governmental and Nongovernmental Roles in the 
Absorption of Immigrants in the United States." Pp. 59-82. Paths to Inclusion: 
The Integration of Migrants in the United States and Germany. Peter H. Schuck 
and Rainer Münz, editors. Berghahn Books. 


Gordon, Milton. 1964. Assimilation in American Life: The Role of Race, Religion, 
and National Origins. Oxtord University Press. 


Greeley, Andrew, 1974. Ethnicity in the United States: A Preliminary Reconnaissance. . 
John Wiley & Sons. 


Guarnizo, Luis. 1998. “The Rise of Transnational Social Formation: Mexican and 
Dominican State Responses to Transnational Migration.” Political Power and 
Social Theory 12:45-94. 


Hagan, Jacqueline Maria, and Susan Gonzalez Baker. 1993. "Implementing 
the U.S. Legalization Program: The Influence of Immigrant Communities 
and Local Agencies on Immigration Policy Reform." /nternational Migration 
Review 27(3):513-36. 


Hagan, Jacqueline. 1994. Deciding to be Legal: A Maya Community in Houston. 
Temple University Press. 


Hagan, Jacqueline, Nestor Rodríguez, Randy Capps and Nika Kibiri. 2003. "The 
Effects of Recent Welfare and Immigration Reforms on Immigrants’ Access 
to Health Care.” /nternational Migration Review. 37 (2):444-63. 


Fix, Michael, and Wendy Zimmermann. 2001. “All Under One Roof: Mixed-Status 
Families in an Era of Reform." /nternational Migration Review 35:397-419. 


International Organization for Migration. 2000. World Migration Report 2000. 
Geneva: IOM. 


Jacobson, David. 1996. Rights Across Borders: Immigration and the Decline of 
Citizenship. Johns Hopkins University Press. 


Negotiating Social Membership * 641 


Jones-Correa, Michael. 1998. Between Two Nations: The Political Predicament of 
Latinos in New York City. Cornell University Press. 


Kerwin, Donald. 2006. "Immigration Reform and the Catholic Church." Migration 
Information Source. Washington DC: Migration Policy Institute. Accessed 
Feb. 1 at: http://www.migrationinformation.org/Feature/display.cfm?|D —395. 


Kivisto, Peter. 2005. "Social Places, Transnational Immigrant Communities, and 
the Politics of Incorporation." Pp. 299-319. /ncorporating Diversity: Rethinking 
Assimilation in a Multicultural Age. Peter Kivisto, editor. Paradigm. 


Levitt, Peggy. 2001. 7he Transnational Villagers. University of California Press. 


Morawska, Ewa. 1994. "In Defense of the Assimilation Model." Journal of 
American Ethnic History 13(2):76-87. 


Portes, Alejandro, and Rubén G. Rumbaut. 1996. /mmigrant America: A Portrait. 
University of California Press. 


. 2001. Legacies: The Store of the Immigrant Second Generation. University 
of California Press. 





Portes, Alejandro, and Min Zhou. 1993. "The New Second Generation: Segmented 
Assimilation and Its Variants.” 7he Annals ofthe American Academy of Political 
and Social Science 530:74-96. 


Reitz, Jeffrey G. 1998. Warmth of the Welcome: The Social Causes of Economic 
Success or Immigrants in Different Nations and Cities. Westview Press. 


Suarez- Orozco, Marcelo. 2002. "Everything You Ever Wanted to Know About 
Assimilation but Were Afraid to Ask." Pp. 19-42. Engaging Cultural Differences: 
The Multicultural Challenge to Liberal Democracies. R. Schweder, M. Minow 
and H. Marcus, editors. Russell Sage. 


Papademetriou, Demetrious. 2003. "Policy Considerations for Immigrant 
Integration." Washington DC: Migration Policy Institute. Accessed Feb.1 at: 
http://www.migrationinformation.org/Feature/display.cfm?lD — 171. 


Penninx, Rinus. 2005. "Elements for an EU Framework for Integration Policies 
for Immigrants." Pp. 74-83. Managing Integration: the European Union's 
Responsibilities Towards Immigrants. Rita Sussmuth and Werner Weidenfeld, 
editors. Washington DC: Migration Policy Institute. 


. 2003. "Integration: The Role of Communities, Institutions, and the State." 
Washington DC: Migration Policy Institute. Accessed Jan. 15 at: http://www. 
migrationinformation.org/Feature/display.cfm?ID — 168. 


Smith, Robert C. 2006. Mexican New York: Transnational Lives of New Immigrants. 
University of California Press. 


642 * Social Forces Volume 85, Number 2 * December 2006 


Soysal, Yasemin. u. 1994. Limits of Citizenship: Migrants and Postnational 
Membership in Europe. University of Chicago Press. 


Stalker, P. 2000. Workers without Frontiers: The Impact of Globalization on 
International Migration. Lynne Rienner. 


Vermeulen, Hans, and Rinus Penninx. 2000. /mmigrant Integration: The Dutch 
Case. Hat Spinouts. 


For Love or Money? 
Welfare Reform and Immigrant Naturalization 


Jennifer Van Hook, Bowling Green State University 
Susan K. Brown, University of California, Irvine 
Frank D. Bean, University of California, Irvine 


The Welfare Reform Act of 1996 generally restricted immigrants 
eligibility for welfare to those who had naturalized. By increasing 
the salience of naturalization, the law provides a unique opportunity 
to examine how social and economic contexts of reception influence 
immigrants pursuit of citizenship. This paper summarizes instrumental- 
legal (IL) and social-contextual (SC) theoretical perspectives on the 
foundations of citizenship and develops hypotheses on how social and 
economic contexts of immigrant reception after welfare reform influence 
naturalization behavior. Using General Social Survey (GSS) data and 
longitudinal data from the Survey of Program Dynamics (SPD) for 
1988-2002, the research finds that hypotheses about the influence of 
the social context of reception, as reflected in state-level favorability 
of attitudes toward immigrants, are most consistently supported in 
the data. The results hold important implications for both theories of 
immigrant incorporation and ideas about what constitutes the most 
effective policy instruments to enhance the social and economic status 
of immigrants. 


In 1996, the U.S. Congress passed the Personal Responsibility and Work 
Opportunity Reconciliation Act (PRWORA), more widely known as the Welfare 
Reform Act (Blank and Haskins 2001; Espenshade, Baraka and Huber 1997; Weil 
and Finegold 2002). Among its critical features was the introduction of citizenship 
as a criterion for receipt of welfare. By citizenship, we mean the legal (i.e., de 
Jure) specification of rights, privileges and obligations associated with nation- 
state membership (Aleinikoff 1990). Such rights and duties are often defined in 
political terms, but also may be extended (or delimited) to include (or exclude) 
social rights and obligations (Bendix 1977; Feldblum 2000; Marshall 1950; Ueda 
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2001). Because the Welfare Reform Act required most persons who became 
legal permanent residents after Aug. 22, 1996, to naturalize to become eligible for 
welfare (Fix and Passel 2002), it severely limited social citizenship, eliminating in 
most cases and substantially reducing in others, what had been nearly universal 
immigrant access in the United States to federal means-tested cash and non-cash 
social services. State governments were allowed to modify these restrictions. 
Nonetheless, the original legislation strongly signaled that political citizenship 
— not just legal residency — would henceforth be mandatory for inclusion in social 
welfare programs, at least at the level of federal funding. 

This action significantly narrowed the country’s definition of which members 
of the population could count on certain membership rights, in this case certain 
social rights (Aleinikoff 2001). Arguably, the last similarly important delimitation 
occurred in 1928, when legal permanent residents first were barred from voting in 
presidential elections (Raskin 1993). In addition, administrative wartime restrictions 
were directed primarily at Japanese-Americans, both legal permanent residents 
and citizens (Jensen 1988; Reimers 1992 [1985]: Takaki 1989). In the decades 
after World War Il, however, both the geo-strategic aspects of the Cold War and 
the ever more compelling appeals of the spreading Civil Rights movement to end 
legal racial discrimination fostered expansion rather than restriction of rights for 
immigrants (Bean and Bell-Rose 1999; Morris 1985). These tendencies eventually 
found further expression in the judicial system. In 1971, for example, the U.S. - 
Supreme Court ruled that "alienage" was a "suspect classification," that is, "that 
statutes disadvantaging aliens as a class are held invalid unless justified by some 
compelling state interest." (Rosberg 1977) This led to other judicial decisions 
declaring that states may not deny to legal permanent residents welfare benefits 
[Graham v. Richardson, 403 U.S. 365 (1971)], civil service employment [Sugarman 
v. Dougall, 413 U.S. 634 (1973)], access to the bar [in re: Griffiths, 413 U.S. 717 
(1973)], financial assistance for higher education [Nyquist v. Mauclet, 97 S. Ct. 
2120 (1977)] or licenses to practice civil engineering [Examining Bd. v. Flores de 
Otero, 426 U.S. 572 (1976)]. 

Such decisions illustrate that naturalization often increases access to tangible 
and intangible resources. While this is self-evident in the case of economic 
resources (e.g., naturalization entails eligibility for many kinds of employment 
unavailable to non-citizens [Bratsberg, Ragan and Nasir 2002]), it is also true for 
social resources (e.g., naturalization may generate new social networks, as well 
as increase access to old ones). Wider social contacts not only foster greater 
familiarity with employment opportunities, they also strengthen social integration 
more broadly (Bloemraad 2002; Granovetter 1973). However, immigrants’ 
motivations to seek naturalization for social benefits also seem likely to depend 
on how warmly immigrants are welcomed at new destinations. If immigrants 
face unfavorable social receptions, the social benefits of naturalization are likely 
to seem remote, or even non-existent. By contrast, if immigrants experience 
warm, favorable receptions, social benefits are likely to appear attractive and 
attainable. Thus, just as destination contexts of reception that offer greater 
tangible economic benefits may increase the possibility of naturalization, so too 
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may social contexts of reception that consist of more positive and supportive 
attitudes toward immigrants influence the pursuit of naturalization. 

The benefits from naturalization in turn facilitate economic and social 
incorporation among immigrants (Gerstle and Mollenkopf 2001; Morawska 
2003). Such possibilities further underscore naturalization's importance. In the 
case of the Welfare Reform Act, passage drew attention to the significance of 
naturalization by making welfare eligibility conditional on political citizenship. The 
law's implementation also introduced elements of uncertainty and ambiguity that 
heightened naturalization's salience even further. After 1996, state governments 
were allowed to extend immigrant benefits at their own expense; some did, 
to varying degrees. Also, Congress itself subsequently restored some benefits 
to certain legal permanent residents (Fix and Zimmermann 2001). While many 
non-citizen immigrants thus faced the withdrawal of benefits, others became 
confused and uncertain about which benefits were being restored and which 
were not (Gilbertson and Singer 2003), outcomes likely to have underscored 
the importance of naturalization. The 1996 Welfare Reform Act thus provides 
a propitious opportunity to examine how state-level contextual aspects of 
immigrant reception influence the tendency to naturalize. The present research 
investigates how the legislation by itself and in combination with state-level 
differences, both in attitudes toward immigrants and in welfare benefit levels, 
affected the probability of naturalization among immigrants after 1996. 


Theories About the Pursuit of Naturalization 


In the United States, to be eligible to become a naturalized citizen, a migrant 
must be at least 18 years old, a legal permanent resident and a resident of the 
country for at least five years (Woodrow-Lafield et al. 2001). Immigrants must 
also demonstrate the ability to speak, read and write English, pass a test on 
U.S. government and history, and be of good moral character (for example, not 
have been convicted of a felony). A few legal permanent residents are exempt 
from some of these requirements. For example, the spouses of U.S. citizens, 
children of naturalized citizens face only a three-year residency requirement 
and military personnel may naturalize after only one year of active duty. In 
addition, minor children of legal permanent residents may become citizens 
when their parents naturalize (Woodrow-Lafield 2004). Independent of such 
legal restrictions, naturalization rates vary considerably across groups and over 
time. For example, Mexicans and Canadians have been less likely to naturalize 
(Jasso and Rosenzweig 1990), although naturalization rates among Mexicans 
appear to have increased somewhat during the 1990s (Balistreri and Van Hook 
2002). Those who naturalize also tend to possess characteristics that suggest 
they may be more invested in the United States economy (e.g., homeowners, 
self-employed workers) and less likely to emigrate (e.g., they come from 
countries that they may be less likely to return to because of distance, low 
GNP per capita and low literacy rates) (Barken and Khokhlov 1980; Beijbom 
1971: Bernard 1936; Jasso and Rosenzweig 1986; Yang 1994). Family status is 
important in that those with children are more likely to naturalize (Liang 1994b; 
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Yang 1994). Finally, naturalization varies with age at immigration (Yang [1994] 
finds a convex curvilinear relationship) and gender in that women are more 
likely to naturalize (Jasso and Rosenzweig 1986; Yang 1994). 

Theoretically we may distinguish two major, complementary views about 
the foundations of citizenship and the factors that may influence the pursuit of 
naturalization. One perspective tends to see citizenship as involving distinctly 
political-economic rights (Ong 1999), with the most prominent political right being 
the right to vote and the major economic right being access to certain employment 
opportunities (Aleinikoff 2001). Those who become citizens will be able to vote 
and to pursue new job possibilities; in turn, they are expected to embrace largely 
uniform national identifications (Aleinikoff 2003; Schuck 1998). This perspective 
on U.S. citizenship, which we term the instrumental-legalistic (IL) view, envisions 
immigrants individually and quite explicitly naturalizing for political and economic 
reasons. From this vantage point, variations in naturalization across groups and 
over time are explained predominately within the framework of rational-choice 
models (DeVoretz and Pivnenko 2004; 2005; Jasso and Rosenzweig 1990), with 
such models assuming that the decision to naturalize derives from weighing the 
benefits vs. the costs of becoming a citizen. 

What are some of the major costs? One is the loss of citizenship in the country 
of origin (when dual citizenship is not permitted). Loss of citizenship in the country 
of origin can reduce or eliminate access to public benefits (Such as retirement 
funds, public health care or welfare), restrict travel and close off employment * 
options in the country of origin. Therefore, those coming from high-income 
countries that (1) do not permit dual citizenship and (2) offer generous public 
benefits or lucrative employment opportunities only to citizens are less likely to 
naturalize than those from other countries. Other costs include those incurred in 
the process of acquiring citizenship (Such as bureaucratic hassles, fees and time 
spent improving language skills, and knowledge to pass English and civics tests), 
so those with low income and from non-English speaking countries are less likely 
to naturalize. The major benefits of naturalization include access to more jobs 
(because many government, defense and other jobs require U.S. citizenship), 
public assistance (although access to public assistance by non-citizens varies 
by state), the ability to sponsor relatives for immigration, and the right to vote. 
Overall, immigrants are more likely to naturalize if they meet the legal criteria 
(age and residency) and if the benefits of naturalization outweigh the costs of 
doing so. In short, immigrants are more likely to naturalize if they perceive greater 
benefits from doing so. 

Other ideas suggest different bases for citizenship and reasons for the pursuit 
of naturalization (Bloemraad 2006; Feldblum 2000). Some of these, including 
those that focus on the operation of post-nationalist and transnationalist forces, 
imply a diminishing relevance of national citizenship altogether (Baubóck 1994; 
Carens 1987; Jacobson 1996; Soysal 1994). However, because it is difficult to 
imagine any sort of industrial or post-industrial welfare state without markers of 
state membership (Galloway 2000), we focus not on the irrelevance of citizenship 
but rather on definitions involving less restrictive criteria for membership (Levitt 
2003). Such approaches note the existence of multiple kinds of citizenship and 
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often the prevalence of transnational, including dual, citizenship (Basch, Glick 
Schiller and Szanton Blanc 1994; Gilbertson and Singer 2003; Ong 1999; Portes, 
Guarnizo and Landolt 1999). Because such perspectives see the pursuit of 
naturalization as depending on social contextual factors that provide material 
and symbolic benefits of naturalization (Bloemraad 2002, 2006; Liang 1994b; 
Morawska 2001, 2003), we term this view the social-contextual approach. 

Bloemraad (2004, 2006) has developed particularly insightful theories about 
how state-level institutional supports for immigrant settlement might influence 
the likelihood of naturalization. Her primary idea is that the tangible support 
provided to newcomers from institutional and state sources helps to shape 
immigrant contexts of reception (Portes and Rumbaut 2001; Reitz 2003). The 
availability of such institutional and state programs influences how welcome 
immigrants feel at arrival, how much settlement help they receive, and how much 
assistance they can draw upon when learning the skills required for naturalization 
(e.g., knowledge of civics and English) (Bloemraad 2003). Such factors foster 
the motivation to apply for citizenship by making immigrants feel welcome and 
by providing assistance with naturalization. We extend her approach to suggest 
that the overall favorability of social contexts of reception will positively influence 
the likelihood of naturalization. Just as the availability of tangible assistance may 
increase the likelihood of naturalization, so, too, may favorable but intangible 
attitudes toward immigrants encourage naturalization by signaling that immigrant 
contributions to new locales are welcomed and valued. 

The positive effect of favorability may also depend on immigrants' social and 
structural integration, which in turn may influence newcomers' incentives to 
naturalize and their needs to demonstrate commitment to integration through 
naturalization. Diehl and Blohm (2003), drawing on social psychological theories 
of social integration (Hechter 1971; Tajfel and Turner 1979), note that peripheral 
(subordinate) immigrants are more likely to accept the legitimacy of the core 
(superordinate) group and pursue social and legal integration if they perceive the 
host society's system of status attainment as open and social mobility possible. 
In general, then, more favorable attitudes (views more supportive of immigrants) 
are likely to increase naturalization, although the overall relationship may be 
curvilinear. So long as support is at least moderate, naturalization offers a way 
of formalizing (and demonstrating commitment to) membership in the society 
that is providing expected or ongoing opportunities for economic mobility. It 
also announces the intention to seek even further opportunities for economic 
mobility. But immigrants who face the most unfavorable attitudes (e.g., because 
many members of their group tend to be unauthorized) may perceive themselves 
as ineligible for mobility and as lacking incentives to embrace membership, so 
their naturalization tendencies may remain low. Similarly, immigrants who live 
in states with the most favorable receptions may be the most upwardly mobile 
and the most easily incorporated economically, thus diminishing any need to 
demonstrate membership. Applying such ideas to naturalization in the United 
States, we would expect a curvilinear pattern whereby rates of naturalization 
among immigrants increase as state-level public support of immigrants increases, 
but then decline in states with the highest levels of support. 
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In this study, we focus empirical attention on two contextual factors that may 
affectimmigrant naturalization. One involves a key aspect of economic context (the 
size of the welfare benefit in the state where the immigrant lives) and the other a 
key aspect of social context (the favorability of attitudes toward immigrants in the 
state). Based on the above theoretical considerations, we hypothesize that both 
welfare benefits and favorability of attitudes will positively affect the likelihood 
of naturalization. Because of the greater salience of naturalization after welfare 
reform, we further hypothesize that these tendencies will be more pronounced 
during the post-reform compared to the pre-reform period, especially in states that 
restored benefits for non-citizens. We also expect the influence on naturalization 
of favorability of attitudes toward immigrants to diminish as such attitudes 
become even more favorable because the warmth of immigrant reception may 
also encourage upward mobility, which in turn may reduce the need to seek 
membership as a demonstration of worthiness (i.e., thus generating a ceiling 
effect). Similarly, the positive influence of favorable attitudes on naturalization 
may reverse in states providing the very highest levels of welfare benefits because 
the strongest economic safety nets /n combination with favorable attitudes may 
be sufficient to eliminate the need to demonstrate worthiness and commitment 
through naturalization (i.e., such conditions may generate an interaction effect 
between state welfare benefit level and attitudes). 

Some immigrants may be less responsive to their social contexts of reception 
than others. One such group is refugees fleeing political persecution in their . 
countries of origin. Their settlement in the United States is given high priority and 
special economic support by the U.S. government (Bloemraad 2006). Because 
they receive extra governmental settlement assistance at the onset of their 
time in the country, they are less likely to be affected by vagaries in contexts of 
reception. Given the nature of the hypotheses we are testing, we thus exclude 
refugees from our analyses. Another group is migrants from Mexico, who consist 
mainly of low-education persons who come to the country primarily to work, 
many intending to return to Mexico after relatively short periods of time (Bean 
and Stevens 2003). We hypothesize that such labor migrants will be less sensitive 
to their contexts of reception because they do not have strong expectations of 
mobility given their low levels of education, their widespread unauthorized status 
at entry, and their frequent intentions of returning to their countries of origin 
(Bean, Brown and Rumbaut 2006; Portes and Rumbaut 2001). 
Data and Measures 
Data and Samples 


To assess the relationship between changing contexts of immigrant reception 
and naturalization, we rely on data from the Survey of Program Dynamics (SPD). 
The SPD is a relatively new data source that was designed and implemented by 
the U.S. Census Bureau to study the short- and medium-term effects of welfare 
reform. It offers unique advantages over other data sets for the purposes of this 
study because it includes information on immigration, welfare and naturalization 
histories. Perhaps most significant is that the SPD includes data on the year 
naturalization occurred (Liang 1994a) and thus permits the examination of 
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naturalization both before and after welfare reform. The study follows a sub- 
sample of the original respondents of the 1992 and 1993 panels of the Survey of 
Income and Program Participation (SIPP) (with oversamples of poor and minority 
families), first in 1997 with the SPD Bridge Survey (a modified version of the 
March 1997 CPS), and then annually from 1998 through 2002 with the SPD 
survey. We exclude children from the analysis because they are not eligible for 
naturalization apart from their parents. Non-citizens who entered the country or 
reached age 18 later than 1988 enter the sample in the year they arrived in the 
country or reached age 18. 

We also exclude those originating from countries that have predominantly 
sent refugees to the United States in the 1970s and 1980s. Even prior to welfare 
reform, U.S. immigration policy established different eligibility rules for receiving 
public assistance in the case of persons entering as refugees vs. the case of 
persons entering as permanent resident aliens. Refugees were (and still are) 
immediately eligible whereas regular immigrants are not (Gordon 1987; Vialet 
1993). We identify as refugees those coming from 11 nations, almostall of whose 
immigrants to the United States during the 1980s were refugees.' Because we 
estimate non-parametric discrete-time event history models of naturalization, 
the data are organized into person-year observations, with each individual 
contributing a person-year for each year they remain at risk of naturalizing, 
including the year they naturalized. The final analytic sample (excluding 
Mexicans, refugees and cases with missing values on key variables) includes 
969 non-citizen adults age 18+ as of 1988. These individuals contributed 5,957 
person years, 4,409 during the pre-reform period (1988-1996), and 1,548 during 
the post-reform period (1997-2002). 

The available data permit the tracking of individuals in the sub-sample for nine 
to 10 years from 1992/93 to 2002. In addition, because retrospective information 
was collected, individuals may be tracked starting as early as 1988. Thus, the data 
enable the examination of naturalization patterns before welfare reform (1988- 
1996) and after (1997-2002). Even though the SPD offers such unique advantages, 
it suffers the disadvantage of relatively high attrition rates. However, even large 
amounts of attrition will not bias results unless cases fall out of the sample in a 
non-random manner such that attrition is significantly associated with the error 
term (i.e., associated with both the dependent and key independent variables) 
(Fitzgerald, Gottschalk and Moffitt 1998). In extensive analyses to detect the 
presence of attrition bias, we found that attrition in the SPD, while high, is not 
significantly associated with citizenship status among immigrants. Moreover, 
because the key independent variables in our analyses - state welfare benefit 
level and state immigrant receptivity — also do not appear to be associated with 
attrition, we think it reasonable to conclude that attrition is unlikely to distort our 
findings significantly. 


Statistical Models 


We estimate discrete-time hazard models (Allison 1995) to model the probability 
of naturalization among non-citizens. We use discrete- rather than continuous- 
time models (such as Cox proportional hazards) because our data lack precise 
information about the timing of moves into citizenship; we know only whether 
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a change occurred between interviews (usually conducted at one-year intervals). 
Also, discrete-time hazard models can easily handle time-varying covariates, 
right-censorship and left-truncated cases (if start times are known) (Allison 1995; 
Guo 1993). One potential problem associated with event history methods is left- 
truncation bias. Our sample is left-truncated because it includes non-citizens who 
had been living in the United States for many years and thus had been exposed 
to the "risk" of naturalization prior to their inclusion in the sample. This introduces 
sample selection bias because ongoing "non-citizen" spells are likely to be of 
longer duration (shorter non-citizen spells had already ended before the beginning 
of the observation period because these people had naturalized earlier). To handle 
this problem, we control for approximate duration of ongoing non-citizen spells 
(based on the first time the respondent came to the United States to stay), thereby 
estimating conditional likelihood discrete-time hazard models that condition the 
likelihood function on the length of the spell. Conditional likelihood models are 
identical to the standard discrete-time hazard models except that spell duration 
is measured starting from the beginning of the spell, not from the time the case 
first enters the sample (Guo 1993). 
The naturalization model we estimate is: 


L, [P(N; Y (t-P(N,) ]- «T, +0,R, +0,4,+0,B,+0,C,+0,X,+0,Z4,7 (1) 


TY 
where P(N, ) indicates the probability of naturalization (1 = naturalized, 0 = non- 
citizen) for respondent i in state j at yeart conditional on being a non-citizen in year 
t1. T is the number of years the respondent has lived in the United States and 
has thus been exposed to the risk of naturalization; R indicates the time period 
for yeart (1 — postreform, 0 — pre-reform); A is a non-time varying measure of 
attitudes toward immigrants after welfare reform for non-citizens living in state j; 
B is the 1997 value of welfare benefits provided to natives living in state j; C is the 
level of access to benefits that states provided to non-citizens after 1996: X is a 
vector of non-time-varying control variables (race, ethnicity, gender and country- 
of-origin grouping); and Z is a vector of time-varying control variables taken from 
the prior year t-1 (marital status, educational attainment, home ownership and 
income-to-poverty ratio). The model is estimated on a sample of person-year 
observations for all non-citizens during the years they report as non-citizen until 
and including the year they report as naturalized or are censored. 

To test whether naturalization probabilities changed differentially by state 
benefit level or state's attitudes toward immigrants, we add and test the 
significance of two-way interaction terms: 


L. [P(N Y (P (N,) ]- o4T, +0,R,+0,A, +0,B,+0,C + 
aX 0 ZB (R, x A; 9, (R, xB; +8, (R, xC,) (2) 


Thus, a, is the effect of living in a favorable-attitude state prior to welfare reform, 
a, + B, is the effect after welfare reform, and P, is the change for those living in 
favorable attitude states. Similarly, a, is the effect of state welfare benefit level 
prior to welfare reform, a, + B, is the effect after welfare reform, and p, is the 
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change. Also a, + B, is the effect of living in a state with high access to benefits 
after reform and p, is the change. Finally, to test whether and how the effect of 
state attitudes toward immigrants on changes in the probability of naturalization 
varies by state benefit level, we add and test the significance of a three-way 
interaction term (while including all underlying two-way terms): 


E [P (N, ye (Ni) i= 1, 4 0, R, 0,4; 0, Br-Fo,C; * o X; 
Zier +B, (R, xA, )=B, (R, x B, kp. (BO: kp, (R, x A;x B.) (3) 


p, is the additional effect of living in a high-benefit, favorable state on post- 
reform differences in the likelihood of naturalization. Because of the clustered, 
highly stratified sampling design of the SIPP/SPD (Chakrabarty 1989), we 
adjust all standard errors and other test statistics to take into account design 
effects. Because our key independent variables are state-level measures, not 
individual-level measures, the standard errors for these variables have been 
adjusted accordingly. 


Measures 


Nativity/Citizenship 

The SIPP collects information on naturalization status and migration history at 
the second SIPP interview (four months after the first SIPP interview), and the 
SPD again collects these data in the Bridge and annual surveys. In addition, the 
2001 and 2002 SPD include year of naturalization for those who naturalized. We 
classify as immigrants those who were born outside the United States and U.S. 
areas (such as Puerto Rico), and were not born abroad of American parents. 
Some respondents reported citizenship status in one interview but not in others. 
For these cases, we make the assumption that citizenship status did not change 
over time. Other foreign-born respondents reported as naturalized citizens even 
though they had not been living in the United States long enough to qualify for 
naturalization (five years in most cases, three years for those with U.S.-born 
spouses). Following Passel and Clark (1997, 1998), we recode these individuals as 
non-citizens. We infer the year of naturalization based on the year their citizenship 
status was observed in the data to have changed from non-citizen to citizen (this 
is possible because naturalization status is asked at each interview). For those 
who naturalized prior to the first interview in 1992/1993, we use reported year 
of naturalization to estimate the timing of changes in citizenship status. Thirty 
percent of this group did not report year of naturalization. For this group, year 
of naturalization was approximated as the mid-point between 1992 and the year 
the respondent had been in the United States for at least five years. Because the 
year of naturalization was imputed to be earlier than 1988 for most members 
of this latter group (81 percent), the vast majority of those with imputed year 
of naturalization were eventually dropped from the analytical sample for the 
naturalization models, which was restricted to non-citizens who naturalized in 
1988 or later and those who never naturalized. 
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Table 1: Descriptives for Analytic Sample 
— '——— ———— ——n—— — — -——— ——— 


Mean or 

Percentage — SD' 
Naturalized in Year t (96) 5.8 — 
Immigrant Receptivity (z-score form: -1.7, 4.9) AT 1.03 
State Welfare Benefit Level (z-score form: -1.7, 1.9) .08 DU 
Non-citizen Welfare Access (0,1) E .30 
Pre-Reform: 1987-1996 (9%) 73.3 — 
Post-Reform: 1997-2001 (96) 26.7 — 
Proportion of Naturalization Applications Completed in 1 year (.4, .7) D2 .10 
Years in U.S. (0, 90) 14.60 11.12 
Age 18-29 (96) 18.4 — 
Age 30-44 (96) 44.2 — 
Age 45-64 (%) 29.5 — 
Age 65+ (%) 7.9 — 
Non-Hispanic White (%) 28.2 — 
Asian (%) 31.6 — 
Hispanic (%) 32.0 — ` 
Black (%) 8.2 — 
U.S.-born Spouse (%) 25.3 — 
Naturalized Spouse (%) 12.3 — 
Non-citizen Spouse (96) 31.8 — 
Not Married (96) 30.6 — 
Number of Children Age 0-14 in Household (0, 9) 1.00 125 
Disabled (96) 4.6 — 
Homeowner (%) 50.8 — 
Poor English (%) 19.6 — 
Welfare Recipient Any Time Before 1997 (96) 32.4 — 
0-8 Years Schooling (96) 173 e 
9-11 Years Schooling (96) 9.3 Sa? 
HS Graduate (%) 30.5 D 
More than HS (%) 42.9 Ces 
Person-years 5.957 


Source: 1992-2002 SPD Longitudinal File. 

Sample: Person-years from 1988 to 2002 contributed by immigrants who were non-citizens in 
the prior year. Refugees and those born in Mexico and Central America are excluded (see text for 
explanation). Range of continuous variables given in parentheses. 

!Standard deviations provided only for continuous variables. 
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State Welfare Policy 

We also examine the influence of state welfare benefit level on naturalization. 
State-level information on welfare benefits was obtained from the New 
Federalism state database available from the Urban Institute. We developed a 
standardized measure of state benefit level based on both the 1997 SSI state 
supplement available to couples living independently with no other income other 
than federal SSI, and the 1997 AFDC/TANF benefit paid to a single, non-working 
parent with two children. Since the SSI and TANF benefits run on different scales, 
we first created standardized scores for state SSI and TANF benefit levels. We 
then averaged the two z-scores, yielding a score ranging approximately from -1.7 
to 1.7, with average states falling close to zero. We treat state benefit level as 
non-time varying, because the amounts states paid to welfare recipients rarely 
changed during the 1990s. 

To measure the extent to which states make their safety nets available 
to immigrants during the post-reform period, we use the “Safety Net” scale 
created by Zimmermann and Tumlin (1999), which we refer to here as “access.” 
The scale takes into account 12 separate categories of immigrant eligibility 
decisions, such as whether the state provides Medicaid or TANF to immigrants 
entering the United States after August 1996, or whether the state created a 
food program for immigrants no longer covered by Food Stamps. Points are 
awarded to states for having programs in each category and deducted for the 
introduction of certain restrictions to immigrants. The scale ranges from one to 
four, with states that made their safety nets the most available to immigrants in 
category 1 and those that placed the most restrictions on immigrants in category 
4. A full description of the scale is included in Zimmermann and Tumlin (1999, 
Appendix B). To aid interpretation of coefficients in the multivariate models, we 
recoded the variable so that it varies from 0 to 1 (with categories 1, 2, 3 and 4 
recoded respectively to 0.00, 0.33, 0.66 and 1.00), and then reverse-coded the 
scale with higher values indicating greater access to welfare programs for non- 
citizens following welfare reform. 


Other State Characteristics 

To measure the extent to which state-level attitudes are favorable toward 
immigrants and immigration, we used a scale developed by De Jong and Tran 
(2001) and De Jong and Steinmetz (2004) from individual responses of natives to 
questions about immigrants and immigration in the General Social Survey in the 
years 1995 through 1997. Because of space constraints, we omit details about 
the use of the scale in this study but invite inquiries from interested readers. We 
also control for other state-level factors that could influence the likelihood of 
naturalization among non-citizens. Some states, as a part of their welfare reform 
policy, encouraged non-citizen welfare recipients to apply for naturalization so 
that they then would become eligible for federal public assistance programs. 
We construct a variable that indicates whether the respondent was living in a 
state that adopted a naturalization program linked to welfare receipt. State-level 
information about postreform naturalization programs was obtained from a 
state-level database compiled by researchers at the Urban Institute (Zimmerman 
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1999). Because this variable was highly correlated with the welfare access 
measure (r = .80), we estimated models with the naturalization program variable 
both included and excluded. The results were essentially similar in both cases 
(available from the authors by request), and we include only the latter here. The 
time between filing an application and taking the oath of citizenship also varies 
across INS district offices (U.S. General Accounting Office 1997). To control for 
this variation, we included a measure of the probability that an application for 
naturalization would be completed in one year. To calculate this probability, for 
each INS district office we divided the number of applications completed in a 
given year by the sum of the number completed in that year plus the number 
of applications that were pending from the previous year. We then averaged the 
1996, 1997 and 1998 estimates. 


Other Factors 

We include in our multivariate models controls for socioeconomic status, family 
composition and other demographic characteristics, most of which are allowed 
to vary from year to year. Indicators of socioeconomic status and human capital 
(educational attainment, English proficiency and homeownership) are likely to 
have direct effects on naturalization (Choi 1992; Coe 1985a, 1985b; Menefee 
1981; Moffitt 1992; McGarry 1996; Warlick 1980). 

Educational attainment is treated as time-varying and thus may change from ` 
year to year. We classify respondents as attaining 0-8 years of education, 9-11 
years, a high school diploma, and attending college or more. The SPD 1997 
Bridge Survey collected data on English language proficiency. Respondents are 
asked how well they speak English: Not very well/not at all, Not well, Well, and 
Very well. Finally, homeownership is important because others have found it 
to be a significant predictor of naturalization (Yang 1994), and is an indicator of 
wealth that may be tapped as a substitute for governmental support in times 
of economic need (Choi 1992; McGarry 1996; Soldo 1988; Tienda 1986). Like 
educational attainment, homeownership is a time-varying measure that may 
change from year to year. In addition to socioeconomic status, we include in 
our models controls for certain socio-demographic factors that are likely to be 
associated with naturalization and welfare receipt. These include years in the 
United States (plus a squared term), current spouse's citizenship status, age (18- 
29, 30-49, 50-64, 65+), gender, disability status, number of children under age 
15, and race/ethnicity (Asian, Hispanic, black vs. non-Hispanic white). All except 
gender and race/ethnicity are treated as time-varying. 


Results 


After the passage of welfare reform in 1996, the average annual probability 
of naturalization for the immigrants included in our analytical sample from 
the SPD went up by a factor of about five (see Table 2). In interpreting this 
increase, it must be remembered that although the study follows cohorts 
of non-citizens over time, it examines annual probabilities of naturalization. 
Naturalization occurs only once (citizens do not "de-naturalize") and depends 
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in part on the accumulation of U.S. experience. Thus, some increase in the 
cumulative number of naturalizations over time would be expected on account 
of these dynamics alone. However, no increase would be expected in the 
probability simply because of the passage of time. Thus, the increase that 
occurs in the likelihood of naturalization is not caused by duration, and welfare 
reform appears likely to remain a plausible cause. In other words, the higher 
probabilities of naturalization in the post-reform period are consistent with the 
idea that the salience of naturalization increased substantially as a result of the 
federal legislation that made legal citizenship a condition for obtaining social 
citizenship (namely eligibility for receipt of social services). 


Table 2: Average Annual Probability of Naturalizing 


Pre-reform ^ Post-reform Change 
Pre-reform Welfare Receipt 


Non-recipient .030 .142 d 2^ 
Recipient .022 .140 .118* 
Immigrant Receptivity 

Unfavorable (< -1.0) .023 .082 .059* 
Average (-1.0 - 1.0) .027 .178 41548 
Favorable (> 1.0) .034 .158 .124* 
State Welfare Benefit Level 

Low Benefit (< 0) .035 .140 .105* 
High Benefit (= 0) .022 .143 121 
State Welfare Access to Non-citizens 

Low Access (< .5) .051 Sie .061* 


High Access (= 5) .023 147 .124* 


Source and Sample: See Table 1. 
* change is statistically significant (p — .05) 


To what extent does this mean that the rise appears to derive from legal 
permanent residents seeking to become citizens in order to gain eligibility for 
welfare, as IL theory would predict? A clue in this regard may be obtained by 
examining the increase in the probability of naturalization among non-welfare 
recipients compared to recipients (with welfare recipiency measured prior to 
welfare reform). If the pursuit of naturalization were occurring primarily to obtain 
welfare, we would expect to see higher increases among welfare recipients after 
welfare reform than among non-recipients. However, we do not. Instead the 
increase is the same for both recipients and non-recipients (.117 in the case of 
recipients vs. .112 in the case of non-recipients, a difference that is not statistically 
significant, p = .76). Rather, the predictions of SC theory are more accurate. For 
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example, for immigrants living in states with more favorable social contexts of 
reception toward newcomers (states with attitudinal receptivity “scores of 1.0 or 
higher), the average post-reform annual probability of naturalizing is about .158, 
as compared to .082 for immigrants living in more unfavorable reception states 
(with receptivity scores less than -1.0). Moreover, the increase from the pre-reform 
period is greater for immigrants in favorable (about .125) than unfavorable (0.059) 
states, suggesting an important role for social context in affecting naturalization. 
That social context may matter is reinforced by the probability of naturalizing 
actually not being greater in high as compared with low benefit states .143 vs. 
.140), and by the increases in the probabilities of naturalizing after welfare reform 
in high and low benefit states being about the same (and also not significantly 
different from each other ). Finally, as predicted by SC theory, immigrants are 
more likely to naturalize in high than low access states (.147 vs. .112 in the post- 
reform period), and the change over time is significantly greater in high access 
states (.124 vs. .061). 

Such unadjusted figures, however, do not control for the influence of other 
factors that also affect naturalization, so it is premature at this point to embrace 
conclusions about the influence of social contextual factors on the explanation 
of postreform naturalization tendencies. More definitive evidence requires 
multivariate models that control for differences in other factors affecting 
naturalization. To obtain such results, we estimate event history models as. 
described above, and then we evaluate their outcomes by calculating predicted 
probabilities of naturalization for combinations of values on key independent 
variables. Odds ratios showing the effects of the control and independent 
variables are shown in Table 3 for various additive models containing combinations 
of key independent variables. In the case of the control variables, the direction 
and magnitude of their influence on naturalization is largely consistent with 
expectations. Having a naturalized spouse, for example, increases the likelihood 
of naturalization substantially, as does having more than a high school education. 
In the case of state welfare benefit level, however, we note that this variable is 
positively related to the odds of naturalization among immigrants, but the odds 
ratios are not statistically significant. In the case of social contextual receptivity, 
however, immigrant attitudinal receptivity is strongly positively related to the 
likelihood of naturalizing among immigrants, although in a less positive way the 
higher the level of receptivity, as indicated by the significance of the squared 
term. Thus, the effects of our measure of social context on the annual probability 
of naturalizing conform to theoretical expectations, whereas that of the measure 
of economic context does not, indicating that SC theoretical predictions about 
socio-cultural factors affecting the pursuit of citizenship seem more accurate 
than IL predictions. 

We also find, as predicted, that the influence of the contextual variables is 
stronger in the post-reform period than before. Table 4 presents results of tests 
for the statistical significance of the interaction between time-period and each of 
the contextual factors. As shown in the last four rows of the table, the interaction 
effect for the attitudinal receptivity terms is strong. The interaction term involving 
access determines the effect of states restoring benefits after welfare reform; 
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Table 3: Discrete-time Event History Models of Naturalization (Odds Ratios) 


Model 1 Model 2 Model 3 

Squared Oren .89* .89* 
State Welfare Benefit Level 1.01 — 1.10 
Non-citizen Welfare Access — 1.23 1.50 
Post-Reform Period: 1997-2001 (2 124 Got 
Naturalization Processing Time 155 EH I5 
Years in United States 1.00 1.00 1.00 

Squared 1.00 1.00 1.00 
Age 18-29 .81 .82 .82 
Age 30-44 75 16 ELS 
Age 45-64 .87 .88 .88 
(Age 65+) — — — 
Asian 132 1.32 1732 
Hispanic 1.06 1.08 1.09 
Black 1.45 1.47 1.48 
(NH-White) — — — 
U.S.Born Spouse .80 .J9 .80 
Naturalized Spouse 1.65* 1.64* 102^ 
Non-citizen Spouse EA BO dm 
(Not Married) = = — 
N Children Age 0-14 1.09 1.09 1.09 
Disabled 1:23 123 1.22 
Homeowner el. [372 1.31% 
Poor English .93 .94 .94 
Welfare Recipient (pre-reform) TT 1.13 TT 
0-8 Years of Schooling EC oio 38 one 
9-11 Years of Schooling .96 .96 .96 
HS Graduate .65* Ze Go 
(More than HS) = = Gs 
Person-years 5,957 5,957 5,957 


Pseudo R-squared 138 139 139 


EDS OUI ps sO p= .05 
Source and Sample: See Table 1. 
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the term involving benefit level determines the effect of variation in state-level 
benefits after reform. Including both in the equation enables*assessment of 
whether benefit levels add to the explanation of the probability of naturalization 
beyond simply restoring benefits. As can be seen from the first two models, 
which include benefit level and state restoration of access separately, each factor 
positively influences naturalization. Because higher benefits tend to characterize 
states that restored access (r = .69), however, we are particularly interested in the 
effect of benefit level when the access term is also included in the model (Model 
3). In this case, neither of the interactions is significant. Thus, state variation 


Table 4: Interaction Effects of Time Period 


Model 1 Model 2 Model 3 

Immigrant Receptivity OL -97 .96 

Squared 1.01 1.04 1.04 
State Welfare Benefit Level 14* — .91 
Non-citizen Welfare Access — 261% ZAS 
Post-Reform Period: 1997-2001 MO > 3.08** 4.50*** 

x Immigrant Receptivity ZEE 2.06*** 2 17 

x Receptivity Squared e LO JO: 

x State Welfare Benefit Level Kë — 1.36 

X Non-citizen Welfare Access — 4.00** 2.20 


A UY AS 05 
Source and Sample: See Table 1. 
All models control for socioeconomic and demographic factors (see Table 3 for list of co-variates) 


in benefit levels following welfare reform does not add to the explanation of 
the probability of naturalization beyond the influence of states having restored 
benefits, contrary to the expectations of IL theory. 

We evaluate the form of the statistically significant effects by calculating 
predicted probabilities from models that include the interaction terms and all 
of the control variables. The results are shown in Figure 1 for the pre-reform 
and post-reform periods separately and illustrate in graphic form the importance 
of the social context of attitudinal receptivity in affecting naturalization. That 
such effects may become less positive as "warmth" increases is hinted at in 
the downturn of the effect at higher receptivity levels. We also test explicitly for 
the presence of the interaction between attitudinal receptivity and benefit level 
after welfare reform by examining the three-way interaction term between time- 
period, benefit level and receptivity (the latter broken down into low, medium and 
high categories). Such an effect is not only highly significant as hypothesized, 
but it also follows the predicted pattern (Figure 2). That is, the highest annual 
probability of naturalizing occurs in those states with score combinations of 
benefits (scaled 1 and 0 for high and low) and attitudinal receptivity (scaled 2, 1 
and 0 for high, medium and low) that are moderate (have combined scores that 
sum to 2 rather than 3, 1 or 0). 
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Figure 1. Predicted Probability of Naturalizing by Time Period and State Immigrant Receptivity 
25 


20 4 


445 - Em M 
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Figure 2. Predicted Annual Probability of Naturalizing, by Time Period, State Welfare 
Benefit Level and State Immigrant Receptivity 
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Such combinations occur when states have high benefits but moderate 
receptivity (Connecticut, Massachusetts, Minnesota, New York and Rhode 
Island), or low benefits but high receptivity (Arizona, District of Columbia, Hawaii, 
Indiana, Maryland, Michigan, New Jersey, Ohio, Oregon, Pennsylvania, Virginia 
and Washington). These states have contexts of social and economic reception 
most likely to foster circumstances favoring naturalization as predicted above by 
SC theory. Conversely, the lowest probability of naturalization occurs in states 
with either very low levels of support (low receptivity and low public assistance 
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benefits: Arkansas, Colorado, Nebraska, New Hampshire, North Carolina, South 
Carolina, Utah, Maine/Vermont, North Dakota and South Dakota), or in states 
with the greatest levels of support (high receptivity and high public assistance 
benefits: Illinois and Wisconsin), states where immigrants either possess 
the fewest incentives to naturalize or the least need to do so to demonstrate 
worthiness and commitment to their new destinations. 


Summary and Discussion 


The present research seeks to take advantage of the changes in citizenship 
criteria and welfare availability resulting from the 1996 Welfare Reform Act to 
assess the nature and relative influence of social and economic contexts of 
immigrant reception on naturalization behavior. The research accomplishes this 
by deriving predictions about the effects of these contexts on naturalization 
from two general frameworks about the foundations and nature of citizenship 
and thus the attendant reasons they imply for the pursuit of naturalization. The 
findings support the idea that social-contextual factors matter at least as much for 
naturalization as instrumental-legal ones, in that more support appears in the data 
for SC than IL theory. That warmer attitudinal contexts of reception most increase 
naturalization, that naturalization probabilities increase after welfare reform as 
much among welfare non-recipients as recipients, that benefit levels do not add 
to the explanation of naturalization beyond what access explains, and that high - 
benefits reduce the need for naturalization only under the condition of warm 
receptions are all findings that would not be predicted by IL theory. The results 
of the research thus fall into substantial agreement with ideas that argue the 
social contexts of immigrant reception are important influences on naturalization 
behavior (Bloemraad 2006). By extension, such contexts are also likely to increase 
successful immigrant incorporation, because citizenship makes a considerable 
difference for full social and economic participation in the society. 

In terms of implications for immigration and immigrant policy, the results 
indicate that decisions such as migration, seeking naturalization and even 
participating in welfare, can be — and often are - largely motivated by socially 
based considerations. This involves a different image of immigrants than a view 
that depicts them as primarily motivated by desires for individual utlilitarian gain. 
The latter often tends to assume that migration and naturalization result from the 
possibility of obtaining welfare in the United States. The recommended policy 
focus that follows from this assumption is one that typically targets immigration 
rather than settlement policy, suggesting the limitation or curtailment of 
immigration to keep out immigrants who are thought to be drawn to the country 
for welfare. The results of the present research, however, imply that naturalization 
decisions are motivated more by hospitable welcomes than by the possibility of 
welfare receipt. They thus reinforce the idea that favorable contexts of reception 
facilitate immigrant naturalization rather than encouraging welfare dependency, 
thus pointing to the need for policies that facilitate immigrant naturalization. Such 
policies would seem more likely to constitute effective instruments for boosting 
immigrant economic status and incorporation than those that simply curtail 
immigration as a way to deter the admission of persons with lower economic 
statuses. 
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Note 


1. These countries are the former Soviet Union, Cuba, Vietnam, Laos, Romania, 
Iran, Ethiopia, Cambodia, Poland, Afghanistan and Nicaragua. In the case of 
persons coming during the 1990s, countries from the Balkans are added to 
this list. This classification strategy is viable because refugees have tended to 
come from only a relatively small number of countries (Fix and Passel 1994). 
In addition, countries that send refugees usually do not send large numbers 
of other legal immigrants. The 11 refugee-sending countries sent almost 90 
percent of all refugee arrivals to the United States during the 1980s, and 
more than 91 percent of those who came were classified refugees (Bean and 
Stevens 2003). 


References 


Aleinikoff, T. Alexander. 1990. "Citizens, Aliens, Membership, and the Constitution." 
Constitutional Commentary 7:9-34. 


. 2001. "Policing Boundaries: Migration, Citizenship, and the State." 
Pp. 267-91. E Pluribus Unum? Contemporary and Historical Perspectives 
on Immigrant Political Incorporation. G. Gerstle and J. Mollenkopf, editors. 
Russell Sage Foundation. 


. 2003. "Between National and Postnational: Membership in the United 
States." Pp. 110-29. Toward Assimilation and Citizenship: Immigrants in Liberal 
Nation-States. C. Joppke and E. Morawska, editors. Palgrave Macmillan. 





Allison, Paul D. 1995. Survival Analysis Using the SAS System: A Practical Guide. 
SAS Publishing. 


Balistreri, Kelly, and Jennifer Van Hook. 2002. "Diversity and Change in the 
Institutional Context of Immigrant Adaptation: California Schools 1985-2000." 
Demography 39:639-54. 


Barkan, E.R., and N. Khokhlov. 1980. "Socioeconomic Data as Indexes 
of Naturalization Patterns in the United-States — a Theory Revisited." 
Ethnicity 7:159-90. 


Basch, Linda, Nina Glick Schiller and Cristina Szanton Blanc. 1994. Nations 
Unbound: Transnational Projects, Postcolonial | Predicaments, | and 
Deterritorialized Nation-States. Gordon and Breach. 


Baubóck, Rainer. 1994. 7ransnational Citizenship: Membership and Rights in 
International Migration. Edward Elgar. 


Bean, Frank D., and Stephanie Bell-Rose. 1999. /mmigration and Opportunity: 
Race, Ethnicity and Employment in the United States. Russell Sage. 


Bean, Frank D., Susan K. Brown and Rubén G. Rumbaut. 2006. "Mexican Immigrant 
Political and Economic Incorporation." Perspectives on Politics 4:309-13. 


662 * Social Forces Volume 85, Number 2 * December 2006 


Bean, Frank D., and Gillian Stevens. 2003. America’s Newcomers and the 
Dynamics of Diversity. Russell Sage Foundation. 


Beijbom, U. 1971. Swedes in Chicago: A Demographic and Social Study of the 
1846-1880 Immigration. Chicago Historical Society. 


Bendix, Reinhard. 1977. Nation-Building and Citizenship: Studies of Our Changing 
Social Order. University of California Press. 


Bernard, WS. 1936. “Cultural Determinants of Naturalization.” American 
Sociological Review 1:943-53. 


Blank, Rebecca, and Ron Haskins. 2001. The New World of Welfare. Washington, 
D.C.: The Brookings Institution. 


Bloemraad, Irene. 2001. “Outsiders and Insiders: Collective Identity and Collective 
Action in the Quebec Independence Movement, 1995." Research in Political 
Sociology (The Politics of Social Inequality) 9:271-305. 


. 2002. "The North American Naturalization Gap: An Institutional Approach 
to Citizenship Acquisition in the United States and Canada." /nternational 
Migration Review 36:193-228. 





. 2004. "Who Claims Dual Citizenship? The Limits of Postnationalism, ` 
the Possibilities of Transnationalism, and the Persistence of Traditional 
Citizenship." /nternational Migration Review 38:5-42. 





. 2006. Becoming a Citizen: Incorporating Immigrants and Refugees in the 
United States and Canada. Berkeley: University of California Press. 





Bratsberg, Bernt Jr, James F. Ragan and Zafar M. Nasir. 2002. "The Effect of 
Naturalization on Wage Growth: A Panel Study of Young Male Immigrants." 
Journal of Labor Economics 20:568-97. 


Carens, Joseph H. 1987. "Aliens and Citizens: The Case for Open Borders." 7he 
Review of Politics 49:251-73. 


Chakrabarty, Rameswar P. 1989. "Multivariate Analysis by Users of SIPP Micro- 
Data Files." SIPP Working Paper Series No. 8915. U.S. Bureau of the Census: 
Washington, D.C. 


Choi, Namkee G. 1992. "Correlates of the Elderly's Participation and 
Nonparticipation in the Supplemental Security Income (SSI) Program: A New 
Evaluation." Journal of Sociology and Social Welfare 19:85-104. 


Coe, R. 1985a. "A Longitudinal Analysis of Nonparticipation in the Food Stamp 


Program by Eligible Households." Institute for Research on Poverty, University 
of Wisconsin, Madison. 


. 1985b. “Nonparticipation in SSI by the Eligible Elderly." Southern Economic 
Journal: 891-97. 





Welfare Reform and Naturalization * 663 


De Jong, Gordon F., and Michele Steinmetz. 2004. "Receptivity Attitudes and the 
Occupational Attainment of Male and Female Immigrant Workers." Population 
Research and Policy Review 23(2):91-116. 


De Jong, Gordon F, and Q.G. Tran. 2001. "Warm Welcome, Cool Welcome: 
Mapping Receptivity Toward Immigrants in the U.S." Population Today 
29(8):1,4-5. 


DeVoretz, Don J., and Sergiy Pivnenko. 2004. "The Economic Causes and 
Consequences of Canadian Citizenship.” Discussion paper #1395. Bonn, 
Germany: Institute for the Study of Labor. 


. 2005. “Self-Selection, Immigrant Public Finance Performance, and 
Canadian Citizenship.” Discussion paper #1463. Bonn, Germany: Institute 
for the Study of Labor. 





Diehl, Claudia, and Michael Blohm. 2003. “Rights or Identity? Naturalization 
Processes among ‘Labor Migrants’ in Germany.” /nternational Migration 
Review 37:133-62. 


Espenshade, Thomas J., Jessica L. Baraka and Gregory A. Huber. 1997. 
“Implications of the 1996 Welfare and Immigration Reform Acts for U.S. 
Immigration.” Population and Development Review 23:769-801. 


Feldblum, Miriam. 2000. “Managing Membership: New Trends in Citizenship and 
Nationality Policy.” Pp. 475-99. From Migrants to Citizens: Membership in a 
Changing World. T.A. Aleinikoff and D. Klusmeyer, editors. Washington, D.C.: 
Carnegie Endowment for International Peace. 


Fitzgerald, John, Peter Gottschalk and Robert Moffitt. 1998. "An Analysis of 
Sample Attrition Bias in Panel Data: The Michigan Panel Study of Income 
Dynamics." 7he Journal of Human Resources 33:251-99. 


Fix, Michael, and Jeffrey S. Passel. 1994. /mmigration and Immigrants: Setting the 
Record Straight. Washington, D.C.: The Urban Institute Press. 


. 2002. "Assessing Welfare Reform's Immigrant Provisions." Pp. 179-202. 
Welfare Reform: The Next Act. A. Weil and K. Finegold, editors. Washington, 
D.C.: The Urban Institute Press. 





Fix, Michael, and Wendy Zimmermann. 2001. “All Under One Roof: Mixed-Status 
Families in an Era of Reform.” International Migration Review 35:397-419. 


Galloway, Donald. 2000. “The Dilemmas of Canadian Citizenship Law.” Pp. 
82-118. From Migrants to Citizens: Membership in a Changing World. T.A. 
Aleinikoff and D. Klusmeyer, editors. Washington, D.C.: Carnegie Endowment 


for International Peace. 


Gerstle, Gary, and John Mollenkopf. 2001. £ Pluribus Unum? Contemporary 
and Historical Perspectives on Immigrant Political Incorporation. Russell 
Sage Foundation. 


664 * Social Forces Volume 85, Number 2 * December 2006 


Gilbertson, Greta A., and Audrey Singer. 2003. "The Emergence of Protective 
Citizenship in the USA: Naturalization Among Dominican Immigrants in the 
Post-1996 Welfare Reform Era." Ethnic and Racial Studies 26:25-51. 


Gordon, Linda W. 1987. "Southeast Asian Refugee Migration to the United 
States." Pp. 153-73. Pacific Bridges: The New Immigration from Asia and the 
Pacific Islands. J.T. Fawcett and B.V. Carino, editors. New York: Center for 
Migration Studies. 


Granovetter, Mark. 1973. "The Strength of Weak Ties." American Journal of 
Sociology 78:360-80. 


Guo, Guang. 1993. "Event-History Analysis for Left-Truncated Data." Sociological 
Methodology 23:217-43. 


Hechter, Michael. 1971. “Towards a Theory of Ethnic Change.” Politics and 
Society 2(1):21-45. 


Jacobson, David. 1996. Rights Across Borders: Immigration and the Decline of 
Citizenship. Johns Hopkins University Press. 


Jasso, Guillermina, and Mark R. Rosenzweig. 1986. “Family Reunification and the 
Immigration Multiplier: U.S. Immigration Law, Origin-Country Conditions, and _ 
the Reproduction of Immigrants.” Demography 23:294-311. 


. 1990. The New Chosen People: Immigrants to the United States. Russell 
Sage Foundation. 





Jensen, Joan. 1988. Passage from India: Asian Indian Immigrants in North 
America. Yale University Press. 


Levitt, Peggy. 2003. “Keeping Feet in Both Worlds: Transnational Practices 
and Immigrant Incorporation in the United States.” Pp. 177-94. Toward 
Assimilation and Citizenship: Immigrants in Liberal Nation-States. C. Joppke 
and E. Morawska, editors. Palgrave MacMillan. 


Liang, Zai. 1994a. "On the Measurement of Naturalization.” Demography 
31:525:48. 


. 1994b. "Social Contact, Social Capital, and the Naturalization Process: 
Evidence from Six Immigrant Groups.” Social Science Research 23:407-37. 





Marshall, T.H. 1950. Citizenship and Social Class and Other Essays. Cambridge 
University Press. 


McGarry, Kathleen. 1996. "Factors Determining Participation of the 


Elderly in Supplemental Security Income." 7he Journal of Human 
Resources 31:331-58. 


Menefee, J.A., B. Edwards and S.J. Schieber. 1981. "Analysis of Nonparticipation 
in the SSI Program." Social Security Bulletin 44(6): 3-21. 


Welfare Reform and Naturalization *665 


Moffitt, Robert. 1992. "Incentive Effects of the U.S. Welfare System: A Review." 
Journal of Economic Literature 30:1-61. 


Morawska, Ewa. 2001. "Immigrants, Transnationalism, and Ethnicization: A 
Comparison of This Great Wave and the Last." Pp. 175-212. E Pluribus Unum? 
Contemporary and Historical Perspectives on Immigrant Political Incorporation. 
G. Gerstle and J. Mollenkopf, editors. Russell Sage Foundation. 


. 2003. "Immigrant Transnationalism and Assimilation: A Variety of 
Combinations and the Analytic Strategy It Suggests.” Pp. 133-76. /ntegrating 
Immigrants in Liberal Nation-States: From Post-national to Transnational. C. 
Joppke and E. Morawska, editors. Palgrave MacMillan. 





Morris, Milton. 1985. /mmigration: The Beleaguered Bureaucracy. Washington, 
D.C.: The Brookings Institution. 


Ong, Aihwa. 1999. Flexible Citizenship: The Cultural Logic of Transnationality. 
Duke University Press. 


Passel, Jeffrey, and Rebecca L. Clark. 1997. “How Many Naturalized Citizens are 
There? An Assessment of Data Quality in the Decennial Census and CPS.". 
Paper presented at the Population Association of America annual convention. 


. 1998. /mmigrants in New York: Their Legal Status, Incomes, and Taxes. 
Washington, D.C.: Urban Institute. 





Portes, Alejandro, Luis Guarnizo and Patricia Landolt. 1999. "Introduction: 
Pitfalls and Promise of an Emergent Research Field." Ethnic and Racial 
Studies 22:217-37. 


Portes, Alejandro, and Rubén G. Rumbaut. 2001. /egacies. The Story of the 
Immigrant Second Generation. University of California Press and Russell 
Sage Foundation. 


Raskin, Jamin. 1993. "Legal Aliens, Local Citizens: The Historical, Constitutional, 
and Theoretical Meanings of Alien Suffrage." University of Pennsylvania Law 
Review 141:1391-1470. 


Reimers, David M. 1992 [1985]. Stil! the Golden Door: The Third World Comes to 
America. Columbia University Press. 


Reitz, Jeffrey G. 2003. "Host Societies and the Reception of Immigrants." 
La Jolla: Center for Comparative Immigration Studies, University of 
California, San Diego. 


Rosberg, Gerald M. 1977. "Aliens and Equal Protection: Why Not the Right to 
Vote?" Michigan Law Review 75:1092-1136. 


Schuck, Peter H. 1998. Citizens, Strangers, and In-Betweens: Essays on Immigration 
and Citizenship. Westview Press. 


666 * Social Forces Volume 85, Number 2 * December 2006 


Soldo, Beth J., and Emily Agee. 1988. "America's Elderly." Population 
Bulletin 43:1-51. > 


Soysal, Tasemin Nuhoglu. 1994. Limits of Citizenship: Migrants and Postnational 
Membership in Europe. University of Chicago Press. 


Tajfel, H., and J.C. Turner. 1979. "An Integrative Theory of Intergroup Conflict." 
Pp. 33-47. The Social Psychology of Intergroup Relations. W.G. Austin and S. 
Worchel, editors. Brooks/Cole. 


Takaki, Ronald. 1989. Strangers from a Different Shore: A History of Asian 
Americans. Little, Brown. 


Tienda, Marta, and Leif Jensen. 1986. "Immigration and Public-Assistance 
Participation: Dispelling the Myth of Dependency" Social Science 
Hesearch 15:372-400. 


U.S. General Accounting Office. 1997. "Processing Differences Exist Among INS 
Field Units." Washington: General Accounting Office. 


Ueda, Reed. 2001. "Historical Patterns of Immigrant Status and Incorporation 
in the United States." Pp. 292-327. E Pluribus Unum? Contemporary and 
Historical Perspectives on Immigrant Political Incorporation. G. Gerstle and J. 
Mollenkopf, editors. Russell Sage Foundation. i 


Vialet, Joyce C. 1993. “Refugee Admissions and Resettlement Policy.” Washington, 
D.C.: Congressional Research Service. 


Warlick, J.L. 1980. “Participation of the Aged in SSI." Discussion Paper 618 (1980). 
University of Wisconsin-Madison: Institute for Research on Poverty. 


Weil, Alan, and Kenneth Finegold. 2002. “Welfare Reform: The Next Act.” 
Washington, D.C.: The Urban Institute Press. 


Woodrow-Lafield, Karen A. 2004. “Pathways to U.S. Citizenship for Child 


Immigrants.” Paper presented at the Population Association of America 
annual meeting, March 31-April 2. Philadelphia, PA. 


Woodrow-Lafield, Karen A., Xiaohe Xu, Thomas Kersen and Bunnak Poch. 2001. 
“Naturalization Experiences of U.S. Immigrants.” Pp. 106-11. Proceedings of 
the Social/Government Statistics Section, American Statistical Association, 


2000 Joint Statistical Meetings. Alexandria, VA: American Statistical 
Association. 


Yang, Philip O. 1994. "Explaining Immigrant Naturalization." /nternational Migration 
Review 28:449-77. 


Zimmermann, Wendy, and Karen C. Tumlin. 1999. "Patchwork Policies: State 


Assistance for Immigrants under Welfare Reform." Occasional Paper No. 24. 
Washington, D.C.: The Urban Institute. 


Becoming a Citizen in the United States and Canada: 
Structured Mobilization and Immigrant Political Incorporation 


Irene Bloemraad, University of California, Berkeley 


This article uses the puzzle of diverging trajectories of immigrant 
citizenship in the United States and Canada to build a new approach 
to the study of citizenship and political incorporation. I consider three 
existing models of citizenship: an approach that considers citizenship 
adoption as the product of cost/benefit calculations; an approach that 
understands individuals and groups to be differentially endowed with 
the skills, resources and interests necessary to acquire citizenship; and an 
approach that believes countries adopt citizenship regimes which either 
include or shut out immigrants. I then offer an alternative model of 
structured mobilization which views political incorporation as a social 
process of mobilization by friends, family, community organizations 
and local leaders that is embedded in an institutional context shaped 
by government policies of diversity and newcomer settlement. The 
material and symbolic resources provided by government shape the 
ability and interest of “social helpers” to assist with and mobilize 
around citizenship. The article concludes by considering the implications 
of structured mobilization for various debates in immigration and 
political sociology. 


In the mid-1900s, almost four of every five foreign-born U.S. and Canadian 
residents held citizenship in their adopted lands. Fifty years later, citizenship 
levels had changed little in Canada: about three quarters of the foreign-born 
had Canadian passports in 2001. In contrast, U.S. citizenship hovered at its 
lowest level in a century. Only two of every five foreign-born had naturalized 
in 2000. Why have the levels of citizenship in the United States declined so 
precipitously, and why are they so much lower than in Canada, a country with 
a similar immigrant tradition? 

| use the puzzle of diverging North American citizenship trajectories to build 
a new theoretical approach to the study of immigrant citizenship and political 
incorporation. | consider three existing approaches to citizenship acquisition. 
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Each provides some understanding of U.S.-Canada citizenship differences, but 
fails to bridge the micro-macro divide in citizenship accounts. The assumption of 
independent, autonomous would-be citizens inherent in many existing models 
of naturalization fails to adequately reflect the socia/ nature of immigrant political 
incorporation. | offer instead a model of structured mobilization that draws on 
analytical tools used by social movement scholars and researchers working in the 
tradition of “new’ institutionalism. Immigrant citizenship and political incorporation 
is a process akin to social movement mobilization, involving friends, family, co- 
ethnic organizations and local community leaders. Especially for immigrants who 
face language barriers, unfamiliarity with mainstream institutions and weaker ties 
to native-born citizens, fellow immigrants and local organizations critically shape 
how they think about citizenship. 

At the same time, the ability of these people and organizations to promote 
citizenship is influenced by government policies around immigrant integration 
and diversity. Localized mobilization is nested in, and affected by, a much larger 
institutional structure. Government policies provide instrumental and material 
assistance to immigrant communities and organizations, and such policies shape 
immigrants’ understanding of citizenship as a political and emotional bond to the 
adopted country. Immigrants thus make decisions about naturalization as people 
embedded in communities and nationally defined rules and understandings 
of citizenship. Communities and institutional structures do not merely offer 
contexts in which costs and benefits are weighed, but fundamentally shape 
perceptions of costs and benefits, as well as newcomers’ interest in and ability 
to acquire citizenship. 

The central thrust of this article is an analytical elaboration of the structured 
mobilization framework. It is informed by existing empirical research on 
immigrant citizenship and a comparative study of naturalization and political 
engagement among Portuguese immigrants and Vietnamese refugees in 
metropolitan Boston and Toronto.’ It is motivated by the belief that citizenship 
matters to immigrants, because without it they have diminished political voice, 
and to native-born citizens, because newcomers’ failure to become citizens 
undermines the meaning of democracy. 


Diverging Citizenship in North America: Comparing the United States and Canada 


Data from the 2004 U.S. Current Population Survey indicate that out of a foreign- 
born population of more than 34.2 million, slightly more than 13.1 million or 
38 percent, held U.S. citizenship. Figure 1, which tracks the level of citizenship 
among foreign-born residents in the United States from 1890 to 2000, shows that 
this level of citizenship is lower than at any point in the 20th century.? 

Part of this decline reflects the increase in unauthorized immigrants in the 
United States over the past 20 years. Because unauthorized migrants are legally 
barred from citizenship, their presence drives down aggregate citizenship 
levels.’ Using demographic techniques that attempt to distinguish between 
legal and unauthorized migrants, Fix, Passel and Sucher (2003) calculate that 
while citizenship levels plummeted from 64 percent of legal immigrants in 1970 
to 39 percent in 1996, naturalization levels among legal residents rose in 2002 
to 49 percent.^ 
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Aggregate citizenship levels can also be deceptive because they vary greatly 
with migration flows and length of residence. During periods of substantial 


born living in the United 


States (the denominator for calculations), but because they cannot become 


immigration, new arrivals increase the number of foreign 
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citizens immediately, their presence drives down aggregate citizenship levels. 
Renewed large-scale migration in the last three decades of the 20th century 
consequently accounts for some of the recent decline in U.S. citizenship levels. A 
trend line on Figure 1 shows the percentage of the U.S. population that is foreign- 
born in a given census year. Not surprisingly, citizenship levels fell during the two 
great waves of U.S. migration, from 1880 to 1924, and from 1965 to the present. 
Conversely, during periods of restricted immigration when the average length of 
residence increases, as from 1924 to 1965, citizenship levels rise.? According to 
Schmidley and Gibson (1999), about a third of the decline in U.S. naturalization 
from 1970 to 1996 stems from the rapid increase in immigration since the 1965 
Immigration and Nationality Act Amendments. 

Still, more recent immigrants acquire citizenship more slowly and at a lower 
level than those who came before them. In 1970, 57 percent of immigrants with 
11to 15 years of residence held U.S. citizenship. Ten years later, only 44 percent of 
those with 11 to 15 years of residence reported citizenship; in 1990, 41 percent; 
and in 2000, only 37 percent." New large-scale immigration depresses aggregate 
citizenship levels, but more recent migrants are also less likely to naturalize. 

Recent cohorts' slower U.S. naturalization contrasts with the trend in Canada. 
Of those with 11 to 15 years of residence, 59 percent held Canadian citizenship 
in 1971, 68 percent in 1981, 78 percent in 1991, and 83 percent in 1996.? More 
rapid naturalization of recent migrants translates into higher aggregate citizenship . 
levels, as shown in Figure 2. For most of the 20th century citizenship in the United 
States and Canada looked remarkably similar, but in the 1970s — as both countries 
began welcoming large numbers of immigrants from the Caribbean, Asia and 
Latin America - citizenship trajectories diverged. Canada, like the United States, 
experienced an increase in its foreign-born population — from 15 percent in 1971 
to 19 percent in 2001 - yet the greater number of new arrivals did not decrease 
overall citizenship. Instead, naturalization in 2001 is slightly higher than in 1971, 
and significantly greater than in the United States. 


Explaining Immigrant Citizenship: Current Theorizing 


Various commentators worry about U.S. declines. Some identify what they see as 
deterioration in the quality of today's immigrants (Borjas 1999; Camarota 2001). 
Newcomers' lower human capital relative to the native-born allegedly hinders 
incorporation into the labor market, retards social integration such as language 
learning and, by extension, undermines immigrants' naturalization. Others claim 
that contemporary immigrants' cultural values clash with America's Anglo- 
Protestant civic culture, or that the prevalence of easy, low-cost communication 
and transportation technologies keep immigrants with feet in two worlds, limiting 
their attachment to the adopted country (Huntington 2004). Limited attachment 
is believed to be compounded by the decline in value of citizenship. 


Cost-benefit Arguments and the Declining Significance of Citizenship 


Between a third to half of the contemporary decline in U.S. naturalization levels 
stems from increases in the number of immigrants and in the relative proportion 
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of unauthorized migrants, but what accounts for the rest of the drop? At first 
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citizens in the allocation of jobs, benefits and rights (Hollifield 1992; Plascencia, 
Freeman and Setzler 2003; Schuck 1998; Soysal 1994). Thus, in ah actor-centered 
model, naturalization becomes less likely after individuals calculate the declining 
utility of naturalization compared to the time, money and costs associated with 
citizenship (Jasso and Rosenzweig 1986, 1990; Jones-Correa 1998; Yang 1994). 
These accounts adopt an implicit or explicit model of human behavior as rationally 
motivated and chosen. In a structural approach, the importance of citizenship is 
declining in a world increasingly allocating rights based on personhood rather than 
state membership, and one in which supranational actors and non-governmental 
organizations provide alternative venues for extending and fighting for rights 
(Baubóck 1994; Jacobson 1996; Soysal 1994). Either as a micro-level choice or a 
change in the macro-level boundaries of citizenship, both accounts suggest that 
declines in naturalization stem from changes in the instrumental and material 
benefits of citizenship. 

Yet the relative importance of citizenship in the United States has not changed 
very much and, since 1996, it is arguably greater than in the 1970s and 1980s. 
Citizenship guarantees the right to be in the United States. Without it, non-citizens 
can be subject to mandatory detention and automatic deportation for relatively 
minor crimes, a practice that government has increasingly used since the mid- 
1990s. Citizenship also provides civil, social, economic and political rights. 
It determines eligibility for some public benefits since the 1996 U.S. Personal. 
Responsibility and Work Opportunity Reconciliation Act. Except for a few 
localities, citizenship is required for voting or running for office. Citizenship also 
influences employment in the public sector or defense industry. For immigrants 
wishing to sponsor family to the United States, citizenship provides quicker and 
easier family reunification. 

The costs of American citizenship are comparatively small. Lack of citizenship 
does not offer protection from the draft, as many immigrant men found out 
during the Vietnam conflict, and non-citizens must pay taxes like everyone else. 
The naturalization oath that all would-be adult citizens must swear requires a 
renunciation of past allegiances, but in practice U.S. citizens can hold multiple 
passports (Ansgar Kelly 1991-92; Bloemraad forthcoming; Duckett 2000; Spiro 
1997). Thus, naturalization does not mean losing one's former citizenship.? 

Arguments about cost-benefit calculations and the declining significance of 
citizenship become even more problematic with a comparative lens, as they 
cannot explain the Canadian trend or the difference between U.S. and Canadian 
patterns. Many of the benefits of U.S. and Canadian citizenship — political rights, 
access to certain jobs, protection against deportation — are the same. If anything, 
citizenship has more instrumental value in the United States. All non-citizen 
permanent residents in Canada have access to social welfare benefits, and 
citizenship provides no advantage in sponsoring family members. In both countries 
naturalization requires a number of years of legal residence before applying (three 
in Canada, generally five in the United States); basic language ability (English in 
the United States, English or French in Canada); some knowledge of the country, 
its history and government; a fee ($200 in Canada, $320 in the United States), 
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and evidence of "good moral character."'? In both countries, criminal convictions 
are grounds to refuse citizenship." 


The Qualities of a Citizen: Individual and Group Characteristics 


A second approach argues that variations in immigrant citizenship stem from 
differences in the skills, resources, interests and aptitudes of immigrants. This 
framework is predominant in research on North America. Foreign-born residents 
are more likely to seek citizenship if they have higher levels of education, are 
middle aged, live above the poverty line and are proficient in English as a second 
language (Bueker 2005; Jasso and Rosenzweig 1990; Portes and Rumbaut 1996; 
Yang 1994). Researchers also speculate that immigrants from the same country 
of origin share characteristics that affect naturalization. Migrants from countries 
proximate to U.S. borders might see their move as "reversible" and be less likely 
to take up citizenship (Bueker 2005; Portes and Mozo 1985; Portes and Rumbaut 
1996). Those from democratic countries might be more socialized into civic 
values that favor citizenship, or conversely, prior experiences with undemocratic 
regimes might render U.S. or Canadian citizenship more attractive (Black 1987; 
Bueker 2005; Yang 1994). Less frequently, scholars suggest that nationals from 
certain countries hold cultural values antithetical to democratic engagement 
(Glazer and Moynihan 1963; Huntington 2004). 

While researchers debate why individual or group differences matter, statistical 
analyses clearly show that personal traits influence naturalization, and that net 
of individual-level variation, country of origin remains a significant predictor of 
citizenship. These findings are consequential because the mix of immigrants in 
Canada differs from those in the United States. 

Hispanic migration — from Mexico, Central and South America, and the 
Spanish-speaking Caribbean — dominates flows to the United States. About half 
of all legal immigrants arriving in the United States in the 1990s came from the 
Americas and Caribbean, while those from Asia, Europe or Africa represented 
only 31, 15 and 4 percent of the total, respectively (INS 2002:16, 21). Those born 
in Mexico comprised nearly 30 percent of all foreign-born U.S. residents in the 
2000 U.S. Census. 

Canada is home to fewer Hispanic immigrants. In 2000, only 7 percent 
of immigrants came from Central and South America, while the majority, 53 
percent, hailed from Asia and the Pacific. Europeans made up 19 percent of 
immigrant admissions and about 18 percent came from Africa and the Middle 
East (CIC 2002:8). 

Immigrant selection criteria also differ. Both Canada and the United States 
administer immigration systems that grant entry based on skills and resources, 
family ties or the need for asylum, but the relative proportion of immigrants in 
these three categories varies. In the late 1990s, two-thirds to three quarters of 
legal immigrants to the United States acquired their status through family ties 
(INS 2002:17). Employment-based permanent immigration accounts for about 20 
percent of numerically limited visas, and a smaller percentage of all immigrant 
admissions (Usdansky and Espenshade 2001), while refugees comprised 6 to 16 
percent of admission in the 1990s (INS 2002178 
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Canadian immigration relies more heavily on skilled migration by people who 
have skills or resources (such as investment capital) needed in the Canadian 
economy. Under the "point system," potential migrants score points for job 
skills, education, language ability, and other personal characteristics. Immigration 
authorities can grant permanent residency if an applicant's total points surpasses 
the government-set threshold. In the second half of the 1990s, 50 to 60 percent 
of all immigrants arrived as independent migrants (a figure which includes the 
dependents of the principal applicant), 26 to 36 percent entered under family 
reunification and about 13 percent came as refugees or special admissions (CIC 
2005).? Scholars such as George Borjas (1999) contend that a point system 
screens out individuals with low human capital, resulting in better outcomes for 
immigrants and the host society. To the extent that Canada's immigration policy 
selects individuals with better language skills and more years of schooling, we 
might expect these immigrants to be more likely to naturalize.'^ 

Space constraints prevent a thorough analysis of the effects of immigration 
policy and immigrant origins, but we have strong evidence that these do not 
explain the U.S.-Canada naturalization differences (Bloemraad 2002, 2006). 
Citizenship levels are consistently higher in Canada across all immigrant groups, 
even after controlling for country of origin and individual attributes such as 
English ability, length of residence and education. Mexican migration is the most 
striking and salient difference: Mexicans, who are more numerous in the United- 
States, are less likely to naturalize than most other groups and they constitute 
a larger proportion of the undocumented population. Re-calculating the U.S. 
naturalization level in 2000 by including only non-Mexican migrants who fulfill 
residency requirements, the proportion of citizens increases to 48 percent. Yet a 
similar adjustment to the Canadian figures, including only those who satisfy the 
length of residence requirement, raises aggregate naturalization even more, to 
84 percent in 2001. 


The Context of Reception: National Ideologies, Government Institutions 
and State Policy 


Contrary to North American approaches, research on Western Europe examines 
how institutional configurations, policy philosophies and nationalism affect 
migrants’ political incorporation (Brubaker 1992; Favell 2001; Ireland 1994). Put 
in the language of social movement research, a country's political opportunity 
structure determines access to the political system (Koopmans and Statham 
1999). Students of citizenship find that immigrants’ attributes hold less relevance 
after considering the characteristics of their adopted home (Clarke, van Dam and 
Gooster 1998; de Rham 1990; Soysal 1994). The average naturalization rate in the 
core European Union and European Free Trade Area countries — calculated as the 
annual number of naturalizations over the non-citizen foreign population — varies 
widely. Countries such as Germany, Ireland, Italy and Switzerland recorded an 
annual naturalization rate of less than 1 percent in the early 1990s, whereas the 
rate was about 6.5 percent in the Netherlands and Sweden in 1994 (Clarke, van 
Dam and Gooster 1998). A similar calculation in North America produces a rate of 
about 3 percent in the United States and 10 percent in Canada.!5 
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The puzzle consequently shifts from “Why do certain immigrants naturalize 
while other do not?” to “What causes large cross-national variation in political 
incorporation?” Researchers respond that differences in laws and bureaucratic 
procedures generate higher hurdles for would-be citizens in some countries 
(Brubaker 1989; de Rham 1990; Weil 2001), and that legal differences arise from 
divergent national ideals and policy philosophies (Brubaker 1992; Favell 2001; 
Ireland 1994). 

Such an account should not apply to Canada and the United States. Both 
countries fall into the category of “liberal” welfare states (Esping-Andersen 1990) 
and are considered “classic countries of immigration.” (Brubaker 1989; Joppke 
1999) Citizenship is promoted through easy naturalization policies, and the new 
citizen is legally indistinguishable from the native-born." According to Laczko 
(1994), the two countries are among the few in the world that successfully 
combine heterogeneous populations and a high standard of living. Canada also 
shares with the United States a more negative migrant history: past policies of 
Asian exclusion, internment of citizens of Japanese background, and up until the 
mid-1960s, immigration systems that favored white Europeans. 


A Model of Structured Mobilization 


To understand immigrant citizenship, we need to conceptualize immigrants' 
naturalization as embedded within a larger institutional and policy environment. 
Immigrants’ individual and group attributes clearly affect their interest in and 
ability to acquire citizenship. But political incorporation is also fundamentally 
influenced by the receiving society. Synthesizing insights from North American 
and European scholarship produces a model of structured mobilization, where the 
host country’s political institutions, administrative bureaucracies and integration 
policies facilitate or hinder immigrant citizenship. 

Theoretically, structured mobilization draws on  institutionally-minded 
comparative-historical research and literatures on social movements and political 
process. At first blush, these literatures seem antithetical. Institutional analysis 
concentrates on continuity over time, pointing out how government policies, 
institutions and social norms pattern the actions and understandings of individuals 
and organizations (Clemens and Cook 1999; Hirsch 1997; Thelen and Steinmo 
1992). Social movement research examines opposition to institutionalized politics, 
concentrating on how political challengers mobilize to win concessions or alter the 
political order (Andrews 2001; McAdam, Tarrow and Tilly 2001; Tarrow 1994). Yet 
the theoretical gap is small because the central concepts identified by McAdam, 
McCarthy and Zald (1996) — political opportunity structures, organizations (or 
“mobilizing structures”) and framing dynamics — overlap with central arguments 
in institutional analysis. '® 

Melding the two approaches produces significant benefits because immigrant 
political incorporation involves normal politics, which are heavily structured 
by institutional environments, and immigrants’ status as political outsiders or 
challengers. Considering the predicament of Latinos in New York City, Jones- 
Correa concludes, “To enter into politics, then, immigrants must overcome the 
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inertia of the political system to be taken seriously as political players. Immigrants 
have to mobilize themselves first..." (1998:65) Such mobiliZation must be 
conducted not only vis-à-vis the mainstream political system, but also within 
the community. Immigrant leaders and advocates must convince compatriots 
to become involved or to shift attention from private economic concerns or 
homeland politics to domestic political issues. 

A fruitful synthesis requires two further steps: the elaboration of mechanisms 
that link institutional and policy structures to people's everyday thoughts and 
activities, and the identification of mechanisms that let individuals modify 
structures. Under many institutional or political opportunity arguments, immigrants 
and minority communities can do little once they find themselves in one nation- 
state or another. German ethnic conceptions of nation, French universalistic 
citizenship or British multicultural race relations become both explanation and 
destiny for immigrants' political trajectories. We hear little about how immigrants 
negotiate these constraints or how national structures reproduce patterns of 
citizenship and incorporation. What are the mechanisms through which host 
societies shape immigrant lives? 


Figure 3. Immigrant Political Incorporation through Structured Mobilization 
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To conceptualize this interaction, | simplify the relevant actors, as shown in 
Figure 3. Political incorporation involves actions taken by ordinary immigrants 
and the mobilizing work done by community organizations and leaders. Outside 
the immigrant community, mainstream actors — such as political parties, unions, 
churches and other groups - can also influence and mobilize newcomers. All of 
these actors play a role in the dynamic of immigrant political incorporation. 

At the same time, government institutions and policies affect mobilization 
activities. They can do so directly, by influencing immigrants, or indirectly by 
affecting the goals, understandings and resources of community organizations, 
leaders and mainstream actors. State influence occurs through at least two 
analytically separate dynamics. One is interpretative: host countries' institutions 
and policies affect newcomers’ and native-born citizens’ understandings of 
immigrants’ legitimate political standing in a country. Are foreign-born residents to 
be welcomed or shunned? Should they be treated as future citizens or temporary 
labor? Does national membership include diverse origins and cultures? What 
accommodations, if any, must newcomers and native-born residents make in 
remaking the national and civic community? A second dynamic is instrumental, 
affecting people's ability to participate and mobilize. Do public programs or policies 
provide material resources, suchas funding, meeting space or technical assistance 
to immigrant communities and organizations? Does the state encourage different 
levels of government or private actors to support newcomers? Are there policies 
in place to materially affect immigrants’ ability to acquire citizenship, such as 
language training? Institutions and policies affect understandings of citizenship 
and the ability to participate.'? Once mobilized, immigrants — and those opposed 
to their political projects — can reinforce, undermine or modify existing policies or 
institutional practices via feedback loops (Pierson 1993, 1994). 


The Social Nature of Citizenship 


Scholars of contentious and routinized politics repeatedly document how civic 
and political engagement occurs most readily through personal contact and 
social interaction, rather than impersonal appeals or spontaneous engagement 
(e.g., Gould 1995; McAdam, McCarthy and Zald 1996; Rosenstone and Hansen 
1993). Verba and colleagues (1995) demonstrate that involvement with voluntary 
associations, workplaces and religious institutions teaches civic and political 
skills, while Putnam (2000) argues that social capital drives civic and political 
engagement, as well as trust in the political system. 

These studies focus almost exclusively on native-born citizens, but there 
are strong theoretical and empirical grounds to believe immigrant political 
incorporation also depends on interpersonal and organizational contact. Students 
of immigration document the importance of intra-ethnic ties, and the social 
capital that results, for migration (Massey 1990; Palloni et al. 2001), educational 
outcomes (Zhou and Bankston Ill 1998; Portes 1995) and success in the labor 
market (Aguilera and Massey 2003; Sanders, Nee and Sernau 2002). When 
migrants need to interact with legal systems and government bureaucracies, 
dense interpersonal ties facilitate the diffusion of information about legalization 
programs (Hagan and Baker 1993) and provide assistance with naturalization 
applications (Alvarez 1987; Bloemraad 2006). 
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Scholars have devoted scant attention to the role of community organizations in 
incorporation, but evidence suggests that informal associations, such as hometown- 
based soccer teams, formal voluntary organizations and even for-profit ethnic 
businesses (such a notary public or newspaper) are crucial conduits of information 
and assistance with political incorporation. Alvarez (1987) reports that community 
centers in Los Angeles and Chicago, and the International Ladies Garment Workers' 
Union in New York City, act as intermediaries between immigrants and the (then) 
Immigration and Naturalization Service. They assist Latino immigrants with the 
application form and studying for the civics exam. Bloemraad (2006) shows how 
for-profit and non-profit organizations encourage naturalization among Portuguese 
and Vietnamese migrants in Toronto and Boston by providing services and 
generating norms favoring citizenship. Gilbertson and Singer (2003) report that 
following passage of the Welfare Reform Act, ethnic businesses, professional 
groups, social associations, social service providers and even Dominican political 
parties mobilized to facilitate naturalization in New York City. All three accounts 
document the role of the ethnic media in providing information about citizenship, 
while Bloemraad and Gilbertson and Singer also identify community leaders as 
important catalysts. A robust organizational infrastructure also facilitates political 
mobilization because politicians see such groups as an efficient way to reach many 
potential voters (Marwell 2004; Parenti 1967). 


The Nested Nature of Political Incorporation: National Policies 


The social dynamics of political incorporation differ little in the United States and 
Canada. What varies — and helps explain the citizenship gap — is the degree to 
which actors and institutions outside the immigrant community help newcomers. 
Especially important are the specific policies of integration and diversity that greet 
newcomers. Some have contrasted American and Canadian political cultures to 
explain cross-national differences in politics and social behavior (e.g., Lipset 1986, 
1990), but cultural accounts turn a blind eye to competing ideological traditions 
within a country, they often lack a causal mechanism explaining how culture 
produces specific outcomes and they assume historical continuity over long 
periods, about 200 years in the case of Lipset's analysis.? Patterns of Citizenship 
in Canada and the United States had similar trajectories prior to the 1970s; if 
political culture plays a role, this culture changed over the past 40 years. | 
While this might be the case, the most concrete divergence in American 
and Canadian institutional environments lies at the policy level. Scholars who 
have paid attention to reception contexts in North America mostly emphasize 
border policy, the regulations that dictate entry into U.S. or Canadian territory. 
Thus Portes and Rumbaut (1996) use the intersection of U.S. immigration 
policy, societal discrimination, ethnic community strength and immigrants' 
human capital to understand between-group differences in education and 
language acquisition among the children of immigrants. The argument 
here takes inspiration from this approach, but goes further by asking which 
American practices and institutions provide a common fate for U.S. immigrants 
as compared to compatriots living in Canada once they cross the border. It 
also asks how government policies help constitute (or undermine) the strength 
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of other variables affecting incorporation, such as the cohesion of ethnic 
communities or the importance of human capital. 


The Contours of Canadian and U.S. Policy Environments 


Three sets of policy differences stand out. The first is the different perspectives 
of the bureaucracies overseeing citizenship. In Canada, Citizenship and 
Immigration Canada (CIC) is a stand-alone federal department. In the United 
States, the Immigration and Naturalization Service (INS), part of the Department 
of Justice, oversaw naturalization until March 1, 2003. Since then, the Bureau of 
U.S. Citizenship and Immigration Services (USCIS), housed in the Department of 
Homeland Security, administers immigration and citizenship adjudications. 

Comparing the two countries' bureaucracies, we find that CIC generally 
promotes citizenship and integration, while an ethos of law enforcement 
loomed large over the former INS and imbues the overall mission of Homeland 
Security. According to Canadian officials, immigrants are free to make their own 
choices regarding citizenship, but government seeks to sway such decisions. 
"The Canadian policy is that we are a country of immigration. The only way an 
immigrant can influence how the country is run, to do that, you need to vote. 
And you can only vote if you become a citizen. The Citizenship Act ensures the 
facilitation of citizenship, so that immigrants can exercise their voting capacity. 
That is the main policy thrust."?' In contrast, former INS Commissioner Doris 
Meissner describes a very different attitude, where "the dominant culture of 
the agency... [was] rooted in a view of immigration as a source of security and 
law enforcement vulnerability more than of continued nation building." (2001:2) 
Service functions, such as administering naturalization, often take second place 
to border control (Andreas 2000; Magaña 2003). The move of citizenship services 
to the Department of Homeland Security continues this tradition. 

Political and bureaucratic decisions regarding policy and priorities drive these 
decisions, not legal structures. CIC has adopted a more aggressive policy of 
citizenship promotion as part of Canadian nation-building, while geographic 
location reduces border control worries. The Canadian Citizenship Act does 
not require citizenship promotion, and in the United States, the “promotion of 
naturalization programs... is clearly permitted, if not actually mandated" under 
the U.S. Immigration and Naturalization Act (North 1985:58). 

A second policy area is newcomer settlement. Settlement policies aim to ease 
migrants' integration. Programs might include language training, employment 
counseling or even social assistance. The American federal government tends to 
see immigrant settlement as a private concern; USCIS oversees no settlement 
programs and it disburses no grants for community-based integration programs. 
Only officially recognized refugees, historically those fleeing Communist regimes, 
receive benefits under refugee resettlement programs (Zucker and Zucker 1989; 
Hein 1993). Everyone else must rely on their own resources and the assistance 
of friends, family and the immigrant community. 

The Canadian state views settlement and incorporation as a public issue 
necessitating some government intervention and support. Currently CIC oversees 
three programs open to immigrants or organizations helping immigrants. Language 
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Instruction for Newcomers to Canada (LINC) provides basic language instruction 
in English or French to recently arrived immigrants, with a goal'of serving about 
45 percent of adult immigrants; the Host Program matches Canadian volunteers 
with new immigrants to provide early settlement support; and the Immigrant 
Settlement and Adaptation Program (ISAP) funds community organizations to 
provide reception, orientation, translation, interpretation and referral services, as 
well as some counseling, especially pertaining to employment. 

Finally, the two nations differ in their policies on ethno-racial diversity. Both 
countries embrace multiculturalism, but the American variant centers on 
race and civil rights, while the Canadian version favors ethnicity, recognition 
and integration. 

As Judith Shklar (1991) has noted, one cannot understand U.S. citizenship 
without taking into account the legacy of slavery and the concomitant sense 
of hierarchy and standing in American society that is predicated on race.? In 
the private sphere, newcomers perceive widespread acceptance of multiple 
cultural identities and "hyphenated" Americanism, but ethnic or immigrant- 
centered claims have limited political salience in the public arena. The Civil Rights 
movement of the 1950s and '60s and subsequent movements for native rights, 
Chicano rights and Asian-American rights forced the U.S. government to create 
remedial policies such as the Voting Rights Act, anti-discrimination measures 
and affirmative action programs. One result is that political understandings of. 
multiculturalism are couched in broad classifications of white, black, Asian- 
American and Latino. 

Because the United States has no formal policy on /mmigrant diversity, the 
patchwork of civil rights legislation and minority policies create incentives 
for community leaders to define communities in racial terms (Joppke 1999; 
Skerry 1993). Race-based multiculturalism is a salient political tool, but it 
resonates poorly with ordinary immigrants (Bloemraad 2006). As Joppke 
suggests, "Race is different. Its content is not a positive heritage (however 
modified) transplanted into the new society, but the negative experience of 
oppression at the hands of the receiving society. Its direction is not integration 
into a (white) majority... but restitution for harm and public existence as a 
protected, separate group." (1999:143) 

In Canada, conflict over diversity centered historically on ethnicity rather 
than race. The Canadian climate was not amenable to a plantation economy 
built on slavery? British conquest of New France in 1659 produced a country 
divided into two linguistic and cultural groups, French Catholics and British 
Protestants. The Constitution of 1867 sought accommodation in a federal state 
with special language and religious guarantees. This helped mediate conflict, 
but inter-ethnic inequalities — described by John Porter (1965) as a "vertical 
mosaic” — spawned threats of Quebec separation in the 1960s and 1970s at 
the same time that the United States dealt with racial conflict. The federal 
response to this threat, as well as mobilization by Canada's “other” ethnics, led 
to official multiculturalism in 1971. 

Canadian multiculturalism provides symbolic recognition and some public 
funding and technical support to communities. The importance of multiculturalism 
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is by no means universally accepted, but it has undermined the view that Canadian 
immigrants and their children are subordinate to the descendents of British and 
French colonists (Breton 1986; Abu-Laban and Stasiulis 1992). It tends to identify 
groups by national (or ethnic) origin, labeling people as Vietnamese-Canadians 
rather than Asian Canadians. Multiculturalism consequently bolsters immigrants' 
sense of recognition and resonates well with their self-image as culturally distinct. 
It also provides a language to frame grievances when groups feel that they have 
not received the respect they deserve.” 

Importantly, the timing of Canada’s investment in settlement and its shift 
to multicultural citizenship occurred precisely when levels of naturalization in 
Canada and the United States parted course. Federal government spending in 
these areas, while uneven and subject to cutbacks, grew rapidly over the 1970s 
and 1980s, and then modestly into the mid-1990s. In the 1966-67 fiscal year, the 
Citizenship Branch of the Department of the Secretary of State gave $88,150 to 
12 groups concerned with settlement and ethno-racial citizenship (Pal 1993: 108). 
A subsequent programmatic overhaul and the assignment of multiculturalism to 
the Citizenship Branch generated significant expansion. In 1970 more than 100 
new staff and consultants were hired by a unit that previously employed 110 
employees. In the 1974-75 fiscal year the branch gave $2.65 million in grants to 
648 groups (Hawkins 1988:31 1, 365; Pal 1993:189, 192). The federal Department 
of Manpower and Immigration, which oversaw immigrant entry and labor force 
insertion, estimated that about half of all new migrant workers visited one of 360 
Canada Manpower Centers for job counseling and training (Hawkins 1988:339). 
In fiscal year 1987-88, the multiculturalism program distributed more than $20 
million in grants, with almost $6.5 million directed to "Citizenship and Community 
Participation." (Pal 1993:200) In fiscal year 1996-97, the federal government 
allocated almost $60 million for multiculturalism programs. 

The late 1990s brought serious cutbacks to multiculturalism and stagnation to 
settlement initiatives. The multicultural budget was cut in half, to just under $30 
million in 1997-98, while settlement funding failed to keep pace with inflation. 
From the 1995-96 fiscal year to 2003-04, the budget of the Settlement Branch went 
from $166 million to $175 million, with the proportion of CIC's budget devoted to 
settlement falling from 46 percent to 37 percent (CIC 1995; House of Commons 
2003). Of the total authorized in 2003-04, CIC allocated $100 million to LINC, 
$2.8 million to the Host program, and $30 million to ISAP (House of Commons 
2003:2). Thus in 2004, the government spent about $1,500 per new immigrant 
(House of Commons 2003:6), an amount critics claimed was inadequate. 

We have no comparable figures for the United States because the federal 
government does not run such programs for non-refugee immigrants. The 
budgets of immigrant-serving organizations clearly show their dependence 
on funds provided by federal, state, county and local agencies in the form 
of contracts or grants (Bloemraad 2006:169-70; Cordero-Guzmán 2005), but 
these programs usually do not target specific immigrant concerns such as 
language and civics training or translation services. Non-refugee migrants 
receive much less government support, and the support they do receive tends 
to fluctuate with political change. Large scale social spending, envisioned by 
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President Johnson's Great Society initiative, did not survive the 1970s. Even 
local efforts to support citizenship deal largely with crises father than long- 
term services. For example, the Welfare Reform Act of 1996 spurred the state 
of Massachusetts to provide $2 million in funding to community agencies 
assisting with naturalization, but the appropriation was authorized for only 
three years. Lobbying by immigrant advocates continued some state support 
after the three-year period, but in 2002 Gov. Jane Swift vetoed the allocation 
of $750,000 in a push to balance the state budget. Other states followed 
a similar pattern. Many made no investment in naturalization, even though 
citizenship was a requirement for various public benefits. 


The Mechanisms of Structured Mobilization 


These three sets of policies channel immigrants' political incorporation. Most 
obviously, government funding and grants change incentives and newcomers' 
material resources, as does the provision of meeting spaces, technical assistance 
and other help. We thus find that communities that receive more support — as 
immigrant groups in Canada do relative to those in the United States, and as 
refugee communities in the United States do relative to other newcomers - build 
larger organizational infrastructures and have more diverse sets of community 
organizations (Bloemraad 2005). These organizations in turn provide instrumental 
help with citizenship and political integration. Settlement and diversity policies can 
also encourage legislative committees, civil servants and elected representatives 
to seek immigrants' input on programs directed at them, thereby opening access 
to the political system. 

Yet a dense organizational infrastructure does not automatically lead to political 
activity. An institutionally complete immigrant community (Breton 1964) — one 
that has a well-developed organizational infrastructure — might instead insulate its 
members from the political mainstream into ethnic "ghettos."?* Political mobilization 
requires motivation in addition to capacity. The policies highlighted above also 
generate interpretative effects making citizenship more attractive to immigrants. 

Mettler (2002:352) suggests three interpretative mechanisms. First, policy 
beneficiaries are more likely to feel that politics are relevant. Second, those 
targeted by policies can become the focus of strategic mobilization, increasing 
feelings of personal and political efficacy. Finally, being the recipient of 
government attention may enhance awareness of one's rights, duties and 
obligations. Applied to immigrants, proactive policies teach the importance 
of politics, make immigrants targets of political mobilization by mainstream 
actors, and increase newcomers' sense of political legitimacy. The lack of such 
policies or policies seen as punitive communicates the irrelevance of political 
membership or suggests second-class citizenship. 

Positive interpretive effects should be more apparent in Canada than the United 
states. In-depth interviews with U.S. immigrants suggest that while some new 
citizens strongly value political membership, many view citizenship as a means of 
ensuring rights, not as an invitation or responsibility to engage in decision-making. 
Gilbertson and Singer discuss how some Dominican immigrants felt coerced into 
citizenship following welfare reform, "viewing [citizenship] narrowly as a set of 
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legal rights, rather than as a change of affective orientation or allegiance." In my 
interviews with Vietnamese and Portuguese migrants similar themes emerged 
(Bloemraad 2006). For example, Thien, a Vietnamese refugee in Boston with a 
primary education and limited English language skills, explained that American 
citizenship provides security, access to economic opportunity, a passport for 
travel and the responsibility of obeying American laws: 


The good thing about the US. is it has a very strong and 
effective law system... [Also] life in the U.S. is flexible. If you 
go to work, you will have money. You have all the necessities 
that you need. But if you don’t work then you won't have any 
money... | don't have much more to say because my whole 
life surrounds only these things. If | work hard then | can buy 
a car and a house. | also have the freedom to travel. But if | 
commit a crime then | will go to jail. 


Citizenship does not necessarily include a strong participatory element. Thien 
does not engage in political activities nor does he vote, “| have no time... | don't 
know who is good or who is bad." 

In contrast, more Canadian immigrants see a link to government that entails 
rights, responsibility, participation in the polity and emotional attachment. Hiep, 
a Vietnamese Canadian senior living in subsidized housing, votes regularly even 
though his language skills are limited and he is relatively new to the country. He 
speaks confidently about Canadian multiculturalism: 


To my understanding, Canada is a country with a true 
meaning of multiculturalism. It is very good and suitable 
to this country. Why? The Canadian government listens to 
the voice of the people's will, the requests, and helps to 
respond. They not only respect the cultures of the different 
community backgrounds, but they always try to develop the 
culture and tradition of that community more. 


Such interpretative effects are also evident among immigrant community leaders. 
Hai, a middle-aged Vietnamese Canadian who has been active in organizing 
various community initiatives, explains: 


Here we try to provide an opportunity for everybody. It 
doesn't matter whether you have a language ability or you 
do not have a language ability. You, as a person, you have 
the right to live, you have the right to access, to service, to 


government. 


Discourses around the public management of diversity influence leaders' 
ability to speak up. On one level, multiculturalism and settlement policies 
provide discursive resources to make claims on government or fellow citizens, 
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much in the way that social movement scholars talk about framing strategies. 
On another level, national beliefs about diversity and the appropriateness 
of political intervention structure immigrants' conception of their interests, 
identities and understanding of what is politically possible or preferable. 
Public ideologies thus act as cognitive maps or moral schemas (Clemens and 
Cook 1999; Zucker 1991). 

The material and interpretative effects of government policy help explain why 
Canadian citizenship levels are higher than in the United States. While some of the 
U.S. decline stems from larger cohorts of recent and undocumented immigrants, 
contemporary immigrants also wait longer to naturalize and are less likely to 
pursue citizenship, in part due to the limited resources available to help them and 
an ambiguous message about government's interest in new citizens. In Canada, 
while large cohorts of recent migrants should also have depressed citizenship 
levels, bureaucratic orientations, settlement programs and multiculturalism 
policy encourage citizenship. 

Further support for this approach emerges in comparing refugees in the 
United States - newcomers who receive state assistance — with the majority of 
family and economic migrants who receive little public support. Refugees are 
one and a half times more likely to become citizens than eligible legal immigrants 
with similar socio-economic and demographic characteristics (Fix, Passel and 
Sucher 2003:6). Although refugees have clear political reasons to participate, 
the support given to them by the U.S. government probably plays a significant 
role in explaining Cubans' high levels of political participation (Moreno 1996). The 
prevalence of Vietnamese mutual assistance associations can be linked directly 
to government disbursements (Hein 1997). Ethnographic research finds that in 
the health care system, Latino immigrants who are official refugees benefit not 
only from having medical services paid by the government, but medical staff also 
see them as more deserving and treat them as citizens who can make demands; 
non-refugees are discouraged from making claims on American institutions 
(Horton 2004). The dynamics of structured mobilization, applied here to cross- 
national variation, can also illuminate citizenship gaps between migrant groups. 


Conclusion: Implications for Future Research and Theorizing 


This article makes the claim that civic and political attachment is strengthened by 
public funding and programmatic support, and it is undermined by the absence 
of such support. For example, government provision of English language classes 
has a material impact on a newcomer's ability to satisfy the language requirement 
for citizenship, and it also communicates immigrants' importance to the receiving 
society, creating the foundation for political membership. Beyond immigrants' 
own skills and interests, we must examine whether those outside the immigrant 
community provide newcomers with a helping hand. 

These findings require researchers to consider levels of analysis beyond 
individual or group attributes. In essence, | suggest a "European" turn to the 
study of citizenship in traditional immigrantreceiving countries. American 
scholarship has largely ignored the role of the state in immigrant integration, 
except for how state policy determines entry. As Nathan Glazer writes, "the 
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settlement, adaptation, and progress, or lack of it, of immigrants is largely, in 
the U.S. context, up to them.” (1998:60) | believe that state action — or inaction 
- influences immigrants' potential to achieve citizenship. Scholars of immigration 
in Europe demonstrate how legal structures and national ideologies channel 
incorporation, but most have assumed that such differences would be absent in 
North America, given broad similarities between the United States and Canada. | 
suggest that we must consider not only ideal-typical citizenship regimes, but also 
specific policies and their effects. 

The argument here extends a nascent research agenda examining institutional 
effects on immigrants' integration, whether political (Ireland 1994; Jones-Correa 
1998; Soysal 1994), economic (Reitz 1998) or linguistic (Portes and Rumbaut 
1996:207-31). In the behavioral tradition, institutional factors merely modify 
the costs and benefits individuals weigh when making decisions. Structured 
mobilization takes seriously the social nature of incorporation and pays particular 
attention to processes of mobilization and meaning-making which alter individuals’ 
perception of costs and benefits. 

Finally, this approach cautions against a whole-hearted adoption of transnational 
or postnational paradigms. Forces of globalization clearly make the world a more 
interconnected place and facilitate the ability of individuals, businesses and 
organizations to be active in multiple countries. In some cases, human rights 
norms and globalization force states to be more flexible toward citizenship and 
belonging (Hollifield 1992; Jacobson 1996; Soysal 1994). However, the everyday 
experiences of immigrants remain grounded in specific societies, societies 
that are fundamentally shaped by particular government policies. A model of 
structured mobilization shows how nation-states continue to channel the fortunes 
of migrants, despite countervailing transnational or supranational forces. 


Notes 


1. The Boston-Toronto study relies on statistical data on immigrant citizenship 
in Canada and the United States, documentary evidence from community 
organizations and government reports, and 151 in-depth interviews with 
community leaders, government officials and ordinary Portuguese immigrants 
and Vietnamese refugees (Bloemraad 2006). 


2. People born abroad to American citizens — and who are thus American 
citizens at birth — are not included. The data prior to 1920 reflect only adult 
male naturalization: enumerators did not collect systematic information 
on women and minors since their status largely depended on their 
husband or father. 


3. Most North American surveys do not ask respondents’ immigration status, 
so researchers use foreign birth to identify immigrants. 


4. Unauthorized immigrants, most of whom overstay temporary visas, also 
reside in Canada. Recent news reports put the number at about 200,000, 
but lacking reliable estimates from government or academics, the figure 
is purely speculative (Jimenez 2003). Unauthorized migrants certainly 
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10. 


Ate 


12 


constitute a smaller percentage of the overall foreign-born population than 
in the United States. 


In the year 1960, 265,000 individuals became legal permanent residents 
of the United States. Two decades later immigrant admissions doubled to 
530,000; and in 2000 they stood at 850,000 for the year (INS 2002). 


An immigrant's length of residence correlates strongly with citizenship 
acquisition, perhaps because time in a country reduces the costs of 
citizenship and makes the benefits more apparent (Jasso and Rosenzweig 
1986; Yang 1994), or because it marks assimilation and growing attachment 
to the receiving society (Evans 1988; Liang 1994). 


These figures are calculated using the Public Use Microfile Sample data from 
the 1970 (Form 1, state), 1980 (5 percent), 1990 (5 percent) and 2000 (5 
percent) U.S. censuses available through the Integrated Public Use Microfile 
Sample project (www.ipums.org). Unless otherwise indicated, all U.S. census 
data discussed below come from these sources. 


These figures are calculated using the 20 percent sample files of the 1971, 
1981, 1991 and 1996 Canadian censuses, accessed through on-site visits to 
Statistics Canada, Ottawa. | have not been able to do a similar calculation 
with 2001 Census data. Unless otherwise indicated, all Canadian census 
data come from these sources. 


The perception among many immigrants that U.S. naturalization requires 
a figurative, or even literal, stomping on one’s flag of origin might deter 
some (Jones-Correa 1998). However, this perception appears more rooted 
in symbolic understandings of American citizenship than concrete fears of 
material loss. The dramatic rise in sending countries’ willingness to tolerate 
dual citizenship appears to influence immigrants more than U.S. policy 
(Jones-Correa 2001; Bloemraad forthcoming). 


The U.S. residency requirement falls to three years for spouses of American 
citizens and can be waived entirely for those who have seen combat in the 
U.S. military. Both countries offer easier rules for the elderly. 


Refusals of citizenship applications — based on inadequate language skills, 
lack of civic knowledge or other problems fulfilling citizenship requirements 
— appear roughly similar in Canada and the United States. From the 1960s 
through the 1980s about 2 percent of applications in the United States and 
3 percent in Canada were formally rejected (North 1985:54). In the 1990s, 
U.S. refusal rates climbed to 15 percent as many immigrants applied for 


citizenship without first having fulfilled the residency requirement. In Canada, 
refusals stood at about 10 percent. 


U.S. policy establishes annual quotas across a selection of preference 
categories, but some individuals, such as the spouse of a U.S. citizen, fall 


outside the quota and can gain admission regardless of the number of visas 
disbursed in a given year. 


17. 


18. 


20. 


21 


22! 


2. 


24. 
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. Special admissions include humanitarian cases not falling under the refugee 


category, home caregiver admissions and other specialized programs. 


. For example, Camarota (2001) suggests that a decline in immigrants' 


educational attainment relative to the American native-born depresses 
immigrants' propensity to naturalize. 


. "Political opportunity structure" is generally understood as persistent 


dimensions of the political environment that affect people's willingness to 
undertake collective action (Tarrow 1994). 


. The North American calculation is mine, using data from the INS Yearbooks and 


CIC Statistical Reports for the number of naturalizations and the U.S. Current 
Population Survey and Canadian Census for the noncitizen foreign stock. 


The exception to this rule is the U.S. presidency which is restricted to those 
born on U.S. soil. This stands in contrast to Canada where foreign-born Prime 
Ministers have held office. 


Political opportunity structures, the term for the "macro" context in which 
social movements operate, replicate the heart of institutional analysis. Social 
movement framing processes - the use of ideas and sentiments to motivate 
individuals — mirror institutionalists' interest in cognitive schemata and notions 
of legitimacy. Only mobilizing structures do not find a parallel in institutional 
analysis, but students working in this tradition readily acknowledge the role 
of organizations in drawing together individuals with similar interests and in 
advancing such interests through collective mobilization. 


. This formulation is indebted to Suzanne Mettler's (2002) account of how the G.I. 


Bill fostered civic and political participation among veterans of World War II. 


Joppke (1999), for example, suggests that the characterization of Germany 
as a country of ethnic nationalism overstates ideological and institutional 
durability. German citizenship law is relatively restrictive, but asylum policies 
are generous. Multiple centers of policy-making in Germany — federal vs. 
local, parliament vs. the courts - muddy the ethnocentric portrait painted by 
many scholars. See also Skocpol (1992) on the lack of causal mechanisms in 
political culture accounts. 


Author's interview with senior CIC official, June 13, 2001. 


See also Smith (1997) on "ascriptive" American citizenship and Loury (2001) 
on racial stigma. 


Estimates of the number of African slaves in Canada vary, but likely didn't 
exceed a few thousand. As part of the British Empire, Canada automatically 
adopted British emancipation laws, culminating in the abolition of slavery 
throughout the Empire on July 31, 1834. 


However, Canadian multiculturalism provides community leaders with more 
limited symbolic weapons to attack racial discrimination. 
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25. Reviewing research on assimilation and transnationalism among five migrant 
groups, Morawska (2002) notes that the Chinese-Americán community in 
New York City has only a feeble voice in civic and political affairs, despite a 
highly developed institutional infrastructure. 


26. Immigrant leaders are also affected by the material effects of government 
policies. They are often direct beneficiaries of government grants because 
they are most likely to run the organizations receiving funding. This creates 
a classic policy feedback loop: ethnic leaders pay attention to mainstream 
politics, in part to ensure future resources, and when funding is threatened, 
they mobilize community members to oppose cuts or minimize their 
impact. 
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Labor Markets and Economic Incorporation among Recent 
Immigrants in Europe 


Irena Kogan, University of Mannheim 


The questions asked in the paper are whether and to what extent 
the employment situation among recent third-country immigrants 
differs across European Union countries and how it is related to 
these countries” labor market characteristics. The European Labor 
Force Survey data for the 1990s are used to disentangle the roles that 
the individual characteristics of immigrants, on the one hand, and 
the structural features of the receiving societies, on the other, play in 
the process of immigrants” labor market integration. The results of 
multilevel regression analyses confirm that in receiving countries with 
stronger demand for low-skilled labor, unprivileged immigrants are 
less disadvantaged at employment entry. Among men immigrants 
employment disadvantages are found to be lower in liberal welfare 
states marked by their flexible labor markets. 


Introduction 


Persistently high unemployment among unprivileged immigrants is one of the 
most urgent problems facing European Union countries today. Until recently 
studies on immigrant labor market incorporation dealt largely with individual 
determinants of labor market inclusion among immigrants coming from different 
source countries into one host society (e.g., for European research Bevelander 
1999; Fassmann et al. 1999; Heath and McMahon 2000; Kalter and Granato 
2002; Kogan 2004; Model 1999; Neels 2000). Research shows that upon arrival 
immigrants are disadvantaged in the host countries” labor markets, and their 
rates of return on foreign educational attainment and labor market experience 
are significantly lower (e.g., Chiswick 1978; Fishelson et al. 1980; Friedberg 
2000). With the passage of time in the host country, differences in human capital 
between the native-born population and immigrants narrow as immigrants 
learn the host country language, gain knowledge about the functioning of the 
host country labor market, and acquire local education and training (Chiswick 
1978). The assimilation trend is, however, sometimes disrupted by immigrants' 
reluctance to invest in the human capital specific to the host country, including 
the host country's language, due to their perceiving their stay as temporary 
(Bonacich 1972; Chiswick 2000; Dustmann 2000; England 1992; Heath and 
Ridge 1983). Not only may immigrants be reluctant to invest in specific host 


This paper draws on material in my book Working through Barriers to be published by 
Springer in 2007. Financial support from the Mannheim Centre for European Social 
Research (MZES) is gratefully acknowledged. For helpful comments I would like to thank 
Jackie Hagan and two anonymous reviewers. Direct correspondence to Irena Kogan, 
Mannheim Centre for European Social Research, University of Mannheim, P.O. Box, D- 
68131 Mannheim, Germany, +49-(0)621-1812813, +49-(0)621-1812803. E-mail: Irena. 
Kogan@mzes.uni-mannheim.de. 


© The University of North Carolina Press Social Forces, Volume 85, Number 2, December 2006 


698 * Social Forces Volume 85, Number 2 * December 2006 


country human capital, they might also differ from the native-born with regard to 
job search behavior, more readily targeting low-skilled, low-paid jobs (Kalter and 
Kogan 2003), which might result in their being trapped in the secondary labor 
market (Doeringer and Piore 1971). 

Another body of research shows that patterns of immigrant reception and 
labor market incorporation differ in various host countries, and that the structural 
characteristics of receiving societies might influence this process (Kogan 2003; 
Lewin-Epstein et al. 2003; Model etal. 1999; Portes and Rumbaut 1996; Reitz 1998; 
Reitz et al. 1999). In studies of three classic immigration countries, for example, It 
has been found that immigrant earnings are higher in Australia and Canada than 
in the United States (Reitz 1998). Model (1997) and Model and Ladipo (1996) have 
observed that immigrants fared better economically in New York than in London. 
Kogan (2003) has compared the integration of ex-Yugoslav immigrants in the 
Austrian and Swedish labor markets, finding evidence to suggest that Yugoslavs 
had higher labor force participation and lower unemployment in Austria than in 
Sweden. And Reitz et al. (1999) have shown that despite greater difficulties in 
accessing employment, guest worker immigrants in Germany enjoyed higher 
earning than immigrants in Canada. 

These comparative empirical studies have, however, anumber of shortcomings, 
not least due to data limitations. First, as a rule they contain quite a small number 
of macro-cases — normally only two or three countries — too few relative to the. 
number of explanatory macro-level variables, so that testing competing theories 
becomes difficult (Goldthorpe 1997; Lieberson 1991). Second, the studies 
comparing immigrant fortunes across various host countries usually do not 
include macro-level variables in their statistical modeling, and differences in 
immigrant fortunes observed across various destinations are interpreted in terms 
of institutional differences between the countries. Such an approach hardly allows 
for differentiating among the effects of single institutional or structural factors, 
but rather elucidates the influence of institutional packages. 

Without denying the merits of the research done thus far, the current study 
attempts to improve on the situation by avoiding both of the problems just 
mentioned. First, it contributes to and expands the discussion on the role of 
host societies’ institutional characteristics in immigrant economic integration 
by examining the labor market situation of immigrants in 14 European Union 
countries rather than merely two or three. Second, the proper names of countries 
are replaced by the values of theoretically relevant macro-level variables. In this 
way the study seeks to relate the variation in immigrants' labor market outcomes 
to differences in the social structure of the European societies. 

Previous studies that offered a framework for cross-national research have 
pointed to the significance of immigration policy (Borjas 1990, 1993) or labor 
markets in explaining the cross-national variation in immigrant labor market 
outcomes (Castles and Kosack 1985; Piore 1979: Sassen 1988, 1991). Portes and 
his colleagues (Portes and Bórócz 1989; Portes and Manning 1986; Portes and 
Rumbaut 2001) have added government policy and pre-existing ethnic community 
characteristics to the list of factors critical to the economic incorporation of 
immigrants and stressed the importance of the interaction among institutional 
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factors for channeling newcomers' economic success. The current study 
singles out the effect of labor market structure and regulation for the economic 
assimilation of recent immigrants. 

It is assumed in the study that common social processes underlie individual 
job allocation in different countries, and that the general mechanisms behind 
existing immigrant disadvantages in the labor market are basically the same in 
all countries analyzed here. However, the institutional and structural conditions 
under which decisions are made by individual actors — be they immigrants looking 
for employment in a new country or local employers deciding whether to hire an 
immigrant or a native worker — can vary across the countries. The main focus of 
the study is thus on the interplay between the labor market conditions in which 
decisions are made and the aims, resources and mechanisms that guide the 
behavior of individual actors. 


Labor Market Institutions and Immigrants' Economic Assimilation 


What are the hypothetical relationships between the labor market characteristics 
of host societies and immigrants’ employment chances? It is argued here 
that the degree of labor market flexibility in a given country likely influences 
employers’ decision making when hiring workers, particularly immigrants, 
because in highly protected labor markets employers are faced with potentially 
higher firing costs. Research suggests that the response of employers in labor 
markets with higher firing costs is to set higher productivity expectations for 
potential employees (Blau and Kahn 1999; Gangl 2003; Giesecke and Groß 
2003). Along with indicators of productivity (i.e., education or training), ascriptive 
characteristics (gender or immigrant status) also come into play. Thus, when the 
need for a “perfect” match is acute, as in higher-status positions, there is likely 
to be a higher risk that statistical or error discrimination! practices intervene in 
the screening process, causing employers to more readily act on prejudices. As 
a result, given the constraints of strict employment protection legislation (EPL), 
employers’ “favorization” of native-born workers (as opposed to immigrants) 
for higher status jobs appears to be quite rational. Hence, one might expect 
immigrant disadvantages upon employment entry to be larger in countries with 
stricter labor market regulation (Hypothesis 1). 

As countries systematically vary with regard to labor market regulation, 
immigrants’ employment chances should differ between two institutional regimes 
or syndromes: the syndrome of mobility, openness, flexibility and inequality 
and the syndrome of protection, closure, equality and rigidity.” (Engelen 2003) 
The first syndrome refers to countries classified by Esping-Andersen (1999) as 
liberal welfare regimes (e.g., UK, Ireland).? These are characterized by high labor 
market flexibility, weak, decentralized industrial relations and market-based social 
insurance. The second syndrome includes both conservative and social democratic 
welfare regimes with their more segregated, segmented and rigid labor markets, 
high labor costs (especially in conservative welfare countries such as Germany, 
Belgium, and France), rather compressed wage structures, and employment- 
based or universal social insurance (in social democratic welfare countries such as 
Sweden and Denmark). Thus in liberal welfare regimes, as opposed to conservative 
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and social democratic welfare regimes, the risk of unemployment among recent 
immigrants is expected to be lower (Hypothesis 2). 

At the lower end of the occupational hierarchy, with its low-skilled, low-paid, 
unstable jobs, often shunned by the native-born, hiring operates more along 
profit-maximization lines, even in countries with otherwise strict labor market 
regulation. For immigrant employees this means more equal treatment at the 
entry point with less or even no discrimination, but often at the price of discounted 
human capital (Cain 1976). Thus, in countries with stronger demand for unskilled 
or low-skilled, secondary labor market jobs, recent immigrants will potentially 
have higher employment entry chances, so that overall immigrant employment 
disadvantages should be smaller (Hypothesis 3). 


Research Methodology 


To examine the unemployment risk of recent immigrants in the 1990s, the 
European Union Labor Force Survey (EULFS) data set for the period of 1992- 
2000 is utilized (Eurostat 1998). The EULFS database provides standardized, 
cross-sectional information on labor force participation, employment and 
unemployment, core demographic and migration information, and educational 
background. It is also valuable for its large sample sizes, ensuring sufficient 
representation of immigrant populations. The data were centrally compiled by ` 
Eurostat, the statistical office of the European Union, from Labor Force Surveys 
conducted in the second quarter of each year by individual E.U. member states. 
Due to some data deficits the analysis has been limited to selected years and 
selected variables. Thus, Italy has been excluded due to unreliable immigrant- 
related information. Data on educational attainment is lacking from several 
countries for the year 1998 when Eurostat introduced a new classification of 
educational levels. The analytic strategy has been to combine countries and 
years of observations into a country-year dataset, resulting in 101 country-year 
observations, each country featuring an average of 7.2 times during this period. 
The analysis focuses on the 18- to 64-year-old labor force residing in collective 
or private households.? The sample size for the native-born varies between 38,894 
in Luxembourg and 655,276 in France, while for immigrants the smallest sample 
is from Finland (87 people), the largest from the United Kingdom (3,775; for 
immigrant sample size, see Table 1). In some countries survey participation was 
voluntary, which probably results in the under-representation of recent or irregular 
immigrants. Moreover, illegal immigrants tend not to be covered by regular 
national surveys, as they are not part of the resident population by definition. 
Coverage of legal immigrant residents alone — with temporary or permanent 
permits — without proper sampling of illegal migrants is likely to introduce an 
unavoidable bias into estimated models of immigrant assimilation. 
. Unemployment status, a dependent variable in the study, is a dichotomous 
variable distinguishing between employed and unemployed persons in the labor 
force.* The standard International Labour Organization (ILO 1990) definition of 


unemployment is applied: unemployed persons are those without a job, but 
currently available for and seeking work. 
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The odds of being unemployed are estimated separately by gender and are 
predicted by a number of individual and macro-level variables. On the individual 
level the following dummy coded? variables are included in the analysis: 


1. Comparison groups: native-born (reference group) 
and recent (up to five years in the host country), 
non-naturalized, third-country immigrants residing 
temporarily or permanently in E.U. countries. Third- 
country immigrants are defined as those migrants not 
coming from any of the 15 "old" member countries 
of the EU or Norway, Iceland, Switzerland, the United 
States, Canada, Australia, New Zealand or Japan. 


2. National origin: differentiating among immigrants 
from Central and Eastern Europe (reference group); 
Turkey, the Middle East and North Africa; other African 
countries; Asia; and Latin America. 


3. Age groups: dummy-coded variables with three 
categories: 18-25, 26-45 (reference group) and 46-65 
years old. 


4. Level of education: namely the lower, middle and upper 
part of the educational range, based on the seven- 
level International Standard Classification of Education 
(ISCED) (UNESCO 1997). More specifically, the first 
group, the reference category, includes those persons 
with basic compulsory education up to lower secondary 
education (ISCED 0-2); the second consists of those 
who have attained vocational, upper secondary or post- 
secondary non-tertiary education at the most (ISCED 3- 
4); the third group is constituted by those possessing 
any kind of tertiary credential (ISCED 5-6). Cases with 
missing information on the level of education are 
marked by the corresponding dummy variable. 


5. Yearof observation: from 1992 to 2000, with year 2000 
serving as reference category. 


The following macro-level variables are included in the analysis (see bivariate 
correlation coefficients for macro-level variables in Table A1 in the appendix): 


1. 7he sie of the unskilled and low-skilled sector: 
percentage of the total labor force employed in 
occupations of the lowest quarter of the occupational 
ladder’ (see Figure A1 in appendix for the average 
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number for the years 1992-2000) These include 
unskilled, semi-skilled and low-skilled occupations 
in sales and services, agriculture, fishery, mining, 
construction, manufacturing, transport, plant and 
machine operations, and assemblage. This variable is 
meant to reflect the structure of the host countries' 
labor markets. From Figure A.1 itis evident that southern 
European countries and Ireland have a heavier bottom 
in their occupational structure, while in Germany, the 
Netherlands and the United Kingdom this is less so. 
The variable is time-variant in the models, calculated for 
each year of observation. 


2. Summary index of the strictness of employment 
protection developed by the OECD (1999:66). The 
OECD EPL strictness index ranges from 0 to 4, where 
higher scores imply stricter employment protection 
and stricter regulation in the use of flexible forms 
of work arrangements. Low EPL countries such as 
the United Kingdom and Ireland score 0.5 and 0.9 
on the index, respectively, while the more regulated 
southern European labor markets reach index scores 
of around 3.5 (see Figure A.1 in appendix). Due to a 
lack of time-variant information the variable is time- 
constant in the analyses. 


3. Dummy-coded variables classifying E.U. countries into 
three welfare regime types: liberal (United Kingdom, 
Ireland); conservative (France, Germany, Netherlands, 
Belgium, Luxembourg, Austria, Spain, Portugal, Greece), 
as the reference category; and social democratic 
(Sweden, Denmark, Finland) (Esping-Andersen 1999). 
This variable is utilized to capture countries' differences 
with respect to employment protection regulation and 
welfare provision. 


4. Toaccountfora country's overalleconomic performance 
and to ascertain if immigrants are disproportionally 
affected by business cycle fluctuations, as suggested 
by Jones (1993), the GDP percentage change on a 
year ago, information taken from the OECD Statistical 
Compendium, is included in the model. 


The method used for explaining cross-country variation in unemployment 
among immigrants as compared to the native-born is multilevel analysis (Bryk 
and Raudenbush 1992; Snijders and Bosker 1999). Multilevel analysis techniques 
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are well suited to dealing with the hierarchical structure of the data. A hierarchical 
linear and non-linear modeling (HLM 5.05) program (Raudenbush et al. 2000) is 
used to estimate the effect of immigrant status on the log odds of unemployment. 
The two-level contextual analysis has certain advantages: first, it allows the 
effect of immigrant status to vary among the countries, and second, it enables 
one to estimate the effect of country-level attributes on immigrant inequality in 
employment chances. One of the strengths of HLM is its ability to weight its 
parameter estimates toward level-two units with more data, so countries with a 
relatively small number of immigrants will not seriously bias the results (due to 
the large sampling error associated with small sample size). 

The estimated model of an individual RU risk is as follows: 


logit (P, 2 dT Ge 3 x i Y oe Kay He t E Xaij 


q-l zl 


The first four terms estimate (1) the fixed effects of the intercept, (2) level 
1 covariates (a dummy variable for third-country immigrants, dummy variables 
for ethnic origin, age, education, and year of observation), (3) level 2 covariates 
(the size of the unskilled and low-skilled sector, EPL strictness, dummy variables 
for welfare regime type, and GDP percentage change), (4) and cross-level 
interactions between level 2 covariates and the dummy variable for the third- 
country immigrant group. Slopes of other individual level control variables were 
constrained to be equal across countries. Fixed parameters are expressed by Se 
in the equation above. None of the individual level variables, being dummy-coded 
variables, were centered in any of the models. Metric macro-level variables were 
centered around their grand means. 

The 5th term in the equation is the random intercept m of the level 2 units, and 
the 6th term is the random slope y,, associated with the level 1 dummy variable 
for third-country immigrants. Multivariate, normal distributional assumptions are 
made for Woj and p,,, with joint zero means and a covariance matrix. 


Descriptive Findings 


Figure 1 plots unemployment rates of recent third-country male immigrants and the 
native-born across the European Union countries. It is evident that in all countries 
under discussion, except for Ireland, male immigrants from third countries have 
higher unemployment rates, but cross-national variation is also apparent. In 
southern European countries, Austria and the United Kingdom unemployment 
rates among recent third-country immigrants seem to be closer to those among 
the native-born male population, while in other continental and Scandinavian 
countries employment disadvantage appears to be more pronounced among 
this immigrant group. In the Netherlands, Belgium, Luxembourg, Denmark and 
Sweden, for example, recent third-country male immigrants have more than a 
five-fold risk of being unemployed compared to the native-born, while in Finland 
and France their unemployment rates are more than four times as high. 
Similarly, female immigrants from third countries are disadvantaged with 
regard to employment in all E.U. countries. The only exception is Spain, where 
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- Greece; UK - the United Kingdom; BE - Belgium; DK - Denmark; PT - Portugal; NL - the Netherlands; AT — Austria; 


Note: Abbreviations (here and henceforth): ES - Spain; FI - Finland; IE - Ireland; FR — France; SE - Sweden; GR 
LU - Luxembourg; DE - Germany. 


Figure 1. Unemployment Rates among Men, Aged 18-64, in the E.U. Countries 


Source: EULFS 1992-2000, average numbers 
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women from third countries appear to have slightly lower unemployment than 
native-born Spanish women. The unemployment differences between female 
immigrants and the native-born are smallest in Ireland, Greece and Germany, and 
largest in Belgium, Sweden and the Netherlands. 

All in all, it can be said that for both men and women the unemployment 
gaps between natives and immigrants are somewhat smaller (or even reversed) 
in southern European countries and lreland, countries with a larger share of 
unskilled and low-skilled jobs. More pronounced is unemployment among recent 
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immigrants in Nordic countries, Belgium and the Netherlands, which are marked 
by a smaller unskilled and low-skilled job sector as well as stricter employment 
protection legislation. 

To illustrate the effects of countries' labor market characteristics on immigrants’ 
unemployment disadvantage, Figure 3 presents bivariate plots of the two labor 
market indicators and the unemployment gap between recent third-country 
male immigrants and natives, calculated as the ratio of unemployment rates 
for both population groups averaged for the period between 1992 and 2000. 
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Figure 3. Labor Market Indicators and Unemployment Figure 4 presents similar 
Disadvantage of Recent Third-country Male Immigrants correlationsforwomen.The 


graph in each figure clearly 





90 - shows that in countries 
45 with a heavier bottom in 

o 40 the occupational hierarchy 
° SO recent third-country 
eo immigrants, both men 
= 25 and women, experience 
Z 20 fewer difficulties in finding 
= 15 employment. A reversed 
e U-trend corresponds to the 
Pas correlation between the 
35 EPL index and immigrants' 

0 unemployment ratio, 

20 25 30 35 40 45 fitting the pattern for 
Proportion Low-skilled men somewhat better 


than for women. Recent 
third-country immigrants, 
particularly men, appear 
to fare better in the more . 
flexible labor markets of 
Ireland and the United 
Kingdom. At the same 
time, however, male and 
female newcomers from 
third countries seem to 
have fewer disadvantages 
in southern European 
countries, which have 
very strict labor market 


Unemployment Ratio 





regulations together 

0 1 2 3 4 with rather expanded 
EPL unskilled and low-skilled 

Index job markets, particularly in 


agriculture, construction 
and domestic services. 

mo Even though the third- 
country immigrants are treated in the figures above as a homogeneous group, this 
is not strictly so. Table 1 shows the variation with regard to origins of third-country 
immigrants in the E.U. countries. The majority of newcomers in Greece, Austria, 
Finland, Sweden, Luxembourg and Germany stems from Central and Eastern 
European countries, with the former Yugoslavia being one of the most prominent 
sending countries. Immigrants from the Middle East, Maghrebian countries 
and Turkey constitute the largest group in Belgium, France and the Netherlands 
and are also very pronounced in Germany, Denmark, Spain and the United 


Source: EULFS 1992-2000, average numbers 


Kingdom. Black Africans 
constitute the largest 
group among recent 
immigrants in Portugal. 
African immigrants are 
also a substantial group 
in the United Kingdom, 
Denmark, France and 
Luxembourg. Immigrants 
from South and East 
Asia are the largest 
group in the United 
Kingdom. The proportion 
of these immigrants 
varies between 10 
and 15 percent also in 
Luxembourg, Denmark, 
the Netherlands, France, 
Germany and Sweden. 
Finally, Latin Americans 
are the largest immigrant 
group in Portugal and 
Spain, while a significant 
proportion of immigrants 
from Caribbean countries 
settled inthe Netherlands, 
some in France, Sweden 
and the United Kingdom. 


Results of the 
Multivariate Analyses 


Though offering a general 
idea of the magnitude of 
employment disadvantage 
among recent  third- 
country immigrants, the 
descriptive figures in the 
section above might be 
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Figure 4. Labor Market Indicators and Unemployment Dis- 
advantage of Recent Third-country Female Immigrants 
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Source: EULFS 1992-2000, average numbers 


somewhat misleading because they do not consider differences in human capital, 
demographic characteristics and ethnic origins of immigrants. Hence, results of 
multilevel analyses, which take into account both individual characteristics of the 
analyzed individuals and the institutional factors of their receiving societies, are 


presented here. 


Table 2 presents the results of a logistic two-level regression predicting log 
odds of being unemployed for native-born men as compared to recent third- 
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Table 1: Origins of Third-country Immigrants in the European Union Countries 


Northern Africa, Sub-Saharan Latin America, 
Europe Middle East. Turkey Africa — Asia Caribbean N 
Austria 79.3 12.5 1.2 44 1.5 3158 
Belgium 17.1 58.7 4.6 9.1 3.9 1065 
Germany 544 27.8 3.9 10.1 27 3447 
Denmark 44.2 23.8 9.1 14.3 64 265 
Spain 8.3 254 3.6 5.9 42.2 1513 
Finland 73.6 9.2 6.9 6.9 23 87 
France 15.8 54.5 9.0 11.6 TA 865 
Greece 88.0 6.3 8 4.1 4 3600 
Ireland 18.6 5 0 0 0 393 
Luxembourg 57.3 74 8.1 16.8 4.9 567 
Netherlands 20.8 41.5 5.9 de 14.1 3586 
Portugal 7.0 .6 54.3 1.6 35.9 516 
Sweden 61.8 14.1 5.6 9.7 6.7 390 
UK 10.1 21.1 18.8 42.0 6.0 3775 


Source: EULFS 1992-2000 


country immigrants. [he aim of Model 1 is to assess the difference between the 
third-country immigrants and the native-born when only individual level variables 
are included, while allowing systematic variation across countries (between- 
country variation) in the intercept and the slope for third-country immigrants. The 
coefficient for third-country immigrants (e'?'—3.35) suggests that on average 
third-country male immigrants are much more likely to be unemployed than 
the native-born male population, even taking into account their individual level 
attributes. There are significant differences among immigrants of various ethnic 
origins in their log odds of unemployment. Immigrants from sub-Saharan African 
countries are substantially disadvantaged with regard to employment in European 
countries, while newcomers from Asian countries have lower unemployment risks 
than the rest of third-country immigrants. It should be noted, however, that the 
reference group is defined as immigrants from Europe and those of unidentified 
origin (other or stateless), but there are hardly any significant differences between 
these immigrants and those coming from the Middle East, North Africa, Turkey 
and Latin America. j 

Moreover, there exists a rather significant disadvantage (e? — 2.41) for young 
men compared to those 26-45 years old in finding employment. At the same 
time, older people have somewhat lower unemployment risk than the reference 
group. Those who left school with a vocational or secondary education certificate 
have lower unemployment risks (e^ — .65) than the least educated, controlling 
for other individual level variables. Tertiary education leads to even better 
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Table 2: Results of Logistic Two-level Regression Predicting Unemployment among Men 


Variables Model 1 Model 2 Model 3 
Intercept (native-born) ne Baud 2625 Z (15) Eil, (45) 
Proportion of low-skill jobs 05 ee OD) .03 (.02) 
EPL strictness E20. (05) H2 (.13) 
Liberal regime ds 25) 
Social-democratic regime lo) 
GDP change -09**  (.03) 
Third-country immigrants (difference from native-born) 
Intercept 121: 09) 1S COL) At eK 
Proportion of low-skill jobs dd 5 101) e 02) 
EPL strictness O ADO) -.10 (15) 
Liberal regime -1,04** (28) 
Social-democratic regime 40% (.18) 
GDP change .01 (.02) 
Individual-level variables 
Ethnic origin 
Middle East and North Africa .08 (.08) .07 (.08) .08 (.08) 
Sub-Saharan Africa ISTE 646) 8255919115) Mos 81/15) 
Asia 294^ * (41) Ser (10) 2:392 0614) 
Latin America -.19 (.17) -.17 (.17) -.16 (.18) 
Age 
18-25 .88** (04) .80"* (04) .88*** (04) 
26-45 (ref.) 0 0 0 
46-65 -22""* (.04) r2 DA) -22"* (.04) 
Level of education 
Compulsory (ref.) 0 0 0 
Upper secondary -43"*  (.04) -44"*  (.04) -43"* (04) 
Tertiary -83"* (06) eg ege sog s c OR) 
Information missing -25"*  (.04) 2 A) -25"* (04) 
Year of observation 
1992 .19 (.20) .05 (.18) -.19 (.21) 
1993 .39 (.20) .26 (.20) -.07 (.22) 
1994 48"  (.19) (a8 (.19) 30 (.19) 
1995 92 (.19) .20 (.20) .16 (.19) 
1996 .26 (.21) 16 (.22) .08 (.21) 
1997 .36 (.20) 25 (.22) 23 (.20) 
1998 .34 (.21) 20 (.22) 25 (.20) 
1999 23 (.18) .20 (.20) .14 (.18) 
2000 (ref.) 0 0 0 
Variance components 
Level 2 Variance intercept .48 Ai A0 
Level 2 Variance slope .61 .36 2 


Source: Pooled EULFS data, 1992-2000, selected years 
Notes: **p «.05  ***p —.01. Robust standard errors are in parentheses 
N (individual level) = 2,680,219; N (second level) = 101 
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chances in finding employment. The odds of being unemployed for holders of 
tertiary education certificates in the European Union are much (8-9 = .44) lower 
than for those who have compulsory education only, other things being equal. 
Since the data encompass cases with missing information on educational level, 
a dummy variable has been introduced to account for them. It signifies that men 
for whom information on education is missing have lower risks of unemployment 
than people with compulsory education. Finally, during the 1990s the risk of 
unemployment fluctuated, being somewhat higher in 1994 than in 2000. 

It can be assumed that the number of covariates included in the model on 
the individual level is far too small to account for the whole variety of difficulties 
encountered by recent arrivals, such as host country language problems, 
immediate adaptability and so on. It can, however, be taken for granted that 
recent immigrants experience quite similar difficulties immediately upon their 
arrival in any of the E.U. countries. The question thus arises: Are there systematic 
differences in the labor market structures and regulations of E.U. countries that 
might be responsible for the variation in immigrant unemployment penalties? 

In Model 2 the intercept and the slope for third-country immigrants are modeled 
as a function of macro-level characteristics (i.e., countries’ demand for low-skilled 
labor and EPL strictness). In Model 3 the dummies for welfare regime types are 
added to the variable pertaining to the general economic climate in the country. 
Because the effects of macro-level predictors on the slope for third-country. 
immigrants (i.e., cross-level interaction effects) are of primary interest to this 
study, while structural determinants of unemployment in general (main effects) 
are of lesser concern, only the former will be discussed in detail. Negative cross- 
level interaction effects indicate a decline in the unemployment gap between 
immigrants and the native-born, while positive effects signify an increase. 

The results show that stronger demand for unskilled and low-skilled labor 
tends to equalize the likelihood of recent third-country immigrants securing 
employment (compared to the native-born) as the immigrant unemployment gap 
decreases (e*!?). In accordance with Hypothesis 1, the effect of EPL strictness 
upon employment inequalities is positive and statistically significant. This means 
that stricter employment protection legislation might be held accountable 
for the greater difficulty recent third-country male immigrants face in finding 
employment, once national labor market structures are controlled for. Taken 
together, the two variables pertaining to labor market structure and regulation 
manage to explain about 41 percent [((14.36/.61))*10096] of the variance in the 
slope for recent third-country immigrants present in the model containing only 
individual-level characteristics. 

In Model 3, in addition to the macro-level variables related to labor market 
structure and regulations, dummy variables pertaining to welfare regimes are 
included, the conservative welfare regime serving as a reference category. It is 
noteworthy that the effect of EPL strictness disappears after dummy variables 
for welfare regimes are included. This accords with theoretical arguments and 
expectations that the nexus of labor market flexibility is an essential component 
in the welfare regime construct. The negative interaction effect of the slope for 
third-country immigrants and the proportion of those employed in unskilled and 
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low-skilled jobs remains significant even after controlling for welfare regime 
types. It appears that the unemployment gap between recent immigrants and 
the native-born in liberal welfare countries is smaller than in conservative welfare 
states. Conversely, in socio-democratic welfare regimes these immigrants seem 
to be more disadvantaged, or in other words, the immigrants' unemployment 
gap is larger. Business cycle fluctuations affect the employment chances of 
recent third-country immigrants and the native-born to a similar degree, as 
the coefficient for GDP change is not statistically significant. Altogether, 
macro-level variables included in the analyses account for about 55 percent 
[(1-(.27/.61))*100%] of the variance in the slope for recent third-country male 
immigrants, suggesting that they play an important role in explaining cross- 
national variations in immigrant inequalities. 

Table 3 shows the results of similar analyses of unemployment propensity 
conducted for recent third-country female immigrants and native-born 
women. The sequence of the models is the same as in the analyses of the 
male population, so the results can be compared between the two gender 
groups. From Model 1 it appears that immigrant women suffer from higher 
unemployment risk (e'° = 2.72) in the E.U. countries, but the gap between 
immigrants and native-born women is somewhat smaller than among males. 
Moreover, the differences among immigrant women of various ethnic origins 
are less pronounced (and are not statistically significant) than among men. 

The effects of individual-level control variables are quite similar to those 
observed among the males: younger women (18-25 years old) have higher 
unemployment risks than those aged 26-45. The older ones are, however, less 
disadvantaged: the gap between 46- to 65- and 26- to 45-year-old women is larger 
than the respective gap among the men, which might perhaps be related to the 
higher propensity of younger women to be involved in child-bearing and rearing, 
as well as statistical discrimination they might face from the side of employers. 
Educational effects on unemployment risk point in the expected direction and 
are quite similar to those of the men. The only substantial difference is that those 
women for whom information on education is missing are not systematically 
different from respondents with compulsory education. Finally, the dummy 
variables for years of observation show that the year 1998 was somewhat less 
favorable for women. 

Models 2 and 3 in Table 3 mimic those estimated for the men. Similarly, the 
effect of a country's demand for unskilled and low-skilled jobs is negative, which 
signifies a decrease in the unemployment gap between immigrant and the native- 
born women in countries with more sizable unskilled and low-skilled sectors. 
This implies that in countries with a larger share of unskilled and low-skilled jobs 
immigrant women are less penalized compared to native-born women than in 
the rest of the countries. EPL strictness, however, exerts no significant effect 
on immigrants' employment inequalities among women. In this regard, it should 
be noted that the estimated models for female unemployment do not include 
important variables pertaining to the women's family situation and child status 
(due to their unavailability in the dataset). Theoretical considerations suggest 
that employers might perceive both women and immigrants as less productive 
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Table 3: Results of Logistic Two-level Regression Predicting Unemployment among Women 





Variables Model 1 Model 2 Model 3 
Intercept (native-born) :2:23* ^. (46) 792247509014) 6 24897 29049) 
Proportion of low-skill jobs 0558069601) .02 (.02) 
EPL strictness .03 (.05) 9455: 21 (40) 
Liberal regime SO MA) 
Social-democratic regime .39 (.25) 
GDP change -.03 (.03) 
Third-country immigrants (difference from native-born) 
Intercept LO A .99***  (.09) SUED 
Proportion of low-skill jobs -08** (01) «0977 (02) 
EPL strictness 12 (.07) A7 (.18) 
Liberal regime -.16 (.35) 
Social-democratic regime dd 20) 
GDP change .00 (.03) 
Individual-level variables 
Ethnic origin 
Middle East and North Africa -.15 (.13) -.17 (.12) -.15 (.13) 
Sub-Saharan Africa .19 (.14) 24 (.14) .26 (.14) 
Asia -.14 (.10) -.15 (.10) -.15 (.10) 
Latin America -.13 (.20) -.12 (.19) -.12 (.20) - 
Age 
18-25 .62"*  (.04) .64"*  (.04) 63*"*  (.04) 
26-45 (ref.) 0 0 0 
46-65 ABT"  (.06) -48"*  (.06) -48"*  (.06) 
Level of education 
Compulsory (ref.) 0 0 0 
Upper secondary -28"*  (.04) 29009) -20 - (03) 
Tertiary -.80***  (.05) -82**  (.05) «815 (05) 
Information missing -.08 (.07) -08**  (.08) -08*"*  (.08) 
Year of observation 
1992 “2 (.20) 11 (.20) 14 (.22) 
1993 EN (.21) .39 (.21) EN (.26) 
1994 Hm (.21) Ke (21) 495857 5 (522) 
1995 .28 (.23) 28 (.22) At (.24) 
1996 .28 (.23) 27, (.21) d (.21) 
1997 33 (.24) 3i (.21) 3 (.22) 
1998 Od at 28) 48" (22) oes bud) 
1999 20 (.20) .18 (.19) 20 (.19) 
2000 (ref.) 0 0 0 
Variance components 
Level 2 Variance intercept ¿02 46 44 


Level 2 Variance slope 55 38 39 


Source: Pooled EULFS data, 1992-2000, selected years 
Notes: **p < .05 ***p < .01. Robust standard errors are in parentheses 
N (individual level) = 1,982,202; N (second level) = 101 
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and hence discriminate against them, but it is not immediately clear from the 
estimates that immigrant women are particularly disadvantaged in countries 
where the costs of a bad match are high, i.e. in less flexible economies. This 
finding is also confirmed in Model 3, which shows that the unemployment 
risks of immigrant women are indeed similar in flexible Ireland and the United 
Kingdom and in conservative continental Europe. Immigrant women do suffer 
from higher unemployment in the Nordic countries, marked by their strong 
gender equalization labor market policies. The macro-level variables in the final 
model manage to explain about 40 percent [(1-(.33/.55))*100%] of the variance 
in the slope for recent third-country female immigrants present in the model 
containing only individual-level characteristics. 


Summary and Discussion 


In recent decades European countries have increasingly become a destination for 
migrants from all over the world. Deficient economic incorporation of immigrants, 
their often low labor force participation, welfare dependency and, above all, 
their high unemployment are at the top of the agenda in the majority of these 
countries. Indeed, as revealed in this paper recent third-country immigrants, both 
male and female, face, as a rule, higher unemployment in European countries 
compared to the native-born population, but there exists some cross-national 
variation in this regard. In southern European countries and Ireland newcomers 
suffer smaller penalties with respect to employment than in the rest of Europe. 
Thus, multivariate analyses were conducted with an aim to explain whether this 
variation in unemployment risk among recent third-country immigrant populations 
as compared to the native-born could be systematically related to variation in 
institutional factors of the receiving societies responsible for immigrant labor 
market allocation, and in particular, to variation in labor market structure and 
regulations. 

The results of the multilevel logistic regression analyses confirm that in receiving 
countries with a larger share of unskilled and low-skilled jobs, recent third-country 
immigrants, both men and women, have better employment entry chances, or in 
other words, are less disadvantaged compared to the native-born (Hypothesis 3). 
A country's demand on the less-skilled labor force might also exert an effect on the 
nature of migrant inflow (e.g., seasonal migration for employment in agriculture), 
attracting migrants with more immediate goals. Southern European countries, 
for example, where the number of unskilled and low-skilled jobs, particularly in 
agriculture and construction, is quite large, also seem to have experienced a boom 
in temporary, labor-oriented immigration, so that comparatively smaller immigrant 
disadvantages in these countries can be explained by a match in the nature of 
migrant inflow to the type of labor needed. 

Further, it could be shown that the employment disadvantages of recent third- 
country male immigrants are indeed lower in countries with more flexible labor 
markets, which accords with Hypothesis 1. No similar effect for women has been 
confirmed, which might be at least partially attributed to the lack of important 
control variables in the unemployment equations of women. 

By and large, Engelen's (2003) thesis regarding different modes of economic 
incorporation in liberal welfare states (syndrome of mobility and flexibility) as 
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compared to conservative and social democratic welfare regimes (syndrome 
of closure and rigidity) has been confirmed (at least for men) by the results of 
the multilevel analyses presented here (Hypothesis 2). The findings show that 
in liberal welfare countries male immigrants have better prospects of finding 
employment than those who settled in countries with conservative welfare 
regimes. Both male and female immigrants heading to social democratic welfare 
regimes fare even worse with respect to employment than those in continental 
host countries. 

How sure can one be that in claiming an effect for labor market flexibility — a 
distinctive feature of the liberal welfare regime — on immigrants’ employment 
chances oneis not picking up other macro-level effects spuriously related to it (King 
et al. 1994)? Among possible correlates the following two can be mentioned. First, 
immigrants heading to countries with more flexible labor markets might be more 
positively selected with respect to observed as well as intrinsic characteristics 
such as motivation or abilities. While positive self-selection with respect to formal 
educational qualifications in liberal welfare regimes can be confirmed (results not 
shown), with the standardized labor force data it is hardly possible to ascertain 
whether immigrants heading to more unequal liberal welfare regimes are in fact 
"better" selected with respect to motivation or intrinsic abilities, as argued by 
Borjas (1987). 

Immigrant selection and labor market performance can also be related 
to status as labor or refugee migrant. There is indeed some evidence that 
Nordic countries, particularly Denmark and Sweden, but also Belgium and the 
Netherlands, accepted a larger proportion of asylum seekers in their total intake 
of third-country immigrants. The availability and comparability of the data referring 
to asylum seekers and refugees in Europe are, however, somewhat problematic? 
In any event, including a variable pertaining to the approximate proportion of 
asylum seekers in the multilevel regression analyses yields no significant effect 
for this variable, nor does it alter the effects of other macro-level variables. 

Second, in both of the liberal European welfare regimes, the United Kingdom 
and Ireland, English is spoken, and it goes without saying that in the modern 
world lots of people study and speak English, albeit not always at a level sufficient 
to enter prestigious white-collar employment in an English-speaking country. 
At any rate it might well be expected that people coming to Great Britain and 
Ireland would command the basics of the host country language upon arrival, 
which might not be the case among immigrants heading to Germany, Finland 
or Denmark. In the analyses presented here it was unfortunately impossible to 
control for host country language proficiency, as this variable is not present in 
the dataset. 

The current study focuses solely on immigrant unemployment, while other 
indicators of immigrant labor market performance, such as earnings, are 
neglected, largely because of data limitations. Analyzing income could add an 
important dimension to the assessment of immigrants' labor market inclusion 
and future research should address this issue. It is particularly important since 
E.U. countries vary with respect to their wage bargaining arrangement models, 
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minimum income legislation, and the way welfare states redistribute wealth 
and disperse wages. There are reasons to expect that in liberal welfare states, 
immigrants, particularly those in low-skilled employment, face substantial risks of 
economic deprivation and even poverty because such jobs often do not cover the 
cost of living. In conservative, and especially in social-democratic welfare regimes 
with their more egalitarian wage structures and stronger redistributive policies, 
low-skilled immigrants and even the unemployed should enjoy a decent living. 

Despite its limitations and the questions which remain open, it is fair to 
conclude that this paper elucidates contours in the variety of immigrant inclusion 
patterns in Europe more clearly and systematically than has been done thus 
far. In addition it points out an important direction for future cross-national 
research, which should more closely concentrate on the institutional settings 
of host societies, which clearly have been shown to influence the labor market 
integration of newcomers. 


Notes 


1. While the error discrimination approach assumes that false beliefs are 
imputed to the "true" productivity of workers due to a lack of full information, 
a statistical discrimination approach assumes that employers do not have 
full information on the productivity of potential workers, imputing group 
information instead. For more on statistical and error discrimination see 
Arrow (1972), Aigner and Cain (1977), England (1992), Phelps (1972). 


2. Inhis seminal book The Three Worlds of Welfare Capitalism, Esping-Andersen 
(1990) suggests clustering capitalist societies into three regimes: liberal, 
conservative and social democratic. The basis of this classification rests 
on three key concepts: the degree of "decommodification," the principle of 
stratification and the nature of state-market-family relations. According to 
Esping-Andersen welfare regimes represent certain institutional packages 
able to shape (e.g., employment or stratification), resulting in different 
outcomes on the individual level. 


3. Some countries did not include collective households in the EULFS, 
thus neglecting recently arrived asylum seekers who, as a rule, reside in 
specialized camps. 


4. Thus inactive persons (those furthering their education, homemakers, the 
retired, the disabled and "others") are excluded from the analyses. 


5. The categorical form of variables used for the analysis is determined by the 
specifics of the data, which were delivered in the form of a multidimensional 
table. Moreover, the data do not allow for including all possible factors 
influencing unemployment risks. Unfortunately, they do not include 
respondents' family characteristics and their child-status, impeding in 
particular analysis of the employment propensity of women. 


6. In some countries such as lreland, the reference group also includes 
immigrants whose nationality is defined as "other or stateless." 
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7. These are occupations which are assigned scores 16-33 on the 
International Socio-Economic Index of Occupational Stàtus (ISEI). ISEI 
ranks occupations on a hierarchical scale ranging between 16 and 85 
(Ganzeboom and Treiman 1996). 


8. Snijders and Bosker (1999) argue that this model should not include a separate 
parameter for level-one variance, because it is an equation for the probability 
of the outcome rather than the outcome itself LS If one interprets a logistic 
regression as a linear regression of a continuous latent variable observed in 
dichotomized form, the so-called threshold model, then the individual level 
residual e; is assumed to have a standard logistic distribution with the fixed 
variance of «7/3 (3.289) (Long 1997; Snijders and Bosker 1999). 


9. Eurostat, which publishes some information on refugees and asylum 
seekers, admits that definitions, data availability, reliability and comparability 
are sometimes problematic. Even though exact numbers of asylum seekers 
may not be absolutely consistent (which is the reason they are not presented 
in this paper), the general trends mentioned here seem to be supported by 
other publications (SOPEMI, 2000, 2001). 
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Appendix 


Figure A1. Values of Macro-level Variables Included in the Multilevel Model 
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Figure Al. continued 
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Table A1: Bivariate Correlation Coefficients among Macro-level Variables Included in Multi- 
level Analyses 





Variable 1 2 3 4 5 

1. Liberal Welfare Regime 1 

2. Conservative Welfare Regime 0 1 

3. Social-democratic Regime 2:48 -.68*** 1 

4. Demand for Low-skilled Labor S Pass -.10 1 

5. EPL Strictness -.68*** es = 021: 27s 1 

6. GDP Change chow -.24** .05 -.02 Ge 


Source: EULFS 1992-2000 
Notes N-101; *p «.05 ***p < .01 (two-tailed tests) 
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Multiculturalism and “American” Religion: 
The Case of Hindu Indian Americans 


Prema A. Kurien, Syracuse University 


How non-Christian religious groups should be politically recognized 
within Western multicultural societies has proved to be a pressing 
contemporary issue. This article examines some ways in which 
American policies regarding religion and multiculturalism have shaped 
Hindu Indian American organizations, forms of public expression and 
activism. Specifically, I look at three examples of the impact of such 
policies on Hindu Indian Americans representing strategic arenas in the 
socio-political recognition of ethnic groups, and also crucial moments 
in the institutionalization of an American Hinduism. I also show how 
the activism of Hindu Americans is reshaping the contours of religion, 
society and politics in the United States. 


Introduction 


How non-Christian religious groups should be politically recognized within Western 
multicultural societies has proved to be a knotty but pressing contemporary 
issue. This topic is important to consider in the context of a discussion of social 
membership policies because the political recognition of religious groups is 
crucial to providing them access to the valued economic, social and cultural 
goods that citizenship in Western countries confers. The approach of different 
countries toward accommodating non-Christian religions has varied a great deal 
depending on the particular narratives of nation espoused and whether they see 
religion as a matter of private life or of public policy (Fetzer and Soper 2005; 
Kepel 1997; Modood and Werbner 1997; Neilsen 1992). Despite this diversity, 
policies toward religious groups have been based on specific definitions of what 
comprises religion, what are accepted ways of practicing religion, and which 
group is seen as representing the “authentic” religious voice. While the goal is 
usually to create a neutral framework to accommodate the variety of religious 
groups in the country, in practice the definitions are often based on Christian 
(and frequently Protestant) understandings, which results in the marginalization 
of groups that do not fit this mold (see Alexis 2003). Having to conform to 
these policies in order to obtain recognition and resources can also result in the 
transformation of the religious beliefs and practices of such groups. 

Tariq Modood (1998:387) has criticized the “theoretical neglect of the role of 
religion” in Western multicultural societies, which “reflects a bias of theorists that 
should be urgently remedied.” This of course has changed since the events of 
Sept. 11, 2001 and the summer 2005 bombings in London. Both these events 
resulted in a greater tendency to religiously categorize non-Western groups, and 
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for the groups themselves, a greater need to manage and positively represent 
their religious identities in the public sphere (Purkayastha 2005:174). Most of 
the research on multiculturalism and non-Christian groups in the West has been 
undertaken since 9/11 and has focused on Muslim communities in Europe 
(AlSayyad and Castells 2002; Fetzer and Soper 2005; Hunter 2002; Modood 
2005). How do Western multicultural policies affect non-Christian religions that 
are based on very different principles from Islam or Christianity? This article 
examines some ways in which social citizenship policies work to effect closure 
around who is defined as belonging to the national multicultural family and the 
role that religion plays in national membership processes. Specifically, it focuses 
on the ways in which American policies regarding religion and multiculturalism 
have shaped Hindu Indian American organizations, forms of public expression 
and activism. | look at three examples of the impact of such policies on Hindu 
Indian Americans, representing strategic arenas in the socio-political recognition 
of ethnic groups, and also crucial moments in the institutionalization of Hinduism 
in the United States. 

The policies of Western states are generally based on the premise that it is 
possible to separate religious and non-religious spheres. This poses a significant 
challenge for many non-Christian religions that do not make such a distinction 
(see Alexis 2003:15; O'Brien and Palmer 1993:96). Perhaps the most influential 
set of policies shaping the religious practices of non-Christians are those that 
determine how religion can be expressed and practiced outside the home. In most 
cases, the expectation is that religious activity will be located within a religious 
institution or organization. Rules that outline what criteria have to be met before 
a group is classified as a religious organization are the most important of such 
policies (Dane 1998; Cormode 1998). 

Thus, | consider first how rules regarding the organization and administration of 
American non-profit corporations affect the way Hindu temples are organized and 
run in the United States. Congregational understandings of religion that underpin 
rules regarding what entities can qualify as churches and religious organizations 
have had profound impacts on Hindu temples. Most temples do not qualify as 
churches and thus are not able to obtain the financial benefits of obtaining such a 
status. Even the less stringent rules defining religious organizations have required 
substantial changes in the organization and management of Hindu temples giving 
rise to conflicts and legal contestation in many temples throughout the country. 

While the formation and operation of religious organizations is made possible 
by state laws, they still occupy a private space, oriented primarily toward the 
particular religious community that they represent. For religious groups to be 
meaningfully incorporated into the national fabric however, they also need to be 
publicly recognized by the state. My second example looks at Hindu American 
attempts to get Hinduism publicly recognized as an "American" religion. | 
examine how Hindus have tried to accommodate to, but also challenge the 
religio-cultural ethos of the United States as well as some of the differences 
between the strategies adopted before and after Sept. 11, 2001. We will see 
how Hindu attempts to differentiate themselves from Muslims were stepped up 
in the aftermath of 9/11 and how they resulted in greater support for the Hindu 
nationalist platform (Kurien 2006). 


Multiculturalism and “American” Religion ° 725 


Even if religions are politically recognized by the state, this recognition needs 
to be translated into policies that determine who is acknowledged as being a 
member of the national multicultural family at the local level. Schools are crucial 
local sites where national belonging is defined and inculcated. Not surprisingly 
therefore, the treatment of non-Christian religious traditions in the school 
curriculum has become a contentious issue in practically all Western multicultural 
societies. My final example examines the mobilization of groups demanding a 
positive portrayal of Hinduism in American school textbooks on parity with those 
of other religions. | show how this mobilization resulted in the involvement of 
both those supportive of and those opposed to Hindu nationalist politics and also 
raised fundamental questions about who is qualified to speak on behalf of the 
Hindu American community. 

While American policies have affected the institutionalization of an American 
Hinduism, this article shows how the activism of Hindu Americans in each of 
the spheres mentioned above is reshaping the contours of religion, society 
and politics in the United States. The article also raises questions regarding the 
formulation and implementation of multicultural policies and underscores the 
shortcomings of multiculturalism when it comes to accepting new patterns of 
difference such as that manifested by Hindu Americans. Finally, it demonstrates 
the complexity of identity formation and mobilization by immigrant groups by 
showing the ways in which the category of “Hindu Indian American” is being 
shaped by contestation between different groups and by social and political 
processes taking place in India. 


Background to the Research 


According to the 2000 U.S. census, Asian Indians in this country numbered close to 
1.7 million. Most adults of Indian origin in the United States today are immigrants 
who arrived after the passage of the 1965 Immigration and Naturalization Act. 
Although relatively smallin number, Indian Americans often wield a disproportionate 
influence because they are among the wealthiest and most educated foreign-born 
groups in this country.' There are no official figures on the religious distribution of 
Indians in the United States. According to census figures, Hindus constitute more 
than 80 percent of the population in India, but it is likely that they constitute a 
much smaller proportion of Indian Americans because Indian religious minorities, 
particularly Sikhs and Christians, are present in much larger numbers in the United 
States. Estimates of the proportion of Indian Americans from a Hindu background 
range from 45 percent to 76 percent (Hofrenning and Chiswick 1999; 2002 
Britannica Book of the Year; 2000 World Almanac’). 

Contemporary Hindu nationalism or Hindutva, emerged in India in the early 
20th century as a reaction to Western colonialism, and resurged again in the late 
1980s. Several scholars have argued that the “Hinduism under siege” Hindutva 
message, and its emphasis on the need for Hindu pride and assertiveness 
is particularly attractive to Hindus in the United States who become a racial, 
religious and cultural minority upon immigration and have to deal with the largely 
negative perceptions of Hinduism in the wider society. Coalescing to defend a 
beleaguered Hindu identity has become an important way for Indians from a 
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Hindu background to counter their relative invisibility within American society 
(Kurien 2004; Lal 1999; Mathew and Prashad 2000). This is the reason that Hindu 
American mobilization for recognition and resources is frequently imbricated 
with the Hindutva movement. 

My analysis here draws on an eight-year study and book (Kurien forthcoming) 
on the new forms, practices and interpretations of Hinduism in the United States. 
Besides participating in the activities and programs of 12 Hindu organizations, 
| conducted detailed interviews with leaders and many of the members (more 
than 120 first- and second-generation Hindu Americans in all). | have also been 
following the activities of the Hindu Indian community around the country by 
reading several Indian American newspapers and the international magazine, 
Hinduism Today (and its daily Hindu Press International reports), published from 
Hawaii. This particular article is an examination of Hindu Indian Americans in the 
public sphere and is based primarily on an analysis of discussions on Internet 
groups, websites and Internet magazines devoted to Indian or Hindu related 
topics. Since the year 2000, when the Internet became a major site of Hindu 
American activity, | have been monitoring several such Internet sources devoted 
to Indian- or Hindu-related topics. These sources also provide links to articles 
or news-briefs from a variety of sources discussing issues pertaining to Hindu 
Americans which | analyzed. Many of the groups involved in the legal contestations 
and mobilizations that | describe below have their own websites. These websites- 
were another valuable source of information. Through these sources | have been 
able to obtain a profile of the activities of Hindu Indian American leaders at a 
national level, which is the main focus of this article. 


Becoming a Religious Organization in the United States 


The United States provides significant tax breaks and other benefits for groups 
that are designated as churches or religious organizations, and these policies 
provide strong incentives to seek that designation. It is more advantageous to be 
recognized as a church because churches receive more financial benefits than 
religious organizations. The Internal Revenue Service document pertaining to 
churches (Tax Guide for Churches and Religious Organizations) makes clear that 
the term "church" is used generically to refer to a place of worship and that the 
designation may be obtained even by non-Christian religious institutions [pog 
It also indicates that there is "no attempt to evaluate the content of whatever 
doctrine a particular organization claims is religious." (p. 23) Despite these 
caveats, the list of characteristics the IRS attributes to a church is clearly based 
on a Christian model. | examine these characteristics below and show why Hindu 
temples are generally not able to achieve the church designation. 

The characteristics specified by the IRS may be subsumed under three 
categories. The first category has to do with the nature of the religion. To be 
designated as a church, the IRS indicates that the religion should havea "recognized 
creed and form of worship," a "formal code of doctrine and discipline," "literature 
of its own" and "definite and distinct ecclesiastical government." These criteria 
are problematic for Hindu temples because Hinduism does not have a founder 
or a single canonical text or commentary. Although some sectarian traditions 
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have a formalized leadership hierarchy, there is no ecclesiastical body for all of 
Hinduism. It is also a religion that stresses orthopraxis over theological belief. In 
other words, there is no clear “creed,” “code of doctrine,” “form of worship” or 
“ecclesiastical government” that can characterize Hindu belief or practice as a 
whole. Temples in India usually house one or more deities belonging to a single 
tradition, so there may be some uniformity in the creed and form of worship 
each type of temple practices. In the United States however, many temples 
tend to be more ecumenical than temples in India (Williams 1992:238-40) due to 
the enormous expense involved in constructing them and may enshrine deities 
from several, and sometimes opposing, traditions. This is another reason that 
ecumenical temples in the United States may not have a single creed, doctrine 
or even a single body of literature that is considered authoritative. 

The second category has to do with the nature of the place of worship, activities 
carried out in the place of worship and membership policies. The IRS requires 
that a church maintain “membership not associated with any other church or 
denomination,” “regular congregations” and “Sunday schools for the religious 
instruction of the young.” Hindu temples are primarily the abode of one or more 
deities and are not considered to be congregational homes for worshipers. For 
this reason, there is no concept of membership in a temple. Devotees can and 
do worship in a range of temples. Again, with the exception of a few sectarian 
temples, temple worship is typically not a congregational activity. Temples in 
India also do not generally conduct religious classes for children. Children 
learn Hindu doctrine and practice in the home primarily from their parents and 
grandparents. 

The final category has to do with the qualifications of the clergy. The IRS 
indicates that they should be “ordained” and “selected after completing prescribed 
courses of study” in the religious training schools belonging to the churches. 
These specifications may exclude Hindu priests. Hindu temple priests in India 
traditionally hail from Brahmin families where male members have been practicing 
the profession for generations. Such priests obtain their education informally by 
studying and working with their fathers. Over the past few decades, specialized 
religious schooling for temple priests has become increasingly common in India 
and even a requirement for employment at the some of the major southern Indian 
temples (Fuller 2003). However, not all of the priests working in temples in the 
United States have attended such schools. 

Not surprisingly, most Hindu temples and practices dont conform to the 
IRS's specifications for a church. In fact, most Hindu organizations that have 
obtained the church designation are non-traditional, neo-Hindu groups.^ Most 
traditional temples are currently incorporated as religious organizations. However, 
recognizing the additional benefits obtained by churches when compared to 
religious organizations, one Hindu leader recently urged temples to attempt 
to achieve the "more difficult church designation . . .[for] “legal and social 
reasons." It will be interesting to see whether more Hindu temples will attempt 
to achieve this classification, and if so, what changes that will necessitate in their 
organization and functioning. Also of interest is whether the influx of many new 
religious groups whose religious institutions do not fit the characteristics of the 
current IRS church designation will lead the latter to change some of its criteria. 
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Even though the "religious organizations" designation is based on far less 
exacting criteria, it has also been difficult for temples to achieve. For example, in 
accordance with these rules, temples must have a "general body" and a "board 
of directors." Thus they have to develop "membership lists," and the members 
must hold elections to choose members of the board of directors. Neither of 
these are traditional practices. The requirement that the temple must have a 
"general body" leads to disputes over who is a "member" of the temple and 
who is a "Hindu." The requirement that trustees or boards of directors should 
be elected by the general body leads to conflict between supporters of the 
incumbent trustees and those who want to force them out of office. Trustee 
succession, and the position, role and benefits accruing to wealthy donors of 
the temple have become particularly controversial in the United States mainly 
because rules of "custom and usage" governing symbolic and material temple 
transactions that may be applicable in India are not recognized in this country. 
In the past decade, there have been major legal conflicts in several American 
temples over these issues.? 

Perhaps the most publicized case has been that of the Flushing temple in New 
York, one of the oldest and wealthiest temples in the country. Beginning in 2001, 
a group of 28 temple-goers went to the state courts, alleging that the temple 
had been run as a "private fiefdom" and in "utmost secrecy" and demanding "a 
more democratic style of governance." (Hindu Press International, Sept. 24, 2004 
and Oct. 31, 2004). The main point of contention had to do with the bylaws of 
the temple, specifically which of two sets of bylaws of the temple were to be 
used in temple governance. The first set of bylaws was formulated by its board 
of trustees (comprising both Hindus and non-Hindus because there were not 
enough Hindus with American citizenship at that time) when the temple was 
founded in 1970. This set of bylaws allowed both Hindus and non-Hindus to be 
members of the temple as long as they pledged to abide by the purposes and 
objectives of the Hindu Temple Society in the United States. The 1970 bylaws also 
called for an elected board. In 1978, after the temple had been built, a second 
set of bylaws were drawn up which established a self-perpetuating board with 
"total control over the temple." (Hindu Press International, Aug. 17, 2004) and a 
Hindu membership. It is the second set of bylaws that the subsequent boards of 
trustees held as binding on the temple. However this was being challenged by a 
dissident group of temple-goers. The case moved from the appellate to the state 
supreme court which ruled in 2003 that the 1970 bylaws governing the temple 
were binding and therefore that the temple's board was “illegitimate.” It called 
for the board to be dissolved and elections to be held after establishing a list of 
members eligible to vote. 

The temple president, Dr. Uma Mysorekar, then took her case to the Becket 
Fund for Religious Liberty in Washington, D.C. which went to federal court to 
challenge the actions of the state court. The Becket Fund argued that it was a 
violation of the First Amendment for the state to tell the temple how to structure 
its internal affairs and that their organization was “trying to prevent the court from 
imposing a non-Hindu form of control on the largest Hindu temple in America.” 
(Hindu Press International, Aug. 4, 2004) The state court had ruled that the 
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temple had to “open its membership to all of the 21,000 individuals on its mailing 
list who are over 18 years of age and willing to pay a token membership fee” 
who would in turn be eligible to vote in the elections (Hindu Press International, 
Oct. 31, 2004). Dr. Mysorekar argued that an outcome would be that “anyone 
off the street would be able to declare himself a Hindu and vote on decisions 
regarding the future of the temple.” She was particularly concerned that, “One 
person could get 20 non-Hindus to sign up and make a donation of $3 [each] and 
begin voting by proxy.” (Hindu Press International, Sept. 24, 2004) After many 
years of wrangling between the two sides, elections did take place in January 
2006 and the incumbent board of directors was overwhelmingly re-elected. The 
challengers charged that this result was due to proxy voting on the part of the 
board (http://www.templedemocracy. org, March 12, 2006 update). 

The Flushing temple case illustrates many of the issues facing Hindu temples 
that have to abide by the rules governing religious organizations in the United 
States. Firstly, the requirement that the majority of the founding board members 
should be citizens places a burden on immigrant groups. Another thorny issue 
is with respect to who can be a member of the temple. Even having bylaws 
specifying that members of the temple should be Hindu does not prevent conflict 
because the designation of who is a Hindu is problematic in the United States. 
Because Hinduism has no official conversion ceremony, traditionally in India, a 
Hindu is defined as someone born into a Hindu family. Recognizing the needs 
and financial contributions of American believers, many temples in the United 
States define Hindus as those who "have faith in Hinduism." When this definition 
is combined with a requirement that trustees should be elected it could mean, as 
Dr. Mysorekar argued, that the temple and its vast material resources could fall 
under the control of those who did not have the best interests of the institution 
at heart. Finally, differences between the role and power of temple trustees in 
the Indian context where rules of custom and usage often permit them to obtain 
a share of some of the symbolic and material resources of the temple and the 
American rules where this is not allowed are at the heart of many of the conflicts 
that have taken place in U.S. temples. 


Defining American Religious Identity 


Whether and how religion will be publicly recognized as an actor in the public 
sphere depends largely on the definition of national identity and of citizenship, 
in turn derived from the unique history, of the particular country in question. A 
plethora of studies now document how Muslim groups have received different 
treatment in a variety of European countries for these reasons (AlSayyad and 
Castells 2002; Fetzer and Soper 2005; Hunter 2002; Kepel 1997; Modood 2005; 
Modood and Werbner 1997; Neilsen 1992). For example, Kepel (1997) compares 
the situation in Britain and France. Because of its history as a union of different 
nationalities (English, Scottish, Welsh and Irish), the British state recognizes that 
it is a multinational, multicultural state and thus permits immigrants to maintain 
their own cultural identity and institutions. As a state that has an official church 
(the Anglican church), Britain recognizes and promotes Christianity in the public 
sphere but permits minority religious groups like Muslims to develop their own 
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public presence and leadership. This is in contrast to the situation in France where 
religion is considered to be outside the public sphere and "cultural differences 
are subsumed in the shared membership of the secular nation." (Kepel 1997:109) 
Thus, France has traditionally discouraged the formation of religion based 
communities. However, beginning in the late 1980s, France has been trying to 
unify and organize Muslim religion to create a "French Islam," including attempts 
to organize institutes to train Imams who work in France (Kepel 1997). In the United 
States, the religions of different immigrant groups have been incorporated by a 
process whereby the religious canopy and correspondingly the national religious 
identity of the nation has been gradually expanded. Thus, the immigration of 
Catholics from Europe expanded the religious mainstream from "Protestant" to 
"Christian," and the later immigration of Jews, expanded it further to become 
"Judeo-Christian" (Prothero 2001). In this section, | examine how the post 9/11 
development of an "Abrahamic" religious umbrella pushed Hindu Americans into 
the public sphere to seek recognition for Hinduism as a valued contributor to the 
religious fabric of the United States. 

One strategy that Hindu community leaders have been following for a long 
time is to emulate the model of Jewish Americans. As a highly successful group 
that is integrated into mainstream American society while maintaining its religious 
and cultural distinctness, close community ties and connections with the home 
country, American Jews are viewed as a group that has been able to fit in while . 
remaining different. This is the route to success that Hindu Americans also want to 
adoptin their questto stake a position in American society. The building of temples 
around the country was followed or in some cases, paralleled, by the formation of 
a variety of other types of regional and national umbrella Hindu organizations to 
unify the community, define their identity, and represent their interests. Following 
the pattern of American Jews, one of the first types of organizations that Hindu 
Americans formed explicitly oriented toward the wider American society were 
anti-defamation groups. The American Hindus Against Defamation (AHAD) was 
formed in 1997 and had as its goal, the aggressive defense of Hinduism against 
defamation, commercialization and misuse. The organization was involved in 
several successful protest campaigns against the use of Hindu deities, icons 
and texts by American businesses and the entertainment industry. The success 
of AHAD was followed by the formation of several other anti-defamation groups 
around the country, including the Hindu International Council Against Defamation 
(HICAD) based in New Jersey and the Internet-based India Cause (http://www. 
indiacause.com). 

Other Hindu groups were focused on getting Hinduism publicly acknowledged 
as an American religion at the national level. In September 2000, despite some 
opposition from conservative Christians, Indian American lobby groups were 
able to have a Hindu priest open a Congress session (at which the Indian Prime 
Minister addressed a joint session of the house) for the first time, something that 
was reported with great pride in Indian American newspapers and websites. The 
second indication that Hindu Americans were being recognized by Washington 
came a month later, when President Clinton issued a proclamation from the White 
house wishing Indian Americans a Happy Diwali (an important Hindu festival). 
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In return for contributions from Silicon Valley to the Democratic party for the 
2000 elections, Indian American computer professionals had requested that the 
festival be officially recognized by the White House. The Indian American paper, 
the India Post reported that Indian Americans were jubilant when Clinton issued 
the greeting since this, “is a symbolic gesture that speaks volumes to the fact 
that Indian culture is accepted as part of America's overall fabric.” (Krishnakumar 
and Prashanth 2000:22) 

Particularly beginning in the 1990s, Hindu American leaders also began to 
make a systematic effort to participate in inter-faith and other events where 
they presented Hinduism to the American public in such a way that it fit in with 
Western norms. These efforts focused on three central issues which Americans 
generally tended to view negatively: Hindu conceptions of the divine, the 
nature of the caste system, and the position of women in Hindu society. Many 
American Hindu spokespersons objected to their religion being characterized as 
“polytheistic” and "idol worshiping." They pointed out that although the Hindu 
pantheon consisted of an array of deities, many Hindus believed that all of these 
deities are different forms manifested by one Supreme Being. Thus, they argued 
that Hinduism should be defined as a monotheistic religion. Similarly, most 
American Hindu leaders spoke out against the use of the term "idol" to describe 
Hindu representations of their deities, since it had the negative connotation that 
the worshiper considered the graven image to be divine. They preferred the term 
“icon” or "image" and argued that these images were only to represent the idea 
of the divine and to provide the worshiper with a tangible mental focus. Again, 
Hindu Indian American leaders contested the notion that caste stratification 
was a product of Hinduism by maintaining that the caste system was never 
religiously sanctioned by Hinduism and thus was not central to Hindu practice 
(India Post 1995). Hindu American leaders also challenged American stereotypes 
of Hinduism as a patriarchal religion by arguing that in ancient India, women were 
held in great esteem and that Hinduism gave women and men the same rights. 
The worship of powerful goddesses was presented as further evidence of the 
importance of women within Hinduism. 

The terrorist attacks on the United States of Sept. 11, 2001 brought about a 
change in the religious configuration of the country and resulted in a shift in the 
patterns of activism of Hindu American groups. There has been a lot of literature 
on the impact of 9/11 on Muslims in the United States and around the world but 
little recognition that 9/11 also affected the religious framing of other religious * 
groups, particularly those non-Judeo-Christian groups like Hindus and Sikhs who, 
before and after the attacks were often lumped together with Muslims. In the days 
following 9/11 a number of interfaith services were organized in different parts of 
the country. These services, typically conducted by Protestant ministers, Catholic 
priests and Jewish rabbis, also for the first time included Muslim clerics. Muslim 
spokespersons traveled around the country emphasizing that they were part of the 
same tradition as Christians and Jews, and proclaimed that “we worship the same 
God as you do.” It appeared as though their lobbying yielded immediate results, 
most visibly in the attempt to enlarge the American “Judeo-Christian” family of 
religions into an “Abrahamic” one that included Muslims (because Abraham is 
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acknowledged as the common ancestor in all three religious tradjtions). The need 
to include Muslims, "the fastest growing religious group in the United States," 
within the fabric of American religions had been recognized several years earlier 
by the Clinton administration. But it was only in the wake of 9/11 that this initiative 
bore fruit, with the term "Abrahamic" entering public discourse (see Prothero 2001). 
Several newspapers and magazines carried reports on this development. While 
evangelical Christians protested the inclusion of Muslims under the American 
religious canopy and argued that the concept of Abrahamic was too broad, Hindu 
Americans feared that it was too narrow and would further marginalize non- 
Abrahamic religions like Hinduism (Prothero 2001). 

The Hindu umbrella organization, HICAD, and several hundred individual 
Hindus sent a petition to President George Bush emphasizing that Hindus 
needed to be included within "America's pluralistic and multicultural traditions." 
The petition was to protest the exclusion of Hindus from the national prayer 
service at Washington National Cathedral organized in the wake of the events of 
9/11. The relevant sections are excerpted below. 


Hindus are very much a part of our nation. We are a hard 
working people who contribute to the American society, 
economy, education and quality of life, in a proportion much 
larger than our numbers. . . 


Mr. President, Hindus are a peace-loving people. We never 
threaten violence against our host country There is no 
world-wide Hindu network of terrorists. There are almost 
a billion Hindus living on Earth. They practice the world's 
oldest religion (over 8,000 years old). 


Non-violence, pluralism and respect (not just tolerance) 
of other traditions of worship to the One Almighty God, 
are integral parts of its [Hinduism's] basic tenets. We are a 
family-oriented people, with very low divorce rates. We are 
frugal, save for our children's education, and support our 
elders and extended families. Because of these beliefs, 
Hindu-Americans are called ideal citizens. . 


In the aftermath of the heinous terrorist attacks, Hindu- 
Americans and Hindu places of worship have become the 
target of xenophobic rage in some parts of North America. 
This is because many Americans do not know the difference 
between Hinduism and Islam; they lump them together as 
foreign religions. Your help in bringing the recognition to 
Hindus as a peace-loving people who are an integral part 


of our society, would go a long way in — Americans 
about Hinduism. . .(HICAD 2007). 
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Three aspects of the petition merit analysis. Firstly, it called attention to the fact 
that practitioners of Hinduism are a numerically significant part of American 
society. Secondly, the petition described Hindu Americans as model members 
of the United States and also echoed many of the themes Hindu leaders had 
been promoting (e.g., one god, family values) . Finally, it drew attention to the 
difference between Hinduism and Islam. This was done subtly in the petition, 
butin the original letter on which the petition was based, the writer stressed that 
Bush needed to help educate Americans to the fact that “Hinduism is very, very 
different from Islam . . . the opposite in fact, in many integral ways." 

The emphasis on the distinction between Hinduism and Islam became 
prominent in the post 9/11 public statements of many of the self-styled 
representatives of American Hindus. In a related development, anti-Muslim, 
Hindu nationalist sentiments, which until then had largely been confined to intra- 
group discussions and presentations, publicly emerged in the United States for 
the first time. Many Hindu Indian Americans bombarded their politicians and the 
media with anti-Pakistani and anti-Islamic propaganda, filled with quotes from the 
Koran, and also called into radio and television talk shows to criticize Islam. Others 
spoke up at town meetings to condemn the treatment of minorities in Muslim 
countries and to challenge the positive portrayals of Islam by Muslim speakers. In 
the weeks immediately following 9/11, Rajiv Malhotra of the Infinity Foundation, 
a prominent Hindu American spokesperson, was invited to several universities 
to speak about the unfolding events from a Hindu perspective. In light of the 
September 11th backlash in the United States, "a lot of Hindus suddenly have 
started realizing they better stand up and differentiate themselves from Muslims 
or Arabs," journalist Sarah Wildman quotes Malhotra as saying (Wildman 2001). 
In his talks at American University and Princeton University, he took the offensive 
against Islam, criticizing its leadership of duplicity for projecting a face of peace 
and tolerance in the United States while promoting fundamentalism at home. 
Hindu Americans were also more willing to mobilize in support of Indian and 
Hindu causes in the post 9/11 period. A petition charging CNN with pro-Pakistan 
and anti-Indian bias (based on an article on the Internet magazine, Sulekha.com, 
written by Rajiv Malhotra alleging the same) obtained 55,000 signatures, forcing 
CNN executives to meet with representatives of the Indian community in Atlanta 
in February 2002.7 

Hindu American strategies for recognition in the period before and after 
9/11 are a good case study of how the public activism of immigrant religious 
groups is powerfully shaped by the religio-cultural context of the country and 
official policies. In the pre 9/11 period, Hindus followed the established patterns 
of groups like the Jews — forming anti-defamation groups, lobbying to obtain 
political and social visibility by being recognized by the United States Congress 
and administration, and interpreting and presenting the religion in such a way that 
it fit in with American values and norms. The events of September 11 triggered 
off a process of Hindu American mobilization for a variety of reasons: the need 
of Hindu practitioners to distance themselves from Islam lest they be mistaken 
for Muslims and attacked, fears about the marginalization of Hindus if Muslims 
were also included under the American sacred canopy, resentment and worry 


734 * Social Forces Volume 85, Number 2 * December 2006 


about the sudden American rapprochement with Pakistan, and finally, an attempt 
to exploit the rise of anti-Islamic sentiments in the United States to obtain the 
recognition and support that Hindus craved. 


School Curricula and Religion 


Because the officially defined mission of the educational system is usually 
closely tied to national identity, how non-Christian religious traditions are treated 
in the school curriculum has varied from country to country. For example, Kepel 
(1997:109) points out that France's "republican model of education" which views 
the educational system as a "melting pot" creating common culture and values, 
tries to keep religious education out of the classroom. This is in contrast to the 
British system where religion is taught as part of the regular school curriculum. 
In the United States, battles over the coverage and representation of minority 
groups and cultures in school and university curricula were one of the first spurs 
for the development of multicultural policies (Gutman 1992; Newfield and Gordon 
1996). In response, state education boards have developed new guidelines for 
the treatment of minority groups in school textbooks and have standardized 
procedures to invite public input. A variety of religious groups have formed 
committees to advise education boards on curricula dealing with the particular 
groups they represent such as the Institute for Curriculum Services (a Jewish - 
group set up in 2004) and the California-based Council of Islamic Education (a 
group founded in 1988). Hindu American mobilization against school textbooks 
began in the Fall of 2004, when the school district in Fairfax, Virginia, put forward a 
new set of world history textbooks for public review. Hindu parents mobilized and 
were relatively successful in making some changes in the way Indian history was 
taught in their district. One textbook was rejected and eight others were revised 
(Glod 2005). Based on this success, Hindu American groups decided to organize 
and turn their attention to school textbooks in other regions of the country. 

In the summer of 2005, the California State Board of Education opened up 
their process of textbook review for sixth grade social studies to the public. Like 
other states, the California Department of Education has developed guidelines to 
evaluate the content of school textbooks. Under their Education Code, Sections 
60040 (a) and 60040 (b), "Standards for Evaluating Instructional Materials for 
Social Content (2000 Edition), they outline criteria for the treatment of ethnic 
and cultural groups and of religion (among a long list of other issues). Briefly, the 
criteria indicates that the beliefs and practices of the different groups should be 
presented without bias or prejudice and in such a manner that they instill pride 
in children belonging to the tradition being discussed (http://www.cde.ca.gov/ 
ci/cr/cf/documents/ socialcontent.pdf). Two Hindu American groups - the Vedic 
Foundation based in Austin, Texas, and the Hindu Education Foundation, a group 
comprised of members from around the country and India participated in the 
textbook review process. The efforts of the VF and the HEF were backed by athird 
group, the Hindu American Foundation, a Fremont-based group formed in 2004. 
The VF-HEF combine proposed more than 117 edits to the content on India and 
Hinduism covered by the textbooks. While some of the edits corrected blatant 
errors or gratuitous insults, the changes that would become controversial fell into 
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one of three categories paralleling those that Hindu leaders had been publicizing 
in the United States. Firstly, material referring to the plurality of deities and beliefs 
and forms of worship in Hinduism was redacted, and the texts were revised 
to portray Hinduism as a monotheistic religion based on Vedic texts. Secondly, 
the caste system was dissociated from Hinduism, its hereditary nature was not 
mentioned, and passages describing its oppressive nature were modified. Finally, 
references to patriarchy or the unequal treatment of women were erased. Ninety- 
one of these edits were originally accepted by the Curriculum Commission, and 
these edits were to be presented and ratified at a meeting on Nov. 9, 2005. 
However in early November, an Indian graduate student in California who had 
been approached by the Vedic Foundation for a signature for their petition notified 
Professor Michael Witzel, Professor of Sanskrit and Indian Studies at Harvard 
University and his collaborator, Steve Farmer, about the attempts of the groups 
to rewrite the textbooks. Witzel and Farmer swung into action and publicized 
the matter to other scholars of India via the Internet. On November 8, Witzel 
sent a letter to the California Board of Education with signatures from 46 other 
prominent academics specializing in Indian studies. The letter urged the board to 
reject the controversial edits proposed by the Hindu groups on the basis that they 
were "not of a scholarly but of a religious-political nature . . . primarily promoted 
by Hindutva supporters." Witzel also pointed out that the same revisions that the 
Hindu groups were trying to make in California textbooks had been temporarily 
inserted in textbooks in India when the Indian central government was led by 
a Hindutva party (between 1998 and 2004) and had since been removed when 
that government had been voted out of power. Various Indian American groups — 
those opposed to the efforts of the VF-HEF-HAF (which included some scholars, 
secular Indian Americans and Dalits or lower castes) and those supportive of 
these groups - mobilized their respective constituencies. Through a spate of 
articles on Internet websites, discussion groups, newspapers and magazines, 
both sides tried to get their views heard by a wider public. Articles by secular 
Indian American groups and some scholars meticulously traced and publicized 
the links between the Vedic Foundation, Hindu Education Foundation, Hindu 
American Foundation and Hindutva groups in the United States and India, and 
they criticized the changes as trying to promulgate a sanitized view of history 
and deny oppression (IPAC 2006, Maira and Swamy 2006). Groups supportive of 
the VF, HEF and HAF on the other hand, denounced the scholars, secular Indian 
American and Dalit groups as "anti-Hindu." They argued that the treatment of 
Hinduism in the California textbooks did not comply with the standards set by the 
California State Board of Education and that Hindus were merely demanding that 
Hinduism be treated with the same consideration and respect as other groups 
(Malhotra and Jhunhjunwala 2006). Several of the pro-Hindu writers pointed out 
that Jewish, Muslim, and Christian traditions were presented respectfully (and 
even erroneously), from the point of view of the practitioners of the religion. In 
contrast, they argued that the treatment of Hinduism in the textbooks was so 
biased and focused on the negative that it was causing grievous psychological 
harm to Hindu American children (Agarwal 2006, Venkat 2005). Members of the 
California State Board of Education found themselves caught in the cross-fire 


736 * Social Forces Volume 85, Number 2 * December 2006 


between the two sides. After a series of public and private meetings they finally 
voted to overturn most of the contentious changes proposed by the VF-HEF. In 
response, the Hindu American Foundation and an association of Hindu parents in 
California filed a legal case against the California State Board of Education. 

Several aspects of the case study presented above deserve closer analysis. 
First, research has documented a reciprocal relationship between the rise of 
Hindu Americans as a social and political force in the United States and the 
development of Hindu nationalism (Kurien 2004; Lal 1999; Mathew and Prashad 
2000; Rajagopal 2000). Beginning in the late 1990s (when the Hindu nationalist 
BJP came to power in India) and particularly after Sept. 11, 2001, Hindu nationalist 
leaders and parties in the United States have been emerging in the public sphere 
in the United States as the representatives of all Hindu Americans. In this 
context the California case is significant in that it presents a case study of Hindu 
nationalists being publicly challenged by other groups within the community. 

The tendency for radical groups to represent their communities in the 
diaspora is by no means unique to Hindu Americans. Scholars have noticed this 
tendency among other groups as well. For example, Kepel (1997) argues that 
in Britain, the Islamic group Jam/at-i Islami was able to produce The Muslim 
Guide for Teachers, Employers, Community Workers and Social Administrators 
in Britain and get its “radical Islamist stance" accepted as the "norm accepted. 
by all Muslims” by omitting “any reference to publications which present an 
alternative view of Islam.” (Kepel 1997:122) | have argued that an ironic and 
unintended consequence of multiculturalist policies is that they could promote 
the development of religio-ethnic nationalism among immigrants and result in 
such groups taking a leadership position within the community (Kurien 2004). 
This raises several questions. How can those who are charged with implementing 
multicultural policies adjudicate between different groups claiming to represent 
a community? How can multicultural societies prevent extremist groups from 
being recognized and legitimized as ethnic spokespersons? 

Another issue raised by the California textbook controversy is how school 
officials can deal with the frequent tension between its mission of presenting 
historically accurate material on the one hand, and its mission of enhancing pride 
and self-esteem of children belonging to a variety of traditions on the other. In the 
California case, Hindu groups are not the only ones who have grievances. Some 
parents have been particularly unhappy about the overly positive and “dishonest” 
portrayal of Islam in the textbooks, and have filed complaints charging that 
children were “being proselytized by the state’s public schools on behalf of 
Islam.” (Dreher 2002) The American Textbook Council has also criticized the 
California State Board of Education for its approval of “some shockingly deficient 
U.S. and world history textbooks [which] indicates that all qualitative impulses 
in Sacramento have been subordinated to the power of publisher interests and 
aggressive pressure groups." (http://www.historytextbooks. org/california.htm, 
retrieved Feb. 2, 2006). How these competing interests will be managed by 
education boards around the country in the next few years bears watching. 
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Conclusion 


This article has shown how and why the organization, practice and content of 
Hinduism — from the structure and administration of temples, to the definition of 
Hindu, to the history of Hinduism — has been transformed in the United States as 
a consequence of American religious incorporation policies. Part of this process 
involved bringing about changes in the structure and administration of religious 
organizations and presenting core tenets of the religion so that they are more 
compatible with American culture and society. But as we have seen, becoming 
an American religion and ethnic group has also resulted in contestation over 
what constitutes the authentic Hindu tradition and over who can and should be 
its spokespersons, with different interests drawing on transnational resources 
to legitimize their respective positions. Hindu American challenges to American 
religious incorporation policies, and their internal conflicts which have involved 
American courts and policy makers, are likely to bring about changes in the 
approach and the strategies of American policy makers toward them and toward 
non-Christian groups in general. 

Contemporary immigration patterns have led to the presence of a diversity 
of religions in Western multicultural countries, many of which are based on 
fundamentally different assumptions and practices from the dominant Christian 
tradition. This may mean that policies currently based explicitly or implicitly on the 
Christian model will need to be reformulated. For example, commentators have 
pointed to the need to revisit the public/private and secular/sacred dichotomies 
that have become normative in many European multicultural societies in order 
to accommodate Muslim perspectives on religion (Modood 2005). Almost all 
of the focus on how Western state policies affect contemporary non-Christian 
groups has been on Muslims with the consequence that groups like Hindus have 
been completely ignored. However, such an exclusion is only likely to heighten 
feelings of marginality and powerlessness and exacerbate the turn to extremism. 
If the goal of states is to have policies that accommodate all religious groups, 
then these policies need to be formulated and implemented with more sensitivity 
toward differences between traditions. Again, it is important that policies on the 
formation of religious organizations, on public recognition by the administration, 
and on portrayal in textbooks are equally applied to all. 

Policies aimed at facilitating the integration of immigrants and winning their 
loyalty have focused on the positive acknowledgment of cultural difference. As 
Modood has repeatedly pointed out (1998, 2005) however, religion has been 
the blind spot of multiculturalism. Recent events such as the attacks of 9/11 
and the summer 2005 bombings in London have shown that religious identity 
frequently trumps cultural and national background and that religion as a force of 
community solidarity urgently needs official recognition and attention. Modood 
(1998, 2005) criticizes the tendency of Western states to either identify with a 
particular religion on the one hand or with radical secularism on the other and 
calls for them to become multireligious in addition to multicultural. This has 
been echoed in the United States by Hindu American leaders who demand that 
America be defined as a “multireligious” or “pluralistic” religious country, instead 
of either “Judeo-Christian” or “Abrahamic.” Even if Western states do become 
officially multireligious, the dilemmas faced by contemporary states regarding 
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how to acknowledge differences but still treat groups in similar ways and how 
to deal with the inevitable clashes between groups with very different ideas of 
the social good are not likely to disappear. However, attempts to open dialogue 
between groups, and between groups and the state, will potentially result in the 
development of more areas of shared consensus. 


Notes 


1. For example, according to the U.S. 2000 census, the median household 
income of foreign-born Indians was $68,500 compared to $53,400 for 
native-born whites. Again, 41.9 percent of foreign-born Indian men, and 27.7 
percent of women held postgraduate and professional degrees compared to 
11 percent among native-born white men and 9.6 percent among native-born 
white women (Kibria 2006). 


2. The Britannica Book of the Year and the World Almanac estimates accessed 
July 9, 2004 at: http://www.pluralism.org/resources/statistics/tradition. 
php#Hinduism. 


3. Because these leaders are a self-selected and largely elite group, itis possible 
that their representations of Hinduism may not reflect those of lay Hindu 
Americans. However it is also important to keep in mind that these leaders 
have been successful because they were able to mobilize large numbers of 
Hindu Americans around the country to support the causes they espoused. 


4. Such as the Saiva Siddhanta Church, Chinmaya Mission, Srigiri Peetham of 
Pennsylvania, Barsana Dham of Texas. “Dharma Summit Concludes in New 
Jersey,” Hindu Press International report, Aug. 17, 2005. 


5. Paramacharya Palaniswami at the Dharma Summit. “Dharma Summit 
Concludes in New Jersey,” Hindu Press International report, Aug. 17, 2005. 


6. This is not to say that such conflicts do not take place in India. (In fact, 
there is a long history of conflicts over the issue of trustee succession and 
over the misappropriation of temple resources by trustees). However the 


issues are a little different and the conflicts often play out differently in the 
Indian context. 


7. Accessed May 6, 2002 at:http://www.infinityfoundation.com/mandala/s es/ 
S es rao-r govt.htm. ee 
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Social Policy and Immigrant Joblessness in Britain, 
Germany and Sweden 


Christel Kesler, University of California, Berkeley 


I examine patterns of joblessness among immigrant men and women 
from 33 countries of origin now living in Britain, Germany and 
Sweden. Access to welfare, access to the labor market, job segregation 
and institutional support for women's employment define distinct 
policy configurations in these three destinations. Findings show that 
gaps in joblessness between immigrants and the native-born are larger 
in Sweden than in Britain and Germany, net of human capital, socio- 
demographic characteristics and sending country. However, long-settled 
immigrants have more similar outcomes across the three countries than 
recent newcomers. Also, immigrant women in Sweden work at higher 
absolute rates than immigrant women elsewhere. This suggests that 
gendered employment policies in destination countries are a crucial 
dimension of immigrant economic incorporation. Ultimately, such 
policies also shape the fiscal impact of immigration. 


Many industrialized countries will face financially insolvent welfare states in 
the course of the coming decades, as populations age and ratios of workers to 
pensioners decline. Immigration is an oft-mentioned solution, but its role in this 
policy arena is complex. On the one hand, immigration can and does counteract 
population decline. However, many immigrants, particularly in Western European 
host countries, work at rates much lower than their native-born counterparts due 
to lower labor force participation and higher vulnerability to unemployment and 
underemployment. If immigration is to economically benefit receiving societies, 
these societies must address immigrant joblessness.' 

High levels of joblessness are a concern not only because of the needs of host 
societies’ welfare-state finances. European debates on immigrant integration 
focus on joblessness because of its important implications for poverty and 
material well-being, social integration, and the future fates of immigrant second 
generations and beyond. A recent European Commission report (2004:5) on 
immigrant integration policies underscores this point: “Lack of access to 
employment has been identified as the greatest barrier to integration and thus 
the most important political priority within national integration policies.” (my 
emphasis) 

This analysis considers three countries (Britain, Germany and Sweden). All are 
home to a large foreign-born contingent: 8.3 percent, 12.5 percent and 12 percent 
of the respective host countries’ populations (OECD 2004:41). Comparisons 
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include immigrant groups from 33 sending countries, each present in at least 
two of the three receiving countries. The cross-national comparative design of 
this research serves as an analytic lens to better understand the contexts of 
immigrant joblessness in Western Europe. Empirically, the article has three foci: 
(1) cross-national differences in gaps of joblessness between immigrant groups 
and native-born populations, (2) absolute cross-national differences between 
immigrant jobless rates, and (3) how the effects of individual characteristics 
on joblessness vary across countries and between immigrant and native-born 
populations. 


Motivation and Previous Research 


Despite the everincreasing flows of international migration toward newer 
receiving societies, important theories for understanding immigrant 
employment are still based largely on classic settler societies. For example, 
important explanations for the seeming paradox of proliferating low-skilled 
immigrant work in the advanced post-industrial world consider the features of 
modern service economies, with a particular focus on large urban centers in 
liberal? countries (Sassen 1988, 2001; Waldinger 1996; Waldinger and Lichter 
2003). The explanations vary even within these settings, but when we look at 
a wider range of receiving societies, the very explanandum changes, because 
immigrants have variable rates of joblessness.? Though prominent researchers 
have noted the remarkable variety among advanced postindustrial economies 
(Esping-Andersen 1999; Fligstein 2001), little empirical research has explored 
the implications for immigrant labor-market experiences. 

Comparative research is central to understanding the role of context in 
immigrant economic incorporation. Nevertheless, cross-national comparative 
studies have been rare until recently. Three strands of research have begun to 
remedy this. The first looks at a variety of immigrant groups ina single host country. 
There are a number of excellent recent studies on immigrant economic outcomes 
for the three host countries that are the focus here (Arai and Vilhelmsson 2004; 
Bell 1997; Berthoud 2000; Bevelander 1999, 2005; Granato and Kalter 2001: 
Heath and McMahon 1997; Kogan 2004; le Grand and Szulkin 2002; Modood 
1997; Owen 1997; Scott 1999; Seifert 1996). These studies consistently show 
variation across immigrant groups in each country, net of observed individual-level 
characteristics. There are a variety of explanations for net intergroup differences, 
including unobserved differences in human capital, cultural attitudes toward 
work and discrimination in the host country. It is difficult to adjudicate between 
these with standard statistical data, and a full analysis of intergroup differences 
is beyond the scope of this article, but an important lesson is that one must 
carefully take origins into account in any analysis of destination effects. 

A second approach tracks a single immigrant group across multiple receiving 
societies. Kogan (2003) posits that welfare availability and demand for low- 
skilled workers improve occupational status outcomes but depress labor force 
participation and employment among exYugoslavs in Sweden vs. Austria. 
Lewin-Epstein et al. (2003) attribute higher occupational attainment and higher 
unemployment among immigrants from the former Soviet Union in Canada vs. 
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Israel to Canada's restrictive “point system” that selects for human capital and 
israel's settlement assistance that benefits recent newcomers. Model et al. 
(1999) find that there are surprisingly few destination effects on the economic 
outcomes of Caribbean migrants in Canada, the United States, Britain and France. 
The major downside to such studies looking at a single group is that we do not 
know whether findings can be generalized. A wider range of sending countries 
allows detection of patterns with broader contextual implications. 

The few studies that include multiple immigrant groups and receiving 
societies tend to focus on the liberal societies, perhaps in part because there 
is a comparatively long tradition of immigration research on these countries. 
Certainly, liberal societies are not identical in their institutional configurations. 
One body of research on the United States and the United Kingdom (Model 1997, 
2005; Model and Ladipo 1996) attributes more positive immigrant outcomes in 
the United States to the presence of native-born minorities (African Americans) 
in the lowest position of the labor queue. Reitz (1998) and collaborators (Reitz et 
al. 1999) identify overall earnings inequality as a primary contextual difference 
between Australia, Canada and the United States, and Canada and Germany. 
In countries with more equal earnings, those at the bottom of the distribution 
(disproportionately immigrants) are economically better off. One recent study 
(van Tubergen et al. 2004) includes many sending and receiving countries. This 
study is a major contribution, and its double comparative design is ideally suited 
to test theories about the effects of immigrants' origins and destinations on labor 
force participation and unemployment. Due to its very broad scope, this study 
is not able to devote attention to cross-national differences among the effects 
of individual-level variables (e.g., destination effects on recent newcomers vs. 
longer-settled immigrants). 

| extend these previous studies in several ways. First, unlike much of the 
previous research, | include immigrant men and women with a wide range of 
origins (33 sending countries) in three destinations. Second, | carefully distinguish 
between immigrant/native-born inequality and more general contextual effects 
(those that affect immigrants and the native-born). Finally, | examine the role of 
cross-national variation in the effects of individual-level variables. 


Cross-national Variations 


The discussion below emphasizes, first, variations across host societies that 
could affect observed gaps in joblessness between immigrant groups and 
native-born populations. These fall into several broad categories: variations in (a) 
observed and unobserved characteristics of the immigrants themselves, relative 
to their native-born counterparts, (b) the level of immigrant/native-born inequality 
in access to non-market income sources, (c) the level of immigrant/native-born 
inequality in access to the labor market, and (d) the extent of immigrant/native- 
born complementarity in the labor market. | briefly discuss each of these in turn. 
As relevant, the discussion indicates whether cross-national differences pertain 
to persons with specific individual-level characteristics. 

The second concern of the article is absolute cross-national differences 
between immigrant jobless rates. Note that this is a question of whether native- 
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born baselines in the three countries vary, such that immigrant/native-born gaps 
do not translate directly into absolute differences in immigrant outcomes. In 
this context, the discussion turns to cross-national differences in work/family 
policies, which are generally recognized to boost women's employment rates in 
social democratic countries such as Sweden (Gornick and Meyers 2003; Gornick 
et al. 1998). 

This research design does not permit a definitive disentangling of causal 
mechanisms; its goals are primarily descriptive. Nonetheless, this discussion 
serves to highlight some of the potential causes of cross-national variation, as 
well as the expected direction of effects. 


Language Abilities 


Variation in immigrants' characteristics is perhaps the most obvious explanation 
for cross-national differences in immigrant labor market outcomes. Most 
analyses (including this one) control for important observable characteristics. 
Nonetheless, immigrants' unobservable characteristics can also vary across host 
countries. One important example in this analysis is language ability, unobserved 
here because of data limitations. We might expect many immigrants in Britain 
to have a critical advantage vis-à-vis their counterparts in Sweden and Germany: 
They arrive with English-language ability. English is learned more frequently than 
German or Swedish in most sending countries, regardless of whether English is 
an official language. Thus, it is more likely that migrants to Britain will have overall 
better language abilities upon arrival. This residual effect of Britain's colonial 
history could lead to relatively high employment rates among recent arrivals to 
Britain, but we might also expect a cross-national convergence among longer- 
settled immigrants who have had time to learn German or Swedish. 


Income Inequality, Selectivity and De-commodification 


An important line of work, most clearly articulated by Borjas (1987), has stressed 
the relation between income inequality and selectivity of migration. The basic 
assertion is that higher income inequality attracts more positively selected 
migrants along unmeasured dimensions such as motivation. This reasoning 
suggests that migrants to Britain from a given origin country would be more 
positively selected than those to the other two countries.^ Recent empirical 
findings do show that the ratio between inequality in the sending and receiving 
society is negatively correlated with labor market outcomes (van Tubergen et al. 
2004). Holding origin constant, this implies that migrants to Britain will have more ^ 
positive outcomes. 

Note, however, that it is not possible to disentangle such unobserved 
heterogeneity, purportedly based on income-maximizing strategies prior to 
migration, from the effects of another institutional factor that immigrants need 
not be aware of before moving: the availability of non-market income sources. 
A major way that advanced industrialized societies reduce income inequality 
is through taxes and transfers. An effect of the availability of transfer income 
is empirically indistinguishable from an effect of income inequality on migrant 
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selectivity, at least in conventional data sources. Even with similar unobserved 
characteristics such as motivation, we would expect lower employment rates 
among immigrants in host countries that provide greater access to non-market 
income, because such immigrants are less forced into employment. 

Esping-Andersen (1990:37) calls access to non-market income sources "de- 
commodification," defined more specifically as "the degree to which individuals, 
or families, can uphold a socially acceptable standard of living independently of 
market participation." The marginal effective tax rate (METR) is a rough indicator 
of this concept for the working-aged adults in this analysis. (A METR of 100 
percent implies the financial equivalence of work and non-work — maximum de- 
commodification.) Across a range of family types and circumstances, Sweden 
and Germany actually have similarly high METRs; figures for Britain are lower 
(OECD 2005:Chap. 3). But most important for the discussion here is that both 
Britain and Germany restrict immigrants from receiving non-contributory benefits 
such as social assistance. In Britain, most immigrants have no access to social 
assistance for the first five years of residence, and in Germany, recourse to 
social assistance can jeopardize a non-citizen's right to renew a residence permit 
(Groenendijk et al. 2000). Thus, Sweden has the smallest gap between the welfare 
rights of immigrants and the native-born. 

The link between de-commodification and employment outcomes is empirically 
tenuous (OECD 2005:139). Despite an extremely high level of de-commodification, 
Sweden boasts one of the highest employment rates in the industrialized world. 
Nevertheless, de-commodification might more significantly depress employment 
among immigrants, who face specific barriers to employment. Whether 
through effects on the (unobserved) selectivity of migrant flows or through de- 
commodification, we might expect Sweden's high levels of income redistribution 
to depress immigrant employment relative to the native-born. 


Labor Market Access 


Formal and informal barriers to labor market access for immigrants vary across 
these countries. In terms of formal barriers, historically Germany has had much 
more restrictive citizenship policies than the other two countries, and on top of 
this, citizenship matters more in Germany for employment access. Until the early 
1990s, naturalization was nearly impossible for those without ancestral ties to 
Germany (Diehl and Blohm 2003:141). In the German data used here (from 1996 
and 2000), citizenship among the foreign-born in Germany was still rare. Citizenship 
matters more in Germany, because access to public-sector, civil-servant positions 
(including school teachers, for example), is restricted to E.U. citizens. 

Evidence on a more informal barrier to access, discrimination, also suggests 
that, for immigrants, Germany is the least advantageous of the three destinations. 
A report based on several waves of Eurobarometer data demonstrates that, 
along an array of dimensions, including resistance to diversity, opposition to civil 
rights for minorities, and support for repatriation policies, Germans adopt more 
"ethnically exclusionist" stances than their counterparts in the other two countries 
(Coenders et al. 2003:2-6). However, Brits are not far behind, with Swedes having 
by far the least exclusionist attitudes along most dimensions. 
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But we should be cautious in linking the results of opinion pollato discrimination 
itself. First, these results are for the population as a whole, not for employers. 
Even assuming employers have similar attitudes to the population at large, 
studies have shown that employers' discriminatory behavior can be unrelated 
to stated opinions (Pager and Quillian 2005). Anti-discrimination legislation 
could be important in preventing discriminatory behavior. Britain has the most 
highly developed anti-discrimination legislation in Europe; its first law against 
discrimination in the labor market dates back to the 1970s. Despite progressive 
immigrant-related policies on many fronts, Sweden's anti-discrimination legislation 
has lagged behind many other countries, particularly at the stage of recruitment of 
job candidates (Graham and Soininen 1998). Unlike Sweden and Britain, Germany 
has had no laws explicitly prohibiting discrimination in employment based on 
race, citizenship or origin country. Although E.U. regulations have required all 
member countries to implement laws against discrimination, Germany has failed 
to fully do so (European Commission 2005). 

Taken as a whole, the findings from this review of factors affecting labor market 
access for immigrants suggest that, along every possible dimension, Germany 
is the least advantageous destination country. If public opinions represent the 
level of discrimination in a society well, then Sweden is probably the least 
discriminatory context, but if formal anti-discrimination legislation plays a larger 
role, then Britain could be the least discriminatory context. 


Complementarity and Competition 


If immigrants are not successful in the competition with native-born workers for 
desirable jobs, then the availability of undesirable jobs is important: Are there 
jobs that native-born workers are unlikely to compete for? "Complementarity" in 
the workplace is occupational or sectoral segregation. A recent OECD publication 
provides a proxy for this concept: a standard index of dissimilarity at the level of 
economic sector, indicating how many non-citizen workers would need to be re- 
allocated to achieve parity with citizen workers.® The index value in the late 1990s 
is nearly 20 percent in Germany, 12.6 percent in Britain and only 10 percent in 
Sweden. This is especially a result of the over-representation of foreign workers 
in Germany in the industrial jobs for which they were recruited in the postwar 
period (OECD 2001:174-77). Such jobs have been an immigrant niche in all 
three countries, but Germany has remained more industrial than Britain and 
Sweden in recent decades (OECD 2000:85-7). In short, this evidence suggests 
that immigrants in Britain and Sweden compete more directly with native-born 
workers in the labor market, which might negatively affect probabilities of 
employment (Kogan 2003). 


Work/Family Policies 


One major factor likely to affect differences in the native-born baselines across 
these three countries is policies that allow or encourage women to work. Sweden 
is à world leader in promoting women’s (especially mothers’) employment (Gornick 
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et al. 1998; Stier et al. 2001). For the three countries here, the following policies 
are relevant. Germany alone has joint rather than individual taxation, which creates 
a moderate disincentive for married women's employment (OECD 2005). Other 
policies such as paid parental leave, public childcare and school scheduling aim to 
increase women's labor supply directly. Gornick and Meyers (2003:258) construct 
indexes of such policies. Their findings show that Sweden belongs to the group 
of countries most supportive of employment among women with pre-school and 
school-aged children. Germany, along with other continental European countries, 
takes an intermediate position, while Britain and other liberal countries are the 
least institutionally supportive. Complementing such labor-supply policies is a 
large, public, social-service sector in Sweden, which disproportionately employs 
women. If such cross-national differences have independent causal effects (and 
are not merely an indicator of a particularly gender egalitarian culture among 
native-born Swedes), they should increase employment among immigrant and 
native-born women alike. 


Data 


The analysis uses British Labour Force Surveys (BLFS), the German Mikrozensus 
(MZ) and Swedish Longitudinal Individual Data (LINDA).2 All are nationally 
representative, household-level surveys. 

The BLFS is a quarterly survey of 0.2 percent of the population of Great 
Britain and 0.3 percent of Northern Ireland (Office for National Statistics 2003). 
Addresses are randomly selected from the Postcode Address File in Great Britain 
and the Valuation List in Northern Ireland. The BLFS has a rotating structure; 
households remain in the sample for five consecutive quarters. | select each 
respondent's first quarter of participation and pool data over the period from 
Fall 1996 to Fall 2004 to obtain a sufficiently large sample size. For the first 
quarter of participation, interviews are face-to-face with at least one adult, who 
may provide information on other household members. Although interviewers 
speak only English, they carry written documents explaining the survey in nine 
other languages, and arrange for interpreters as necessary. Response rates have 
ranged from 80 percent to 85 percent in recent years. 

The MZ is an annual household survey; coverage is approximately 1 percent 
of all households (Lechert and Schmidt 2000; Lehnert et al. 2003). The scientific 
use files used in this research are the anonymized 70 percent sub-samples of 
the 1996 and 2000 MZ (ZUMA-Files), and some key information is collected only 
from a 45 percent random sub-sample of the original, so final coverage is slightly 
more than 0.3 percent for each year. The MZ is a single-stage cluster sample (a 
cluster is approximately nine residences) using two sampling frames: the census 
of the population for Western Germany and the population register for Eastern 
Germany, both updated for new housing construction each year. Interviews are 
generally face-to-face; in a minority of randomly selected cases, interviewers 
administer written surveys. The survey instrument is only in German. One adult 
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may provide information on all household members. Each year, 75 percent of 
households from the previous year remain in the sample, but it is impossible to 
track which households carry over in the scientific use file. To attain a larger sample 
size, | pool data from two independent samples (1996 and 2000). Response is 
legally mandatory for most questions. Unit non-response is 3 percent; item non- 
response on voluntary questions reaches 10 percent. 

LINDA combines data from population, tax and employment registers, which 
are linked by individual identity numbers (Edin and Fredriksson 2000). It is based 
on a simple random sample of identity numbers. Everyone in the household of 
each sampled individual is also included. LINDA includes a large over-sample of 
immigrants (20 percent of the total foreign-born population vs. 3.3 percent of the 
native-born). Because LINDA is based on registers rather than a survey, everyone 
sampled is in the data set. Information on how to file tax forms, a primary source 
of LINDA data, is available in 14 major immigrant languages (Skatteverket 2005). 
LINDA is longitudinal, but | use only 2002 data, because there is no comparable 
longitudinal data for the other two countries. 


Variables 


Employment 


The dependent variable for this analysis is work status, a dichotomous variable 
coded as 0 if respondents are jobless, and 1 if they are employed. As the 
combined result of labor force participation rates and unemployment rates 
among the active labor force, employment rates avoid the complicated and 
permeable boundary between inactivity and unemployment. Many active labor- 
market policies have begun to target the inactive as well as the unemployed, 
recognizing this permeability (OECD 2003). 

According to ILO definitions, persons in employment are those who did any 
work for pay or profit during a specified reference week, or were not working 
but had jobs from which they were temporarily absent (European Commission 
2003:9). Unpaid family workers are also included. | alter this definition in two ways 
because of data limitations. First, the reference period is an entire year instead 
of a single week; this liberal definition undoubtedly increases employment rates. 
Second, LINDA contains no information about unpaid family workers, and so 
such persons must be recoded as jobless in the British and German data. 

Information on employment in Sweden is based on income. A person is - 
employed if, during the reference year, s/he had any wage or salary income from x 
an employer, income from self-employment, or losses from self-employment. In 
the MZ and BLFS, a person is employed if s/he reports being employed at any 
time during the previous year.” 


Age and Age Squared 


Persons aged 25 through 59 are included. Age is centered at 40, which is near 
the mean in each country. 
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Education 


The education variable is UNESCO's (United Nations Educational, Scientific, 
and Cultural Organization) ISCED-97 (International Standard Classification of 
Education) (UNESCO 1997). The categories, in their generic formulation, are: 


1. Primary education (or first stage of basic education) 

2. Lower secondary (or second stage of basic education) 

3. (Upper) secondary education 

4. Post-secondary, non-tertiary education 

5. Tertiary education (not leading directly to an advanced 
research qualification) 

6. Tertiary education (leading to an advanced research 


qualification) 


Category 6 is very small, so it is combined with category 5 for the analysis. 
ISCED codes are directly available in LINDA (Statistika centralbyrán 2000); coding 
procedures for BLFS and MZ are based on external documentation (OECD 1999). 


Marital/Partnership Status 


This variable is coded 1 if a person lives with a spouse or domestic partner. | 
often refer to it as marital status even though this is not by legal definition.® 
Among partnered immigrants, an additional variable distinguishes native-born 
and foreign-born spouses. 


Children 


The variable for preschool-aged children is coded 1 if a person lives in a family 
with children under 6. The variable for school-aged children is coded 1 if a person 
lives in a family with children aged 6 to 17. 


Years Since Migration and Years Since Migration Squared 


Because of the yearlong reference period for the employment variable, 
immigrants who have arrived in the year prior to data collection are excluded. The 
analysis is also limited to immigrants who moved as adults (aged 18 and older); an 
analysis of the second and 1.5 generations is beyond the scope of this analysis.* 
This criterion implies that most immigrants attained any primary and secondary 
education in the home country. The range for time since migration, given the 
sample selection based on age, is 1 to 41 years. This variable is centered at 14, 
the approximate mean in each country. 


Origin Country 


Each of the 33 origins groups can be uniquely identified in at least two of the 
three receiving countries. The origin countries include some within the pre- 
2004 "EU-15" (Austria, Britain, France, Germany, Greece, Italy, the Netherlands, 
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Portugal and Spain); other highly developed countries (Australia, Canada, Japan 
and the United States); countries in Eastern Europe (Poland, Romania, the former 
Yugoslavia and the former Soviet Union);'? and countries outside of Europe or 
on Europe's periphery (Algeria, Bangladesh, China, Columbia, Egypt, India, Iran, 
Iraq, Morocco, Pakistan, the Philippines, Somalia, Sri Lanka, Turkey, Uganda and 
Vietnam). | also include a heterogeneous category of all other immigrants. The 
median sample size for immigrant groups (subdivided by country of residence, 
gender and country of origin and not including the “other” category) is 228, with 
a minimum of 30 (women from Algeria in Britain) and a maximum of 15,000 
(women from the former Yugoslavia in Sweden). A native-born comparison group 
provides a baseline in each country." 


Survey Year 


Dummies for survey year capture time-varying economic conditions within a given 
host country, such as the unemployment rate. These survey year effects apply 
only to Germany and Britain. (Swedish data are from 2002 only.) The reference 
year is 2000 for Germany and 2002 for Britain. 


Region 


A region variable ensures that immigrants are compared to native-born persons in ` 
similar labor markets. In Sweden and Germany, the region variable is constructed 
by state (län and Land, respectively). The British regional variable is an aggregation 
of a county/unitary authority-level indicator. In total, there are 21 regions for 
Sweden, 16 for Germany and 20 for Britain. 


Models 


For each country, and for men and women separately, the analysis is based on 
three models of employment probability, formally expressed: 











"| i Lang A 

E 

In E |=04810, 48C, +8,8, 8R 10; 2 
x. 

In , e Jc ««0,-86, 185,8, e rien (3) 


where mj is the probability of employment for the /th individual, O is a vector of 
dummy variables indicating countries of origin, C is a vector of demographic 
and human capital characteristics (age, age squared, education, marital status, 
preschool and school-aged children, years since migration and years since 
migration squared), S is a vector of dummy variables indicating survey year, R is a 
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vector of dummy variables indicating region, and F is a single dummy variable 
indicating whether a respondent belongs to any of the foreign-born groups. The 
excluded categories are native-born, low education (ISCED 1), not living with a 
spouse or partner, no children, survey year 2000 (Germany) and 2002 (Britain), and 
the capital "regions" of Stockholm, Berlin and inner London. Age is centered at 40, 
time since migration is centered at 14 for immigrants, and native-born persons 
are coded 0 on years since migration, origins and spouse nativity variables. 


Results 


The discussion of results proceeds as follows. After a brief look at descriptive 
statistics and immigrant/native-born gaps in employment probabilities within each 
country, the discussion turns to the primary research questions: Does the size of 
the gaps in joblessness between immigrant groups and native-born populations 
vary across countries? Do native-born baselines vary enough that immigrant/native- 
born gaps do not translate into absolute differences in immigrant outcomes? And 
finally, how do the effects of individual characteristics on joblessness vary across 
countries and between immigrant and native-born populations? 


Descriptive Statistics 


Table 1 presents descriptive statistics, by country, gender and nativity. There are ` 
a number of things worth noting. First, there are important nativity and cross- 
national differences in individual demographic and human capital characteristics. 
On average, immigrant men and women are more likely to be married or cohabiting, 
and more likely to have children than their native-born counterparts in all three 
countries. For this reason, the effects of marital status and children are particularly 
important in understanding immigrant/native-born gaps in joblessness. Immigrants 
are more likely to have foreign-born than native-born spouses, and this is similarly 
true in all three countries. Not surprisingly, immigrants are more likely to have 
very low levels of education than are their native-born counterparts in all three 
countries. In most cases, immigrants are also nearly as likely as or more likely than 
the native-born to be found at the high end of the educational spectrum. 

Employment outcomes (and  immigrant/native-born differences in 
employment) also vary cross-nationally. Employment rates among native-born 
men, but especially native-born women, are higher in Sweden than in either 
Germany or Britain. Foreign-born men and women work at lower rates than the 
native-born in all three countries. Partly due to the higher native-born employment 
rates in Sweden, this immigrant/native-born gap in employment among men 
appears to be larger there (15 percentage points) than in Germany and Britain 
(9 and 7 percentage points, respectively). Among women, despite the lower 
native-born bar in Germany and Britain, the gap between native- and foreign- 
born women is more than 15 percentage points in all three countries. Such 
aggregate employment rates are only minimally informative, because we know 
there is a high level of variation among immigrant populations in every country. 
Therefore, the discussion turns to models that take immigrants' countries of 
origin into account. 
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Within-country Origin Effects on Employment 


Gross origin gaps in Table 2 (gaps prior to controlling for individual-level human 
capital and socio-demographic characteristics, see equation 1) are highly variable. 
Many groups of immigrant men (particularly those from within the EU-15 and 
from other industrialized countries) post higher employment rates than native- 
born men, as seen in positive origin effects. However, this is only true in Germany 
and Britain; it is not the case in Sweden, where despite substantial variation, a// 
immigrant groups in the analysis work at lower rates than native-born Swedes. 
Previous research, despite using different models and data, also documents this 
pattern for Sweden (Bevelander 1999). Most groups of immigrant women in all 
three countries have lower gross employment rates than native-born women, as 
seen in negative origin effects. 

After controlling for individual-level characteristics (see equation 2), it is clear 
that most within-country origin gaps remain large. The large net variation across 
countries of origin again highlights the importance of looking at destination effects 
within detailed origin groups. Note that these net origin gaps are those that exist 
at the approximate mean value of years since migration for immigrants, 14. 


Destination Effects 


Table 3 shows that destination effects are distinct and statistically significant. '? 
The figures in this table are cross-national differences in the size of the gap 
between a given immigrant group and otherwise similar native-born persons 
(see equation 2). A positive number for the comparison "Germany vs. Sweden" 
indicates that the given group has a higher relative employment rate in Germany 
than in Sweden. 

Origin gaps in odds of employment are unambiguously larger among men in 
Sweden than among men in Britain and Germany, with only a few exceptions. 
We can see this in the overwhelmingly positive figures in all comparisons of 
"Germany vs. Sweden" and "Britain vs. Sweden." Of 47 explicit cross-national 
comparisons between Sweden and the other two countries, and for the 
heterogeneous group of "other" immigrants, in only one case are the relative 
odds of immigrant employment significantly higher in Sweden. (Immigrant men 
from the former Yugoslavia work at significantly higher relative rates in Sweden 
than in Britain.) For women, the pattern is largely the same. In only two cases are 
origin effects smaller in Sweden than elsewhere (among immigrant women from 
Bangladesh and Somalia in Britain vs. Sweden). For both men and women, origin 
effects are mostly of similar magnitude in Germany and Britain for both men 
and women: that is, the cross-national differences are statistically insignificant. 
The only significant difference is that immigrant men from the former Yugoslavia 
work at higher relative rates in Germany than in Britain. Immigrant women from 
Greece, the former Yugoslavia, Turkey and Vietnam also work at higher relative 
rates in Germany than in Britain. Nonetheless, the overall pattern of differences in 
origin effects between Britain and Germany is not nearly as striking as the pattern 
of Sweden compared to the other countries. 
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Table 3: Cross-national Differences in Origin Effects on Log Odds of Employment 
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Men Women 
Germany Britain Britain Germany Britain Britain 
vs. vs. vs. vs. vs. vs. 

Sweden Sweden Germany Sweden Sweden Germany 
EU-15 
Austria 1.18 .51 
Britain E .76 
France 1.33 1.92 .60 1.00 1.05 .06 
Germany .52 .98 
Greece 2.02 1.38 -.64 2.44 1:16 -1.28 
Italy .85 1.26 A1 1.13 1.01 -.12 
Netherlands 1.40 1.59 .19 .38 .95 -57 
Portugal 1.29 1.53 24 1.16 1.40 m» 
Spain 1.58 .93 -.65 .84 1.14 .30 
Other More Developed Countries 
Australia 2.87 2.01 
Canada 1.79 1.22 
Japan 2.04 -.34 
US 1.30 2.11 81 1.28 1.16 -.12 
Eastern Europe 
FSU .85 1.10 .25 1.13 .91 -.22 
Poland day .97 -.19 .86 i92 -.34 
Romania 10 1.30 
Yugoslavia .62 - 12 -1.34 1.01 22 -.79 
Other Less Developed Countries 
Algeria .20 .59 
Bangladesh .93 -1.25 
China 19 23 
Columbia 14 22 
Egypt 1.30 .54 
India .85 .67 
Iran -.05 AN 22 A1 .34 -.06 
Iraq ./9 .38 


Native-born men and women have higher employment rates in Sweden than 
in the other two countries, so it is worthwhile to look not only at the size of origin 
effects, but also at whether these gaps translate into absolute cross-national 
differences in odds of immigrant employment. We can compare the magnitude of 
cross-national differences for the native-born reference groups to the magnitude 
of cross-national differences in origin gaps. In Table 3, the native-born baselines 
use the approximate mean or mode for general independent variables across 
all countries (age 40, spouse present, no children and education ISCED 3). The 
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Table 3 (continued) 
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Morocco .79 45 -.33 .16 KC) .58 
Pakistan .83 -.13 
Philippines 1.58 .85 

Somalia -.14 -1.39 

Sri Lanka .79 57 

Turkey .50 .63 214 .68 .28 -.40 
Uganda .50 .05 

Vietnam Ti -.11 -.81 1.08 -.32 -1.40 
Other .73 .63 -.10 .88 .59 -.30 
Native-born -.94 -.66 .28 -1.43 -1.20 .23 
Baseline 





Notes: Bold indicates a statistically significant cross-national difference based on a Wald chi- 
square test with 1 degree of freedom (critical value = 3.84, p < .05). Native-born baselines use 
the approximate grand mean or mode for each variable: age 40, spouse present, no children, 
education ISCED 3, capital region, and survey year 2000 (Germany) and 2002 (Britain). Time 
since migration is centered at 14 years (native-born persons coded 0). 


difference in log odds of employment for native-born men is about 0.94 higher 
in Sweden than in Germany. In about half of cases (for 8 of 17 specific origins 
groups), the cross-national differences in effects of origin for men are of larger 
magnitude than this, meaning that these immigrant groups in Germany have not 
only higher re/ative odds of employment, but higher abso/ute odds as well. The 
difference in log odds of employment for native-born men is about 0.66 higher 
in Sweden than in Britain, so 21 groups of immigrant men (of 30) have higher 
absolute odds of employment in Britain. 

For women, the case is considerably more extreme, because native-born 
Swedish women have much higher odds of employment than native-born women 
elsewhere. As a result, the larger origin effects for women in Sweden generally 
do not translate into lower absolute odds of employment for immigrant women in 
Sweden when compared to immigrant women elsewhere, except in a few cases 
(one immigrant group in the Germany/Sweden comparison, and 5 in the Britain/ 
Sweden comparison). On the contrary, immigrant women in Sweden generally have 
considerably higher odds of employment than immigrant women in Germany and 
Britain. This is an important finding, because it suggests that Swedish institutions 
boost employment rates for women regardless of origin. This has the potential to 
counterbalance relatively large origin effects on employment in Sweden. 

In sum, significant destination effects for both men and women are present 
across this very wide range of sending countries. In Sweden, the immigrant/native- 
born differentials in employment are larger than in Britain or Germany among both 
men and women. However, the reference group of native-born women in Sweden 
works at such high rates that, despite the larger origin gaps, immigrant women in 
Sweden are more likely to work than immigrant women elsewhere. 
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Notes: Bold indicates coefficients significantly different from 0, p < .05, two-tailed test. Reference categories for native-born baselines are single, no chil- 
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.07 


1.83 


Native-born baseline 


dren present, ISCED 1, capital region, and survey year 2000 (Germany) and 2002 (Britain). Age is centered at 40. Time since migration is centered at 14 


years (native-born persons coded 0). Models also include origin effects. 
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Variations in Human Capital and 
Socio-demographic Effects 


The final issue explored here is how human capital 
and socio-demographic effects vary across countries 
and for native-born and immigrant populations. 
This analysis answers the questions: What are 
the human capital and demographic profiles of 
immigrants with particularly severe origin penalties? 
How do cross-national variations in the effects 
of these characteristics shape both absolute and 
relative immigrant employment rates? The selected 
coefficients in Table 4 are based on equation 3, 
allowing the effects of demographic and human 
capital characteristics to vary by country, gender 
and nativity. 

| focus first on family characteristics. Destination 
effects (Sweden's larger origin gaps) are larger 
among men who are married and have children than 
among single men and men without children. More 
specifically, immigrant men's employment rates are 
affected less positively by being married or having 
children than native-born men's employment rates. 
We see this in the interaction effects between 
foreign-born status and the family variables, which 
are negative in almost all cases, but generally less 
negative in Germany and Britain. Immigrant men 
who are married to immigrant women (the majority 
of married immigrant men, see Table 1) have much 
lower relative employment rates in Sweden than in 
the other two countries. 

Family characteristics are, not surprisingly, a major 
part of the explanation for high women's employment 
in Sweden. In fact, single women without children 
actually have higher employment rates in Germany 
than in Sweden (see native-born baseline for women). 
However, among married women and especially 
women with children, Sweden emerges as the clear 
leader in facilitating employment for both native- 
born and foreign-born women. That said, we can see 
in the sizable negative interaction effects between 
nativity and family characteristics in Sweden that 
institutions that promote women's employment are 
less effective for immigrant women. This is true in 
part because Swedish-born women experience 
no negative effects of being married or having 
children on their employment rates. Nonetheless, 
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family effects in Sweden are remarkably small even for immigrant women, when 
compared to family effects in Britain and Germany. 

Variations in education also affect the magnitude of origin gaps in employment 
in all three countries. Origin gaps increase at the upper end of the educational 
spectrum, because native-born employment probabilities rise more with 
increasing education than do immigrant employment probabilities. Recall that 
immigrants in the sample arrived in the host country as adults, so primary and 
secondary education was likely completed in the home country, a probable 
explanation for such gaps.'? The magnitude of destination effects also varies 
across the educational spectrum. Immigrant men in Germany with higher 
education (ISCED 5/6) experience a larger devaluation of their credentials than 
do their counterparts in Britain or Sweden. For women, the effects of education 
on cross-national differences in origin gaps are uniform: The difference between 
Sweden and the other two countries is largest at the high end of the educational 
spectrum. 

Figure 1 shows how time since migration changes destination effects. 
(Destination effects on origin gaps presented in previous sections were from 
models in which time since migration was fixed at 14 years.) Not surprisingly, 
immigrants’ employment rates increase with time in the host country, such 
that origin gaps in employment are largest for recently arrived immigrants. 
More interestingly, cross-national patterns also vary, depending on how long- 
immigrants have been in the country. For both men and women, the employment 
integration process is quicker in Sweden than in the other two countries, and this 
cross-national difference is statistically significant for both men and women.'4 
This means that, among those who have been in their host countries less 
than 14 years, the cross-national patterns are even more extreme than those 
presented in previous sections. For longer-settled immigrants, the cross-national 
differences narrow. Although the initial disadvantage of settling in Sweden does 
not disappear for the vast majority of immigrant groups, even after several 
decades of residence, the issue of immigrant joblessness in Sweden is clearly 
most pressing among recent newcomers. 


Discussion 


The analyses show that destination countries matter considerably for immigrant 
employment outcomes. Expanding on previous studies, | have considered 
employment rates for a very wide range of specific immigrant origins groups in 
three host countries. The first major finding is that origin gaps in employment are 
largest in Sweden relative to Britain and Germany. This is true even after controlling 
for individual-level human capital and socio-demographic characteristics, and it is 
true within the vast majority of detailed origins groups. The second major finding 
is that, despite Sweden’s larger origin gaps, immigrant women also benefit as 
women (or more specifically, as wives and as mothers) from Sweden's institutional 
support for women's employment. That is, immigrant women in Sweden work at 
much higher rates than their immigrant counterparts elsewhere, even though they 
work at rates far lower than the extremely high rates of Swedish-born women. | 

address each of these two findings in turn. | 


Social Policy and Immigrant Joblessness ° 763 


Figure 1. Effects of Time Since Migration on Immigrant Employment 
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Note: Effects are fixed at 0 for years = 14. 
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This research's major contribution lies in describing the contours of cross- 
national variation. Given this research design, itis clearly not possible to definitively 
determine the causal mechanisms of the observed cross-national differences. 
Nevertheless, we can return to the cross-national variations discussed above to 
consider likely explanations. 

Why the significantly larger origin gaps in Sweden? Compared to immigrants 
in Britain, immigrants in Sweden face several disadvantages. First, they are less 
likely to arrive with knowledge of the host country language, since Swedish is 
rarely used outside of Sweden. Second, anti-discrimination law is somewhat 
more established in Britain, which could mean that immigrants to Britain face 
less discrimination from employers, despite more ethnically exclusive public 
opinion than in Sweden. Both of these differences suggest that immigrants have 
more limited access to jobs in Sweden than in Britain. 

Based on institutional differences, we have no reason to believe that 
immigrants in Germany are more successful than in Sweden at competing with 
the native-born for jobs. In fact, immigrants to Germany face particularly severe 
barriers to access. Nonetheless, immigrants to Germany are also less likely to 
directly compete with the native-born for jobs, as evidenced by higher levels of 
sectoral segregation. Although origin gaps are generally much larger in Sweden, 
this pattern is more pronounced at the lower end of the educational spectrum 
for men. This could be because many low-skilled immigrant men in Germany 
continue to hold the manufacturing jobs for which they were recruited in the 
postwar period, and for that reason face joblessness less often than low-skilled 
immigrant men in Sweden. 

Perhaps the most salient difference between Germany and Sweden is in the 
range of options if desirable employment is not to be found. If the jobs available 
to immigrants in Sweden are undesirable, they can opt, at least temporarily, not 
to work. Because immigrants to Germany risk their residence status if they turn to 
social assistance, employment is forced to a considerably higher degree. Although 
we have reason to believe that access to desirable jobs is greater in Britain than 
in Sweden, Britain's low levels of social welfare, particularly for immigrants, 
mean that employment forcing is likely there as well. Note that the finding of 
larger origin gaps in Sweden is also consistent with the hypothesis of selectivity 
of migration flows based on income inequality. It is possible that migrants to 
Sweden are more negatively selected precisely because they are aware, prior to 
migration, of the low income inequality and high de-commodification in Sweden. 
But even if migration itself is driven predominantly by non-economic factors such 
as family unification, social networks and political upheaval, de-commodification 
could depress immigrant employment in Sweden. 

Relatively high differentials between immigrant and native-born joblessness 
represent an important challenge to Sweden's model of maximum employment 
which has been so successful at integrating marginalized groups into the labor 
force. However, it is also important to understand the ways in which other 
axes of inequality affect immigrant experiences. Because immigrant families 
are responsible for raising a disproportionate number of children in all three ` 
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of these countries, work and family policies are an important immigrant 
issue. Such policies appear to substantially increase women's employment 
in Sweden, not only among the native-born (a well-known finding) but also 
among immigrant women. These results further strengthen the claim that such 
policies independently affect women's employment and do not merely reflect 
a culturally specific preference for employment among Swedish women. For all 
receiving countries, the strategic promotion of women's labor force participation 
is promising for increasing native-born and immigrant employment rates and 
for preserving a fiscally sound welfare state. 


Notes 


1. “Jobless” refers to a characteristic of persons who are either unemployed or 
out of the labor force and therefore not working. See discussion of variables 
for complete definition. 


2. lamusing "liberal" in the economic sense: a high commitment to free market 
principles in spheres such as the labor market and the welfare state. 


3. In fact, as Waldinger (2001) notes, even within the United States (the focus 
of much of this work) the paradox of high immigrant employment rates is 
implicitly gendered: It applies much more to men than to women. 


4. Gini coefficients for Britain, Germany and Sweden were 36, 28.3 and 25 in 
1999-2000 (World Bank 2005). 


5. The figures are unfortunately by citizenship, not by nativity; this is typical 
of most official cross-national reports. These are probably conservative 
estimates of cross-national differences because non-citizens in Britain and 
Sweden are likely to be more negatively selected from among all immigrants 
(and less likely to work in the same sectors as the native-born) than non- 
citizens in Germany. 


6. Ithank Statistika centralbyrán (SCB) in Orebro and the Zentrum für Umfragen, 
Methoden und Analysen (ZUMA) in Mannheim for allowing me to work with 
LINDA and the MZ, respectively, on site at their facilities during visits in 2004. 
Due to legal regulations with these official micro-level data, | can only access 
them in-country. 


7. Identicaldefinitionsofemploymentwouldobviouslybepreferable.lhavechosen 
LINDA over the Swedish labor force survey (Arbetskraftsundersókningarna), 
which would have a more comparable employment variable, because of 
LINDAS far superior sample size and household-level design, very important 
for parts of the larger project from which this article is drawn. 


8. IntheSwedish data, cohabiting couples can only be identified if they are either 
officially registered or have children in common. This probably accounts for 
the somewhat lower partnership rates in Sweden when compared to the 


other two countries. 
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9. The second generation (native-born children of immigrants) is included in the 
native-born category because they are not uniformly identifiable in the data. 


10. Among immigrants who arrived in their host countries when the Soviet Union 
and Yugoslavia were still unified nation states, the contemporary names of 
places of origin are often not available in the data. In other cases, the data 
distributors have aggregated these countries of origin. 


11. A drawback of the German data is that among foreign-born German citizens, 
one cannot determine country of birth. Only beginning in 1999 (i.e., in the 
2000 data but not in the 1996 data) can one identify who is foreign-born 
among current German citizens. The native-born comparison group and the 
group of foreign-born German citizens (included in the "other" category) 
are therefore drawn only from the sub-sample of 2000 data and weighted 
accordingly. These limitations affect results much less than they would in 
other receiving countries. Most foreign-born German citizens are foreign- 
born "ethnic" Germans from Eastern Europe and the former Soviet Union. 
The bias of excluding naturalized immigrants is thus minimal — but growing 
— because of recent changes to German citizenship laws. The MZ's sample 
size and response rate are far superior to those of other available surveys, 
and help outweigh such limitations. 


12. Itestthe statistical significance of destination effects (the effect of destination 
country on the size of a specific origin gap) with Wald chi-square statistics. 
Allison (1999) argues that Wald statistics are problematic for comparing logit 
coefficients across groups, given unequal residual variation. Unfortunately, | 
cannot use the method he proposes because it assumes that the researcher 
can work with data for all populations simultaneously; | could not use the 
LINDA and MZ data outside of secure facilities in Sweden and Germany. 
Nonetheless, Allison's critique rests on the assumption that a binary variable 
(here, employment) is in fact an indicator of a latent continuous variable, 
“employability.” One need not assume the presence of a latent continuous 
variable, if one’s goal is to descriptively assess employment. 


13. About 70 percent of immigrant men and women with higher education (ISCED 
5/6) in Sweden and Germany arrived when they were older than 25, an age 
at which they are likely to have already completed education. In Britain, the 
figure is only about 50 percent; this difference could help explain a lower 
devaluation of university credentials in Britain. 


14. Based on Wald chi-square tests (with one degree of freedom and a critical 
value of 3.84), the linear term for years since migration is significantly more 
positive in Sweden than in the other two countries. Cross-national differences 
in the quadratic terms are mostly insignificant. 
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Environmental Racial Inequality in Detroit 


Liam Downey, University of Colorado 


This study uses industrial pollution data from the Environmental 
Protection Agency's Toxics Release Inventory (TRI) and tract-level 
demographic data from the 2000 U.S. census to determine whether 
environmental racial inequality existed in the Detroit metropolitan 
area in the year 2000. This study differs from prior environmental 
inequality research in two important ways. First, it offers a positive 
rationale for using hazard proximity indicators. Second, it uses a 
distance decay modeling technique to estimate hazard proximity. This 
technique weights each hazard's estimated negative effect by distance 
such that the estimated negative effect declines continuously as distance 
from the hazard increases, thus providing more accurate estimates of 
proximity-based environmental risk than can be obtained using other 
variable construction techniques currently found in the literature. 
Using this technique, I find that Detroit's black neighborhoods were 
disproportionately burdened by TRI facility activity in 2000 and that 
neighborhood racial composition had a strong independent effect on 
neighborhood proximity to TRI activity. 


Introduction 


Over the past couple of decades, academic interest in environmental inequality 
has grown dramatically, resulting in the development of a large and expanding 
body of research that has attempted to determine whether minority and low 
income groups are disproportionately burdened by environmental hazards 
(Bowen 2002; Pastor, Sadd and Morello-Frosch 2002; Szasz and Meuser 1997). 
Although the methodological techniques researchers have used to make this 
determination have improved considerably over time, researchers still face 
several critical methodological challenges (Bowen 2002; Chakraborty and 
Armstrong 2001; Mennis 2002). One of the most important of these challenges 
is the problem of properly measuring residential proximity and exposure to 
environmental hazards and industrial pollution (Bowen 2002; Downey 2003; Liu 
2001; Mohai and Saha 2006). 

The problem facing researchers is two-fold. First, it is likely that the strength 
of an environmental hazard's negative effects declines continuously as distance 
from the hazard increases (Liu 2001). Thus, if researchers want to accurately 
measure environmental inequality, they must be able to model this continuous 
negative effect. Second, data limitations force most environmental inequality 
researchers to use demographic data that are tied to areal units of analysis such 
as census tracts and zip codes (Downey 2003). This is problematic because the 
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boundaries used to create these areal units of analysis, although^not designed 
arbitrarily, are arbitrary with respect to the distribution of environmental hazards. 
As a result, environmental hazards are often located near the physical boundaries 
of areal units of analysis, sometimes closer to neighboring units of analysis than 
to the far side of their host unit (Downey 2003). 

This presents researchers with a real dilemma. Should they assume, as others 
have done, that an environmental hazard's negative effects are confined solely to 
and distributed evenly throughout its host unit of analysis (Downey 2003; Mohai: 
and Saha 2006)? Or should they assume that environmental hazards negatively 
affect both host and non-host units? More importantly, because it is quite likely 
that environmental hazards’ negative effects spread unevenly across both host 
and non-host units, how should they measure these uneven effects? And once 
measured how should they tie their measurements back to their demographic 
units of analysis? 

This article provides some new answers to these questions by using a distance 
decay modeling technique borrowed from geographic information systems (GIS) 
analysts, industrial pollution data drawn from the Environmental Protection 
Agency's (EPA) Toxics Release Inventory (TRI), and tract-level demographic data 
drawn from the 2000 U.S. Census to determine whether environmental racial 
inequality existed in the Detroit metropolitan area in the year 2000. 

| begin by describing the distance decay modeling technique and justifying its 
use. | then run a set of regression analyses that allows me to determine whether 
the distance decay estimates produced by the GIS modeling technique are 
positively associated with the percentages of Hispanics and non-Hispanic blacks 
in a census tract. Finally, | use these regression results to graph the associations 
that exist between the distance decay variables on the one hand and percent non- 
Hispanic black on the other (percent Hispanic is not positively associated with 
any of the distance decay variables). This allows me to draw more substantively 
meaningful conclusions about environmental racial inequality in Detroit than | 
otherwise could. 

Before proceeding, | should note that the distance decay technique used 
in this article has two important limitations. First, it cannot overcome the 
aforementioned problem that environmental inequality researchers are generally 
forced to use areal units of analysis such as census tracts and zip codes. Thus, 
most researchers using this technique will have to aggregate their distance 
decay data to the areal-unit level in order to merge it with their demographic data. 
Second, because researchers do not know the actual rate at which environmental 
hazards’ negative effects decline (their distance decay rate), the mathematical 
functions used to calculate distance decay are based on assumptions about the 
distance decay process rather than on precise knowledge of the process. 

Although these are important limitations, they are not as serious as they 
first appear. This is because the first limitation results from a lack of address- 
specific, individual-level, demographic data, and not from any problem with 
the technique, and the second limitation results from the state of academic 
knowledge regarding distance decay. As a result, the first limitation can be 
overcome with better demographic data, and the second limitation will become 
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less and less problematic as academic knowledge regarding the distance decay 
process improves. 


Literature Review 


Environmental inequality researchers have studied the distribution of social 
groupsaround a variety of environmental hazards including hazardous waste sites, 
manufacturing facilities, superfund sites, chemical accidents and air pollutants 
(Bowen 2002; Derezinski, Lacy and Stretesky 2003; Morello-Frosch, Pastor and 
Sadd 2001; Szasz and Meuser 1997). Most studies support the hypothesis that 
lower income and minority neighborhoods are disproportionately burdened by 
environmental hazards. However, results have not been entirely consistent (Bowen 
2002; Pastor, Sadd and Hipp 2001). For example, although most researchers have 
found evidence of income- and/or poverty-based environmental inequality (Ash 
and Fetter 2004; Derezinski, Lacy and Stretesky 2003, Downey 2003; Morello- 
Frosch, Pastor and Sadd 2001), some have not (Anderton et al. 1994; Bowen et 
al. 1995; Brown, Ciambrone and Hunter 1997). 

Similarly, although many studies have found strong evidence of environmental 
racial inequality (Hamilton 1995; Krieg 1995; Mohai and Bryant 1992; Morello- 
Frosch, Pastor and Sadd 2001; Stretesky and Hogan 1998; Stretesky and Lynch 
2002), some have found evidence of environmental racial inequality for some 
minority groups but not others (Brown, Ciambrone and Hunter 1997; Pastor, 
Sadd and Morello-Frosch 2002; Sadd et al. 1999), and some have found only 
weak evidence of environmental racial inequality, inconsistent evidence, or none 
at all (Anderton et al. 1994; Atlas 2002; Bowen et al. 1995; Derezinski, Lacy and 
Stretesky 2003; Oakes, Anderton and Anderson 1996; Yandle and Burton 1996). 

Studies have also varied according to whether they use a local (Bullard 1983; 
Mohai and Bryant 1992), regional, (Bowen et al. 1995; Downey 2003, 2005; 
Sadd et al. 1999), or national sample (Ash and Fetter 2004; Derezinski, Lacy and 
Stretesky 2003; Oakes, Anderton and Anderson 1996). Findings from local and 
regional studies suggest that patterns of environmental inequality vary from one 
locality and region to another (Bowen 2002), with studies of western metropolitan 
areas (Downey 2006; Brown, Ciambrone and Hunter 1997; Pastor, Sadd and Hipp 
2001; Pulido 2000) generally finding stronger evidence of environmental racial 
inequality than studies of rust belt cities or cities in the northeast and mid-Atlantic 
regions (Downey 2006; Bowen et al. 1995; Brown, Ciambrone and Hunter 1997; 
Downey 2005). 


Proximity Indicators 


Most environmental inequality researchers use residential proximity to 
environmental hazards, rather than exposure or risk, to measure environmental 
inequality (Bowen 2002; Sadd et al. 1999). Researchers use residential proximity 
data rather than exposure or risk data because exposure data are very difficult to 
obtain, and until recently air pollutant concentration data, which can be used to 
estimate certain kinds of health risks, have not been readily available (Ash and 
Fetter 2004: Morello-Frosch, Pastor and Sadd 2001). 
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The field's reliance on proximity data has been heavily criticized? For example, 
Bowen (2002) argues that lack of exposure and risk data seriously undermines the 
quality and usefulness of environmental inequality research because it prevents 
researchers from linking environmental hazards to specific public health outcomes 
in specific communities. This is problematic because it prevents researchers from 
determining whether environmental inequality exists and whether exposure to 
environmental pollutants has negative and inequitable public health outcomes. 

Other scholars argue that lack of exposure and risk data, while problematic, is not 
nearly as serious as Bowen claims. Sadd et al. (1999:109) note, for example, that: 


Several epidemiological studies have... demonstrated a 
significant relationship between residential proximity to 
urban toxic substances and/or air release facilities, and 
increased health risk and disease incidence, especially 
among pregnant women and infants. 


Moreover, empirical evidence suggests that environmental hazards also 
negatively affect nearby property values, beliefs about local health risks, 
psychological stress, local employment opportunities, sense of community, 
and local economic activity (Downey and Van Willigen 2005; Liu 2001; Mohai 
1995; Sadd et al. 1999), outcomes that are less likely to be affected by chemical 
exposure than they are to be affected by residential proximity to environmentally 
hazardous facilities, the size and visibility of environmentally hazardous facilities, 
and perceptions of facility safety and neighborhood desirability. Thus, it is likely 
that in many cases proximity measures are more valid indicators than chemical 
exposure of environmental inequality. 

This is not to say that exposure and risk data are unimportant or that the field 
would not benefit greatly from an increase in their use. It is simply to say that 
the field would also benefit greatly from improved proximity measures that take 
distance to hazardous facilities into account and, when possible, the size and 
visibility of these facilities and the social stigma that is attached to them. 


Measuring Proximity and Risk 


Environmental inequality researchers have employed several strategies to 
measure residential proximity to environmental hazards and, when possible, 
to estimate environmental hazards' potential health risks. These strategies fall 
into three broad categories, the unit-hazard coincidence method, the buffer 
method and pollution plume modeling. The term unithazard coincidence was 
coined by Mohai and Saha (2006) to describe the most commonly used method 
of measuring residential proximity to environmental hazards. Researchers using 
this method locate environmental hazards on a map and (1) sum the number of 
hazards located in each of their study area analysis units, (2) sum the pounds 
of pollutants emitted in each of their study area analysis units, or (3) create a 
dummy variable that indicates whether or not an analysis unit contains a hazard. 
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negative effects are confined solely to and distributed evenly throughout its host 
analysis unit. Figure 1 illustrates the problematic nature of these assumptions. 
Figure 1 examines the distribution of Toxics Release Inventory (TRI) facilities in 
a subset of Detroit metropolitan area census tracts in the year 2000. TRI facilities 
are industrial facilities that manufacture, process or otherwise use specified 
toxic chemicals in specified quantities, and are required to report this use to 
the Environmental Protection Agency on an annual basis. (A more complete 
description of the TRI database can be found below.) Not only is it quite evident 
that TRI facilities are distributed unevenly within census tracts, it is also evident 
that TRI facilities are often located near the boundaries of multiple census tracts, 
in many cases closer to adjacent census tracts than to the far end of their host 
census tract. (Maps not shown here demonstrate that the same basic patterns 
hold for the region's hazardous waste facilities and National Priority List sites.). 
Given the spatial distribution of these TRI facilities, it appears quite unlikely 
that their negative effects are confined solely to their host analysis units. It is 
also quite unlikely that the strength of their negative effects remains constant 
as the distance from each facility increases (Liu 2001; Pollock and Vittas 1995). 
Nevertheless, the unit-hazard coincidence method assumes both these things.' 


The Buffer Method 


Researchers using the buffer method to determine the geographic area and 
population affected by some set of environmental hazards locate these hazards 
on a map and then construct circular buffers around each hazard. These buffers, 
which are usually the same size for each facility? are matched to areal units of 
analysis (such as census tracts) in a variety of ways. One approach is to define 
as an "affected analysis unit” any analysis unit that has at least 50 percent of its 
area encompassed by the buffer. Other approaches include defining an analysis 
unit as an "affected unit" if that unit's centroid is encompassed by the buffer or if 
the buffer touches or in any way covers a portion of the analysis unit (Chakraborty 
and Armstrong 1997; Mohai and Saha 2006). Regardless of which approach is 
taken, they all consider any individual who lives in an affected analysis unit to be 
residentially proximate to all the hazards affecting that unit. 

A fourth approach associated with the buffer method, the area/ apportionment 
method, calculates the proportion of each analysis unit that is actually 
encompassed by a buffer and then assigns that proportion of the analysis unit's 
population to the buffer (Mohai and Saha 2006). For example, if 22 percent of an 
analysis unit's area is encompassed by a buffer, then 22 percent of that analysis 
unit's population is considered to be affected, and 78 percent to be unaffected, 
by that buffer's hazard. 

The buffer method offers some important advantages over the unit-hazard 
coincidence method. Most importantly, the buffer method does not assume 
that environmental hazards’ negative effects are confined solely to host 
analysis units. In addition, the area/ apportionment method does not assume 
that environmental hazards’ negative effects are distributed evenly within 
analysis units. Nevertheless, all buffer methods assume that the strength of 
a hazard's negative effects remains constant within the circular buffer drawn 
around the hazard. 
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Pollution Plume Modeling 


Pollution plume modeling techniques, such as those used to derive the data 
employed in Ash and Fetter (2004) and Morello-Frosch, Pastor and Sadd (2001), 
do not make this problematic assumption. These studies use air pollutant 
concentration and toxicity data drawn from the Environmental Protection 
Agency's (EPA) Cumulative Exposure Project (CEP: Morello-Frosch, Pastor and 
Sadd) and Risk-Screening Environmental Indicator's project (RSEI: Ash and Fetter) 
to estimate health risk scores for each analysis unit in their respective datasets. 

These data are unique in environmental inequality research, not only because 
they allow researchers to estimate the potential health risks associated with 
specific environmental hazards and specific analysis units, but also because 
the plume modeling techniques used to derive these data take into account 
factors such as wind speed, wind direction, air turbulence, smokestack height 
and the rate of chemical decay and deposition (Ash and Fetter 2004). As a 
result, these modeling techniques allow the concentration of air pollutants and, 
therefore, the estimated health risks associated with these air pollutants to (1) 
decline continuously as distance from the emitting source increases and (2) vary 
according to compass direction. In addition, because the pollution plumes used 
to derive the risk estimates can extend for miles in any direction (up to 44 miles 
in the RSEI model), this modeling technique allows hazards and emissions in one 
analysis unit to affect people living in other analysis units. 

This is a clear improvement in many respects over the unit-hazard coincidence 
and buffer modeling approaches. Nevertheless, the plume modeling approach 
is not without its own set of limitations. First, the health risks associated with 
pollution exposure are not the only set of risks associated with environmental 
hazards. As noted above, environmental hazard presence can also negatively 
affect nearby property values, perceptions of local health risks, psychological 
stress, local employment opportunities, sense of community and local economic 
activity. For researchers interested in these potential negative impacts, plume 
modeling data are clearly inappropriate. 

Second, creating plume modeling datasets for large geographic areas is a 
time consuming and expensive process. As a result, there are few large-scale 
plume model datasets and those that exist are limited to specific sets of hazards. 
Third, because of the difficulty inherent in estimating plume models for hundreds 
of thousands of releases across the entire United States, the plume modeling 
techniques used in Ash and Fetter (2004) and Morello-Frosch, Pastor and Sadd 
(2001) make several necessary but problematic assumptions. For example, 
each facility in the RSEI database is given a single smokestack height estimate. 
However, many industrial facilities have multiple smokestacks of varying height, 
smokestack height estimates are often based on the median smokestack height 
for an entire industry (based on the facility's three-digit SIC code), and in the RSEI 
model "stack height has the greatest impact on predicted concentrations of air 
pollutants." (Bouwes and Hassur 1999:ii) Moreover, the RSEI model assumes 
constant emissions rates and uses chemical decay estimates that are not 


necessarily accurate. 
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Thus, although plume modeling data represent a significant improvement in 
many respects over unit-hazard coincidence and buffer analysis data, such data 
are not as accurate as many researchers might think. More importantly, their use 
is limited to specific research questions (those having to do with specific public 
health risks) and specific hazards (those covered by the plume models). 

Thus, in the next section of the article, | describe a GIS technique for creating 
hazard proximity indicators that can be used to model environmental hazards’ 
non-exposure related negative effects. This technique takes into account the 
location and distribution of hazards within analysis units, allows the strength of 
an environmental hazard's negative effects to decline continuously as distance 
from the hazard increases, and permits hazards and emissions in one analysis 
unit to affect people living in other analysis units. 

As noted earlier, this technique has some important limitations, and it is by no 
means a substitute for pollution plume modeling. Instead, it should be viewed as 
an important complement to the buffer analysis and plume modeling techniques 
already found in the literature. 


Distance Decay Modeling Using GIS 


A GIS is a software package that unites spatial data, such as the /ocation of 
factories and census tracts, with data about the features making up the spatial 
database, such as the number of people living in each census tract or the pounds 
of pollutants emitted from each factory. In a GIS, data are stored as map layers 
that can be precisely positioned on top of each other. There are two basic types of 
map layers in a GIS: vector map layers and raster map layers. A vector map uses 
points, lines and polygons to represent physical features such as cities, rivers 
and state boundaries. (Vector maps are what most people think of when they 
think of maps.) A raster map stores and displays spatially referenced numeric 
data in rectangular grids composed of square cells that are described in terms 
of resolution. For example, a 25-foot resolution raster map contains square grid 
cells with sides that are 25 feet long. 

In order to create the negative effects indicators used in this article, industrial 
facilities from the 2000 Toxics Release Inventory were first located on a Detroit 
metropolitan area vector map, and then three, 105.6-foot resolution raster grids 
were calculated for each facility (105.6 feet is 1/50th of a mile). The first grid 
calculated the distance from the center of each cell in the metropolitan area to 
the center of the cell containing that grid's TRI facility. The second grid was a 
weighting grid that provided, for each metropolitan area grid cell, a weight (w, 
where 0 < w < 1) that indicated the relative strength of the facility's potential 
negative effect on that cell. This grid was calculated by inserting the distance 
values from the first grid into a distance decay function such as the following: 


(1a) F(w) = 
(1b) F(w) 


1-—15.0 * d * 109)—(5.360422.* q** 103) for Dessus 200 
0 for d 213,200, 


Il 


where d equals distance in feet from the TRI facility (this and other distance 
decay functions are employed in the analyses presented below). 


Environmental Racial Inequality in Detroit ° 779 


The third grid, the re/ative effects grid, was calculated by multiplying each cell 
in the weights grid by the total pounds of air pollutants emitted by that grid's TRI 
facility in 2000. Ideally, this grid would be calculated by multiplying each cell in 
the weights grid by some measure of facility size or visibility in order to account 
for the fact that the strength of an environmental hazard's non-exposure related 
negative effects is likely a function, at least in part, of the size and visibility of the 
hazard (see earlier discussion). However, the TRI does not provide researchers 
with facility size or visibility data. Thus, | had to select a proxy for facility size from 
the variables included in the TRI dataset. | selected air emissions as my proxy 
because TRI facility air emissions are strongly correlated with facility size for a 
subset of facilities for which facility size data are available. 

The relative effects grids for all the facilities in the database were then 
summed together to create a new grid in which each cell value represented the 
summed relative effect of all Detroit metropolitan area TRI facilities on that cell. 
For example, if there had been five facilities in the study area, and the re/ative 
effect of these facilities on grid cell A had been 0, 300, 10, 500 and 0 respectively, 
then their summed relative effect on grid cell A would have equaled 0 + 300 + 
10° 5000 or 810. 

Finally, the cell values in the summed relative effects grid were aggregated to 
the census tract level by summing together the cell values in each census tract 
and then dividing each census tract total by the number of cells in that tract. The 
resulting indicator, the mean relative effect indicator, provides an estimate of the 
relative effect of all study area facilities on each study area census tract.? 


Functional Form 


One of the strengths of the variable construction technique described in the 
previous section is that it allows researchers to vary the distance decay functions 
used to calculate the relative effects grids in any manner they want. Unfortunately, 
however, the environmental inequality literature provides researchers with little 
guidance on how to determine, for any potential negative effect, the proper 
functional form forthe distance decay equations. As a result, there is no theoretical 
or empirical reason for favoring one functional form over another. 

Given this problem, this article provides results for six distance decay indicators. 
Figure 2 presents graphs of the weighting functions used to calculate two of these 
indicators. Each graph in figure 2 graphs the line created by plotting distance (in 
feet) from the hazard against the weight assigned for that distance by the graph's 
particular decay function. Graph A plots a 1.5 mile curvilinear distance decay 
function and graph B plots a 1.5 mile /nverse curvilinear distance decay function. 
The curvilinear function assumes that an environmental hazard's negative effects 
decline relatively slowly at first and more quickly as distance increases. The 
inverse curvilinear function assumes that an environmental hazard's negative 
effects decline relatively rapidly at first and more slowly as distance increases. In 
each of these graphs, the weight reaches zero at 1.5 miles and remains at zero 
thereafter. However, results are also reported below for curvilinear and inverse 
curvilinear functions that reach zero at .5 miles and 2.5 miles. 
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The .5, 1.5 and 2.5 mile cutoff points were chosen because studies that 
examine the impact of hazardous waste sites on property values have generally 
found that property values are affected at distances ranging from .25 miles in 
some studies to more than 2 miles in others (Mohai 1995; Liu 2001). In addition, 
because little is known about the spatial extent of other potential negative effects, 
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it makes more sense to examine how results vary across multiple cutoff points 
than it does to select a single cutoff point for examination. 

Finally, | ran regression models using other linear and non-linear distance decay 
indicators. Results for these models are not reported in this article because they 
do not differ appreciably from the regression results reported below. 


Study Area 


The Detroit metropolitan area is defined here as the six counties that the U.S. 
Census Bureau designates as comprising the 2000 Detroit Primary Metropolitan 
Statistical Area (these counties are Lapeer, Macomb, Monroe, Oakland, Saint 
Clair and Wayne counties). | selected Detroit for this study because it represents 
one of this nation's most important rust belt cities and because TRI emissions 
and waste transfers in Wayne County, Detroit's host county, are among the worst 
in the nation. (Wayne County consistently ranks among the 10 most polluted 
counties in the United States.) 

In contrast to studies of western metropolitan areas (Downey 2006; Brown, 
Ciambrone and Hunter 1997; Pastor, Sadd and Hipp 2001; Pulido 2000), studies 
of rust belt cities have generally found relatively weak evidence of environmental 
racial inequality (Bowen et al. 1995; Brown, Ciambrone and Hunter 1997; 
Downey 2005). In Detroit, this may be explained by the fact that unlike industry, 
blacks have been confined primarily to the urban core. Thus, research has shown 
that by 1990, industrial facilities in Detroit's urban core were surrounded almost 
entirely by black neighborhoods, while industrial facilities in Detroit's suburbs 
were surrounded almost entirely by white neighborhoods, resulting in relatively 
low levels of environmental racial inequality at that time (Downey 2003, 2005; 
Krieg 1995 found similar patterns in the Boston metropolitan area in 1990). 

Of course, environmental inequality levels in Detroit may have changed 
between 1990 and 2000. For example, demographic changes may have placed a 
greater share of Detroit's black residents in suburban industrial neighborhoods 
than was previously the case, increasing environmental racial inequality levels in 
the region. Conversely, industrial facilities may have left Detroit's urban core in 
numbers large enough to weaken environmental racial inequality in the region, as 
appears to have happened in the period between 1970 and 1990 (Downey 2005). 
Finally, this study may find different levels of environmental inequality in Detroit 
than have previous studies simply because this study uses different hazard 
indicators and a different study area definition than have previous studies. 


Environmental Hazard Data 


Environmental hazard data were obtained from the Environmental Protection 
Agency's 2000 Toxics Release Inventory. The TRI records the number of pounds 
of specified toxic chemicals released into the environment each year by industrial 
facilities that fall into one of seven industrial categories (manufacturing, metal 
mining, coal mining, electric generating facilities that combust coal or oil, 
chemical wholesale distributors, petroleum terminals and bulk storage), employ 
the equivalent of 10 or more full-time workers, and manufacture, process 
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or otherwise use the specified chemicals in specified quantities» In 2000, the 
specified quantities were 25,000 pounds for facilities that manufactured or 
processed TRI chemicals and 10,000 pounds for facilities that otherwise used 
TRI chemicals (Rtknet 2004)*. 

The TRI reports toxic chemical releases to various media, including air, land, 
water and underground injection. It also provides data on off-site waste transfers 
and waste generated on-site. 7ota/ air releases, which are used to calculate the 
relative effects grids discussed above, are the sum of each facility's stack and 
fugitive air emissions. Stack air emissions are emissions that exit the TRI facility 
through a confined air stream such as a pipe or a stack. Fugitive air emissions, 
Such as leaks and evaporation, are air emissions that are not released through a 
confined air stream (Rtknet 2004). 

As noted above, total air releases is used as a proxy for facility size because the 
TRI provides no direct measure of facility size and such measures are unavailable 
from other sources. Dun and Bradstreet, for example, provide square footage 
data for many industrial facilities, but for only 49.5 percent of the facilities 
included in the database used in this article. However, when total air releases is 
correlated with the square footage of TRI facilities for which square footage data 
is available, the correlation equals .71 (p « .0001), providing evidence that total 
air releases is a good proxy for facility size. (None of the other TRI variables are 
as strongly correlated with the square footage variable.) 

Finally, TRI facilities were located on a map using latitude and longitude 
coordinates provided by the EPA. Because the estimated accuracy of these 
coordinates varies from less than 100 meters to 11,000 meters, only those 
facilities with coordinate accuracy estimates /ess than or equal to 150 meters 

were included in the dataset. In other words, only facilities estimated to be within 
150 meters or less of the latitude and longitude coordinates provided by the EPA 
are included in the dataset.* The resulting dataset includes approximately 85.8 
percent of the original observations.® 


Demographic Data 


Tract-level demographic data were obtained from the 2000 U.S. Census. 
Demographic variables were selected based on their inclusion in prior studies 
that have attempted to determine the relative importance of race and income 
in predicting environmental hazard presence (Anderton et al. 1994; Oakes, 
Anderson and Anderton 1996; Pastor, Sadd and Hipp 2001; Sadd et al. 1999). 
Demographic variables include percent Hispanic, percent non-Hispanic black, 
median household income, the percentage of employed tract residents who are 
engaged in manufacturing occupations (percent employed in manufacturing), 
the number of people per square mile (population density), the percentage of 
tract residents living in poverty (percent poverty), the median property value 
of owner occupied housing (median property value), the median age of owner 
occupied housing (median housing unit age), the percentage of residents 25 
years old or older who have completed 12 or more years of schooling but who 
do not have a four-year college degree (percent high school or some college), 
the percentage of housing units that are vacant (percent vacant housing units) 


H 
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and the distance from each tract's average grid cell to the nearest railroad line 
(average railroad distance). 

Median household income, percent living in poverty and median property value 
were selected because several scholars have argued that the reason minorities 
are overrepresented in environmentally hazardous neighborhoods is that housing 
costs are relatively low in such neighborhoods, making them attractive to lower 
income people who, in turn, are disproportionately non-white (Hamilton 1995; 
Mohai and Bryant 1992; Oakes, Anderton and Anderson 1996). 

Percent employed in manufacturing is included because some researchers 
have hypothesized that industrial facilities and industrial workers tend to locate 
near each other (Anderton et al. 1994a). Population density is included because 
manufacturing facilities are often sited in areas with plenty of open space 
(Downey 2005) and because some researchers have argued that local officials 
work to reduce pollution levels in areas with high population densities (Ash and 
Fetter 2004; Sadd et al. 1999). Average railroad distance is included because 
Detroit's industrial neighborhoods tend to be located near the region's railroad 
lines (Downey 2005). Median housing unit age and percent vacant housing units 
are included because | hypothesize that industrial facilities are overrepresented 
in older, somewhat run-down neighborhoods. Finally, the education variable 
is included because neighborhood education levels have been significantly 
associated with environmental risk levels in prior environmental inequality 
research (Ash and Fetter 2004). 


Results 


In order to determine whether environmental inequality existed in the Detroit 
metropolitan area in 2000, Table 1 correlates percent Hispanic, percent non- 
Hispanic black and median household income with the distance decay indicators 
discussed above. Table 1 shows that percent Hispanic is_ insignificantly 


Table 1: Pearson's Correlation Coefficients for the Distance Decay Indicators and 
Demographic Variables 





Percent Hispanic 


2.5 Mile 1.5 Mile 5 Mile 
Curvilinear -.0198 -.0033 .0169 
Inverse Curve -.0214 -.0001 .0189 
Percent Non-Hispanic Black 
2.5 Mile 1.5 Mile 5 Mile 
Curvilinear SS Ke .0436 
Inverse Curve Biman. | 0981" .0403 
Median Household Income 
2.5 Mile 1.5 Mile .5 Mile 
Curvilinear -.2364*** -.1541*** -.0902** 
Inverse Curve -.2147*** -.1467*** -.0894** 


*p<.05 *p<.01 mp DO) 
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associated with all six distance decay indicators, that percent^non-Hispanic 
black is positively and significantly associated with the 1.5 and 2.5 mile distance 
decay indicators, but not with the .5 mile distance decay indicators, and that 
median household income is negatively and significantly associated with all six 
distance decay indicators. In addition, Table 1 shows that none of the significant 
correlation coefficients are very large. Thus, it appears that black environmental 
inequality and income-based environmental inequality both existed in the Detroit 
metropolitan area in 2000, but they were both relatively weak at that time. 

In order to determine whether percent non-Hispanic black is still a statistically 
significant predictor of environmental hazard presence after controlling for other 
important predictors of environmental hazard presence, | regress each of the 1.5 
and 2.5 mile distance decay indicators on percent non-Hispanic black (tables 2-3), 
controlling for percent Hispanic, median household income and the other control 
variables discussed above. (Results are not reported for the .5 mile distance 
decay indicators because neither of them is significantly correlated or associated 
with percent Hispanic or percent non-Hispanic black.) 

Although it is possible that percent Hispanic, percent non-Hispanic black and 
median household income are linearly related to the dependent variable, this is 
not necessarily the case. For example, it may be that after neighborhood incomes 
reach a certain point, further income increases can only buy limited environmental 
improvements because environmental conditions are already at or near perfect. 
Similarly, if researchers are correct in arguing that environmental racial inequality 
exists because minorities have limited residential choice (Downey 2005), then 
any positive association that exists between percent minority and environmental 
hazard presence may level off as the percentage of minorities in a neighborhood 
increases beyond a certain point. In other words, just as increased residential 
choice may no longer buy improved environmental conditions beyond a certain 
point, it may also be the case that decreased residential choice no longer results 
in worse environmental conditions beyond a certain point. 

Thus, most of the regression models presented below include logged 
transformations of percent Hispanic, percent non-Hispanic black and median 
household income. The one exception is found in Table 3, where percent Hispanic 
is employed rather than percent Hispanic logged because the latter variable is 
insignificantly associated with either of the dependent variables in either the full 
or reduced regression models. 

Table 2 regresses each of the 2.5 mile distance decay indicators on the 
transformed race and income variables, controlling for the demographic and 
housing variables discussed above. Models 1-3 present results for the curvilinear 
indicator and models 4-6 for the inverse curvilinear indicator. The independent 
variables are stepped into the equation in three stages: models 1 and 4 only 
include percent non-Hispanic black logged; models 2 and 5 step in percent 
Hispanic logged and median household income logged; and models 3 and 6 
step in the remaining control variables. Percent Hispanic logged and median 
household income logged are stepped into the equation in the same model 
because inserting percent Hispanic logged into the equation separately has 
virtually no effect on the coefficients found in models 1 and 4. Thus, models 2 
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and 5 allow me to determine whether inserting median household income logged 
into the equation weakens any of the positive associations that exist between 
percent non-Hispanic black logged and the 2.5 mile distance decay indicators. 
Finally, because the residuals from ordinary least squares (OLS) estimates were 
spatially correlated with one another, these models, and those found in table 3, 
control for spatial autocorrelation. 

Table 2 shows that percent non-Hispanic black logged is significantly and 
positively associated with the 2.5 mile curvilinear indicator in model 1 and the 2.5 
mile inverse curvilinear indicator in model 4. The associations between percent 
non-Hispanic black logged and the two dependent variables are weakened when 
median household income logged is inserted into the equation in models 2 and 5 
and when the remaining control variables are inserted into the equation in models 
3 and 6. Nevertheless, percent non-Hispanic black logged is still significantly 
associated with the dependent variable in both full models. Thus, as percent 
non-Hispanic black increases, the 2.5 mile curvilinear and inverse curvilinear 
indicators both increase, but at a declining rate. 

Table 3, which presents results for the 1.5 mile distance decay indicators, 
shows that percent non-Hispanic black logged is significantly and positively 
associated with the 1.5 mile curvilinear indicator in model 1 and the 1.5 mile 
inverse curvilinear indicator in model 4. The associations between percent non- 
Hispanic black logged and the two distance decay indicators are weakened when 
median household income logged is inserted into the equation in models 2 and 5 
and when the remaining control variables are inserted into the equation in models 
3 and 6. Nevertheless, percent non-Hispanic black logged is still significantly 
associated with the dependent variable in model 3 and marginally associated 
with the dependent variable in model 6 (p =.0585). Thus, as percent non-Hispanic 
black increases, the 1.5 mile curvilinear and inverse curvilinear indicators both 
increase, but at a declining rate. 

As previously stated, regression results are not reported in table form for the .5 
mile distance decay indicators because neither of these indicators is significantly 
associated or correlated with percent Hispanic or percent non-Hispanic black. 
Nevertheless, it should be noted that in the .5 mile regression analyses, median 
household income logged is significantly and negatively associated with both 
distance decay indicators in both full regression models. 

Finally, unreported regression analyses were also run using two unit-hazard 
coincidence indicators as dependent variables: the total number of TRI facilities 
in each census tract in 2000 and the total pounds of TRI air pollutants emitted in 
each census tract in 2000. In these analyses, median household income logged 
is significantly and negatively associated with both hazard indicators. However, 
neither hazard indicator is significantly associated with percent Hispanic, percent 
non-Hispanic black or either of these variables’ logged terms. 


Discussion 


The results presented in the previous section demonstrate that Detroit's black 
neighborhoods were disproportionately burdened by TRI facility activity in 2000 
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Figure 3. Regression Curves for 1.5 and 2.5 Mile Distance Decay Indicators KR 
(a) 2.5 Mile Distance Decay Indicators 
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and that the associations between percent non-Hispanic black logged and the 1.5 
and 2.5 mile hazard indicators remained statistically or marginally significant even 
after controlling for a set of theoretically relevant neighborhood characteristics. 
Thus, it appears that neighborhood racial status played an important role in 
shaping environmental inequality in the Detroit metropolitan area in 2000. 

One could argue that the relatively weak correlation coefficients found in Table 
1 and the unreported regression results for the .5 mile hazard indicators show 


Figure 3. continued 
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(b) 1.5 Mile Distance Decay Indicators 
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that neighborhood racial status played little or no role in shaping environmental 
inequality in the Detroit metropolitan area. However, as Mohai (1995) points out, 
it is unlikely that environmental hazards’ negative effects are ever confined to 
an area with a radius as small as .5 miles. In addition, the findings in Table 1 do 
not negate the fact that percent non-Hispanic black logged is significantly or 
marginally associated with the 1.5 and 2.5 mile hazard indicators in all four full 


regression models. 
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But perhaps most importantly, when the relationships between percent non- 
Hispanic black and the 1.5 and 2.5 mile hazard indicators are graphed out, as they 
are in Figure 3, neighborhood racial composition appears to be a substantively 
important predictor of a neighborhood's mean relative effect levels. 

Graphs A and B in Figure 3 were created using the results from the full models 
in tables 2 and 3, holding the values of the other statistically significant or 
marginally significant independent variables constant at their respective means 
and the values of the statistically insignificant independent variables constant at 
zero. The values of the dependent variables (divided by 1,000) are listed on the 
y-axes and are different in each graph. 

Graphs A and B demonstrate that even after controlling for a host of other 
factors, predominantly black neighborhoods in the Detroit metropolitan area 
had much higher mean relative effect values than did other neighborhoods. For 
example, graph A shows that as percent non-Hispanic black increases from 0 to 
100, (1)the 2.5 mile curvilinear indicator increases from 74,316 to 131,464, a 57,148 
point increase, and (2) the 2.5 mile inverse curvilinear indicator increases from 
24,481 to 38,111, a 13,630 point increase. Thus, if we consider the curvilinear and 
inverse curvilinear functions to represent the endpoints of a reasonable "distance 
decay function continuum," Detroit metropolitan area neighborhoods that were 
100 percent non-Hispanic black had mean relative effect values between 13,630 
and 57,148 points higher than did Detroit metropolitan area neighborhoods that 
were 0 percent non-Hispanic black.® 

Moreover, graph A shows that although much of this increase occurs in 
neighborhoods that are between 0 and 10 percent non-Hispanic black, much of 
it does not. For example, the inverse curvilinear indicator, which equals 24,481 
at 0 percent non-Hispanic black and 31,296 at 10 percent non-Hispanic black, 
increases to 36,059 at 50 percent non-Hispanic black and 37,799 at 90 percent 
non-Hispanic black. Similarly, the curvilinear indicator, which equals 74,316 at 
0 percent non-Hispanic black and 102,890 at 10 percent non-Hispanic black, 
increases to 122,863 at 50 percent non-Hispanic black and 130,157 at 90 percent 
non-Hispanic black. 

The substantive importance of neighborhood racial composition is further 
confirmed by comparing graphs A and B to graphs C and D. Graphs C and D, 
which are identical to each other, graph out the empirical distribution functions 
of the metropolitan area's black and white populations when metropolitan area 
census tracts are ranked (from 0 to 100) according to the percentage of non- 
Hispanic blacks in each tract. These graphs show, for example, that 81.7 percent 
of metropolitan area whites, but only 3.7 percent of metropolitan area blacks, live 
in census tracts that are 5 percent or less non-Hispanic black. They also show that 
99.9 percent of metropolitan area whites, but only 5.9 percent of metropolitan 
area blacks, live in census tracts that are 10 percent or less non-Hispanic black. 

Thus, the graphs in Figure 3 demonstrate that the independent effect of percent 
non-Hispanic black on the 1.5 and 2.5 mile hazard indicators is quite substantial. 
As a result, we can safely conclude that environmental racial inequality was a 
serious social problem in the Detroit metropolitan area in 2000. 
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An important question still remains however. How exactly does a 13,630 or 
57,148 point increase in a tract's mean relative effect level affect tract residents? 
Are such increases large enough to affect nearby property values, beliefs about 
local health risks, local economic activity, psychological stress or sense of 
community? Are they large enough to produce some of these negative effects 
but not others? 

In order to answer these questions, researchers will need to carefully 
investigate the relationship between residential proximity to environmental 
hazards and these hypothesized negative effects. As noted above, some 
researchers have already begun to do so (Downey and Van Willigen 2005; 
Liu 2001; Mohai 1995). However, what is needed is research that links these 
hypothesized negative effects to specific distance decay indicators and specific 
mean relative effect levels. 

For example, researchers could merge distance decay indicators such as 
those employed in this article with survey and census data to examine the effect 
that environmental hazards have on neighborhood disorder and depression. 
Similarly, researchers could merge distance decay indicators with economic 
data to examine the impact that environmental hazards have on nearby property 
values and local economic activity. 

Researchers could then use their findings to link specific distance decay 
indicators to specific negative effects, allowing them to determine, for each of 
thése negative effects, (1) how much relative effect values have to increase in 
order to negatively affect individuals and neighborhoods or (2) whether there 
is some threshold value at which individuals and neighborhoods are negatively 
affected. These results could then be combined with the kind of results reported 
in this article to determine whether minority and low income neighborhoods 
are disproportionately burdened by environmental hazards' proximity-related 
negative effects. 

Although conducting such analyses is clearly beyond the scope of this 
article, the true substantive significance of the findings reported here, and 
of environmental inequality research in general, cannot be understood until 
researchers are able to link inequitable proximity to specific proximity-related 
negative outcomes. (Bowen 2002 makes the same point about environmental 
hazards' exposure-related risks.) Because distance decay indicators are highly 
flexible and provide researchers with more accurate estimates of environmental 
hazards' non-exposure related risks than can be obtained using other hazard 
proximity indicators, they are ideally suited for establishing such a link. 


Conclusion 


As with any study, caution must be taken in interpreting the findings reported 
here. For example, because the data for this study are drawn from a single 
metropolitan area, the results cannot be generalized to other metropolitan 
areas or to the United States as a whole. In addition, because this study uses 
aggregated census tract data, the cell values in the summed relative effects 
grids had to be aggregated to the census tract level, resulting in a significant 
loss of information and minimizing to some unknown degree the advantages of 


792 * Social Forces Volume 85, Number 2 * December 2006 


using grid cell data (the aggregated data problem). Finally, because'the literature 
provides little guidance on properly estimating distance decay rates, the hazard 
indicators employed in this article were calculated using a range of distance 
decay functions and cutoff distances rather than a set of decay functions and 
cutoff distances tailored to specific negative outcomes. As a result, the hazard 
indicators employed in this article do not provide as precise a set of proximity- 
based risk estimates as they otherwise would. 

These caveats notwithstanding, this study makes several contributions to the 
environmental inequality literature. First, it introduces a distance decay modeling 
technique that more accurately estimates proximity-based environmental risk 
than do other modeling techniques currently found in the literature. It more 
accurately estimates proximity-based risk because it does not assume that an 
environmental hazard's negative effects are confined solely to its host unit of 
analysis, that the strength of these negative effects remains constant as distance 
from the hazard increases, or that these negative effects are distributed evenly 
within analysis units; and unlike plume modeling techniques, this technique can 
be used to estimate non-exposure related risks. The technique is also highly 
flexible, capable of incorporating any distance decay function that researchers 
deem appropriate. 

Moreover, the technique's shortcomings are not as serious as they first appear. 
As noted in the introduction, the aggregated data problem results from a lack 
of address-specific, individual-level, demographic data, not from any problem 
with the technique. The solution, therefore, involves either gathering or obtaining 
appropriate demographic data or developing techniques for apportioning 
aggregated demographic data across grid cells so that it can be matched to the 
grid cells used in the summed relative effects grids. Mennis (2002) provides one 
interesting approach for apportioning aggregated demographic data across grid 
cells and researchers should consider other approaches as well. 

The lack of academic knowledge regarding distance decay rates is likewise 
not a limitation inherent to the technique. Rather, it results from the fact that 
environmental inequality is a relatively new field of research and thus, researchers 
have not had the time to solve all the important methodological issues facing 
them. As the field develops and researchers gain a better understanding of the 
relationship between proximity and risk, the distance decay functions researchers 
use will become more and more precise. 

Second, this article provides environmental inequality researchers with a solid 
rationale for using hazard proximity indicators. Prior to this, many researchers 
have argued that the only reason to use proximity data is that proximity data 
are a reasonable proxy for exposure data, which are generally unavailable to 
researchers (Sadd et al. 1999). This article argues instead that proximity indicators 
are just as important as exposure indicators because environmental hazards have 
proximity-related negative effects that are poorly captured by exposure data. 

Third, this article demonstrates that environmental racial inequality can be a 
serious social problem even when correlations between minority presence and 
environmental hazard presence are relatively weak, suggesting that researchers 
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need to be attentive not only to the statistical significance of their findings but to 
their substantive significance as well. 

Finally, the finding that percent non-Hispanic black logged is significantly 
associated with the 1.5 and 2.5 mile hazard indicators, but not with the .5 mile 
or unit-hazard coincidence indicators, has two important implications. First, 
it suggests that although neighborhoods between approximately .5 and 2.5 
miles of Detroit's TRI facilities are disproportionately black, neighborhoods that 
actually house TRI facilities or are immediately adjacent to TRI facilities are not. 
This is consistent with Anderton et al.'s (1994) findings regarding hazardous 
waste sites, suggesting that this may be an important residential pattern for 
researchers to explain. 

Second, this finding suggests that previous studies that have used unit- 
hazard coincidence indicators may have underestimated the significance 
of environmental racial inequality. This is important because environmental 
inequality researchers have used unit-hazard coincidence indicators more often 
than they have used any other type of environmental hazard indicator (Mohai 
and Saha 2006). Thus, this study suggests that a large body of environmental 
inequality research may underestimate the significance of race in shaping 
environmentally inequitable outcomes. 


Notes 


1. Theunit-hazard coincidence method assumes that an environmental hazard's 
negative effects remain constant until you reach the borders of its host 
analysis unit, at which point these effects abruptly drop to zero. 


2. See Chakraborty and Armstrong (2001) for an interesting exception. 


3. Itshould be kept in mind that these re/ative effects averages do not represent 
average air pollutant concentration levels, the total pounds of air pollutants 
emitted in the average analysis unit grid cell, or some absolute measure of 
hazard impact on each analysis unit. Instead they are estimates of the re/ative, 
non-exposure-related impact of all study area facilities on each study area 
analysis unit. Thus a score of 1,000 indicates twice the estimated impact of 
a score of 500, but has no absolute meaning of its own (RSEI risk scores are 
also interpreted relative to one another and not in absolute terms). 


4. See Sadd et al. (1999) and Downey (2006) for a detailed discussion of the 
advantages and limitations of TRI data. 


5. Details on the process that the EPA uses to determine TRI facility latitude and 
longitude coordinates and to estimate the accuracy of these coordinates can 
be found at the following website: http://www.epa.gov/opptintr/rsei/docs/ 
tech app d.pdf. 


6. Restricting the dataset to facilities with coordinate accuracy estimates of 
less than 150 meters would have resulted in a dataset with only 1.5 percent 
of the original observations. 
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7. loriginally included two education variables in the analysis, tfe percentage 
of tract residents 25 years old and older with a college degree or higher and 
the percentage of tract residents 25 years old and older with less than a 
high school degree. Because these variables were both positively associated 
with all the dependent variables, | re-ran the regression models using the 
education variable that is defined in the main text. This variable is consistently 
and negatively associated with the dependent variables. 


8. Comparing graphs A and B, we see that the predicted values of the dependent 
variables vary more when we hold these variables' cutoff distance constant 
and vary their functional form than when we hold their functional form 
constant and vary their cutoff distance. In other words, in these models it 
appears that functional form has a stronger influence on regression results 
than does cutoff distance. This is a striking and somewhat surprising finding, 
especially when we consider that varying the cutoff distance from .5 to 1.5 
miles has a substantively significant effect on correlation and regression 
results. Explaining why functional form matters more than cutoff distance in 
the 1.5 and 2.5 mile regression models is beyond the scope of this article, 
but it clearly merits further investigation. 
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I (Don't) Hate School: Revisiting Oppositional Culture Theory 
of Blacks' Resistance to Schooling 


Angel L. Harris, University of Texas at Austin 


This study provides an extensive test of Ogbu’s oppositional culture 
theory that accounts for student maturation over time. Using data from 
the Maryland Adolescence Development In Context Study (MADICS), 
I test the proposition that blacks resist school more than whites, and 
that this difference grows with age. Analyses were conducted across 24 
outcomes and revealed two major findings with implications for the 
study of race and school achievement. First, five major tenets of the 
theory were not supported, which challenges the existence of a pervasive 
oppositional culture among black Americans. Second, maturation 
after grade 7 had minimal impact on white-black differences on the 
outcomes. Findings are discussed in terms of their implications for 
sociological theory and educational policy. 


Introduction 


By age 17 the average black student is four years behind the average white 
student; black 12th graders score lower than white 8th graders in reading, 
math, U.S. history and geography (Thernstrom and Thernstrom 2003).' Although 
there was a convergence in black-white test scores from the early 1970s to 
approximately 1990 (Grissmer, Flanagan and Williamson 1998; Hedges and 
Nowell 1999) the convergence has been slow. For example, using data from the 
National Assessment of Educational Progress (NAEP) — created by Congress in 
1969 to regularly test nationally representative samples of students in grades 4, 
8 and 12 (or sometimes ages 9, 13, 17) — Hedges and Nowell (1999) conclude the 
rate of change over the past 30 years suggests gap convergence would take 30 
years in reading and about 75 years in math. They also conclude gap convergence 
on non-NAEP surveys would take 50 years in reading and more than a century in 
math. However, more recent analysis of achievement trends in the 1990s indicate 
the modest convergence in reading and math gaps made in the 1970s and 1980s 
has either ceased (Smith 2000) or reversed (Grissmer et al. 1998). 

Many theories and studies have attempted to explain the gap's persistence. 
Prominent explanations include blacks' inferior genetic makeup (Jensen 1969; 
Herrnstein and Murray 1994) — no longer seriously considered given lack of 
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empirical evidence (Nisbett 1998) - their greater exposure to non-intact family 
structures (Moynihan 1965), and various forms of economic disadvantage (Massey 
and Denton 1993; Wilson 1987, 1996). Even standardized testing has been 
critiqued for rigidity and lack of accounting for the overall academic experience 
(Kozol 2000; Meier 2000), or for having racial biases making minorities destined 
for lower scores (Sacks 1999). However, differences in background resources 
generally account for no more than a third of the achievement gap, and the gap 
exists on non-test measures such as grade point averages, even among middle- 
class blacks in suburbs (Ogbu 2003:35). The literature makes the gap appear 
intractable and creates fertile ground for theories suggesting its origin is rooted 
in the culture of minorities and their own refusal to succeed. 

Itis important to distinguish between the terms “black” and “African American.” 
To some, “African Americans” are a subgroup within a larger black community that 
includes those who may be first-generation immigrants or who do not identify as 
African American. A theory garnering much attention within the social sciences 
is the oppositional culture explanation (or resistance model) (Ogbu 1978), which 
contends the achievement gap occurs because blacks perceive lower returns to 
education and fewer occupational opportunities than whites. Ogbu (1978) argues 
motivation for maximizing school achievement results from beliefs that more 
education improves chances for attaining better jobs and higher wages and social 
status. Because African Americans are involuntary minorities — groups historically 
enslaved, colonized or conquered who interpret the incorporation of their group 
into the United States as forced by white Americans (i.e., dominant group) - they 
experience or perceive barriers to success with regard to future employment 
and earnings due to racial discrimination and structural inequalities.?? These 
experiences/beliefs lead them to become disillusioned about the future and 
doubt the value of schooling. Consequently, they develop a culture oppositional 
to the dominant group and resist educational goals, often resulting in early school 
withdrawal altogether. Thus, an assumption of the theory is that ameliorating 
the achievement gap will require a change in their culture. While all subjects 
in my research self-identified as African American, to remain consistent with 
terminology in previous research, | employ the term "black." 


Popularity of the Resistance Model 


In the past decade, the theory has moved from the academy into the mainstream 
press. In the Atlanta Journal-Constitution, for example, Thomas Sowell (1994) 
wrote, "the most painful of the new developments has been the growth of an 
attitude in ghetto schools across the country that trying to learn is “acting white.'” 
Similarly, Brent Staples (1996) of The New York Times wrote, "some education 
experts demean the gifted programs as elitist and unfair. Even rival students 
get into the act, harassing the achievers for ‘acting white.’” Bob Herbert (1995) 
of The New York Times provides a more scathing description: “Some African 
Americans, unable to extricate themselves from the quicksand of self-defeat, 
have adopted the incredibly stupid tactic of harassing fellow blacks who have 
the temerity to take their studies seriously. According to the poisonous logic 
of the harassers, any attempt at acquiring knowledge is a form of ‘acting 
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white." These excerpts illustrate the pervasiveness in the belief blacks have 
an antagonistic relationship toward schooling. Also, in addition to strong sales 
in books propagating this belief (Steele 1990; McWhorter 2000), the theory 
that blacks resist schooling has been embraced by educators and the general 
public and is practically regarded as "common sense." Although some studies 
fail to find support for this theory (Ainsworth-Darnell and Downey 1998; Cook 
and Ludwig 1997, 1998), Farkas, Lleras and Maczuga (2002:154) conclude "it 
is premature to reject oppositional culture as one of the possible mechanisms 
influencing lower school performance among ethnic minorit[ies]. . ." The debate 
regarding the theory's merit is still ongoing. 


Limitation of Previous Research 


Despite the resistance model's popularity, some aspects of the theory are largely 
untested. An often-overlooked component of the theory is that involuntary 
minorities deve/op perceptions that their educational efforts are undervalued 
within society. As they transition into adulthood, they gain awareness of other 
minorities’ misfortunes with the opportunity structure and system of social 
mobility. In his most recent analysis of poor achieving blacks in the affluent 
suburb of Shaker Heights, Ohio, Ogbu (2003:20) describes their academic 
efforts as decreasing markedly from elementary to high school. He (2003: 154) 
notes that although lack of effort "might not be serious in the early grades, it 
became more serious as students got older and began to think that they, too, 
would have difficulty in the opportunity structure . . . just because they were 
black." He suggests they do not try hard despite believing in American ideals 
(e.g., success through education and hard work) because they are "mistrustful, 
ambivalent, and skeptical, especially as they got older and moved up in their 
school career." (Ogbu 2003:41) 

Previous studies provide limited assessments of the theory's maturation 
component. Quantitative studies examining school resistance across multiple 
age groups (e.g., Farkas, Lleras, and Maczuga 2002; Downey and Ainsworth- 
Darnell 2002) use cohorts comprised of different children; these studies 
consist of cross-sectional analyses of different cohorts and focus on only one 
component of the resistance model: the "acting white" hypothesis.^ Tyson (2002, 
2003) qualitatively examines the development of oppositional schooling attitudes 
among blacks in elementary school. In general, she finds black children begin 
school very much engaged and achievement-oriented and that the rejection of 
school norms does not characterize the larger black culture. However, she finds 
the schooling experience plays a central role in the development of negative 
schooling attitudes. In addition to poor achievement, school officials place 
strong emphasis on transforming many aspects of black children's culture, which 
inadvertently communicates inadequacy associated with "blackness." She notes 
children's negative statements reflect desires to avoid further failure in school, 
implying schooling attitudes are part of a developmental rather than cultural 
process. Unfortunately, her observations are limited to one school year and 
assessing the relative change in school resistance between blacks and whites 
across adolescence was not her purpose.” 
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Another limitation of previous quantitative studies is the lack of breadth in 
resistance measures among datasets used to assess the theory. The resistance 
model is a rich, multi-dimensional theory. Although Ainsworth-Darnell and 
Downey (1998) and Cook and Ludwig (1997) systematically compared African 
Americans and whites across resistance measures, they were unable to provide 
a more exhaustive test due to the relatively few indicators of school resistance 
across waves within the National Education Longitudinal Study (NELS). This data 
limitation forced them to employ a cross-sectional design. In contrast, recent 
qualitative studies (Akom 2003; Carter 2005; Horvat and Lewis 2003; O'Connor 
1997; Tyson 2002; Tyson, Darity and Castellino 2005) — which find little support 
for the theory — have been able to capture the theory's richness. However, they 
do not assess school resistance across adolescence. As such, no study provides 
a systematic assessment of the theory that accounts for many of its nuances 
within the same sample. 

This study contributes to the literature by providing a test of the resistance 
model that might bridge the aforementioned qualitative studies, which 
capture the theory's multi-dimensionality, and quantitative studies, which 
make systematic racial comparisons of only certain components of the theory. 
| use panel data that contains a rich collection of measures allowing for the 
examination of whether blacks become more resistant to schooling over time. 
Because the goal of this research is to determine if an oppositional culture exists 
among involuntary minorities relative to whites, immigrant blacks and Latino/as 
are excluded from this study. 


Methods 


Data 


Data for this study come from the Maryland Adolescence Development In 
Context Study (MADICS), which contains a unique collection of measures on 
1480 adolescents. The MADICS (51 percent male and 49 percent female) sample 
was drawn from a county on the Eastern seaboard of the United States and 
consists of 1,407 black and white families (66 and 34 percent, respectively). Data 
were collected from the time the target youths entered middle school in the 
fall of 1991 until they were three years removed from high school. The sample 
was selected from approximately 5,000 adolescents in the county that entered 
middle school during 1991 using a stratified sampling procedure designed to 
get proportional representations of families from each of the county's 23 middle 
schools. As such, students' socioeconomic (SES) backgrounds are varied as the 
sample includes families from neighborhoods in low-income urban areas, middle 
class suburban areas, and rural farm-based areas. While the mean family income 
in the sample is normally distributed around $45,000-$49,000 (range $5,000- 
$75,000), white families report significantly higher incomes ($50,000-$54,999) 
than black families ($40,000-$44,999). 

The current study uses white and black students from the first three waves 
of the MADICS, which were collected when they were in grades 7 (n = 1407), 8 
(n = 1004), and 11 (n = 954). In supplemental analyses not shown, blacks were 
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not more likely to attrite than whites; the proportion of whites and blacks within 
the sample remains the same for each wave. Thus, any white-black differences 
observed over time in this study could not be an artifact of differential rates of 
sample attrition. It is also important to note most attrition occurs between grades 
7 and 8; only 3 percent of the sample is lost between grades 8 and 11. This 
suggests it is unlikely sample attrition results from students dropping out of high 
school (see Appendix for an assessment of attrition bias).® 

Although MADICS is primarily used by psychologists for understanding 
psychological determinants of behavior and developmental trajectories during 
adolescence, it is well suited to the goals of this study. It has the necessary 
quality and greater breadth than previous datasets used to assess the resistance 
model. Its richness and longitudinal design provide a good opportunity to 
determine whether development from middle school to high school leads to 
greater opposition toward schooling among blacks than whites using a wide 
range of measures. 

One unfortunate limitation is that MADICS was not designed to draw inferences 
to the national population of students. However, | am unaware of theoretical 
models positing that the underlying causal mechanisms of school resistance 
vary by social class or geographic area (e.g., east/west, urban/suburban). The 
resistance model attributes an oppositional cultural frame of reference to the 
wider “black community.” (For further elaboration, see Tyson 2002:1166-67.) 
However, findings from this study indicate results based on MADICS yield 
similar conclusions to studies that use national data (i.e., Ainsworth-Darnell and 
Downey 1998; Cook and Ludwig 1998). This is not surprising given there is no 
prior theoretical or empirical basis for expecting the processes and parameters 
underlying school resistance in the current study to differ from those observed 
in nationally representative samples. The strength of MADICS is that it allows for 
the resistance model to be tested with greater depth, which should compliment 
previous studies that use nationally representative samples. 


Analytic Plan 


This study consists of a series of racial comparisons on numerous factors. 
Four models are estimated for 24 outcomes using pooled cross-sections and 
developmental effects are tested by interacting grade level and race. Whereas 
the first model regresses the outcome on race to determine whether racial 
differences exist, the second model controls for grade level. Because Ogbu 
posits being an involuntary minority leads to oppositional culture independent 
of social class, the third model assesses racial differences net of grade level and 
socioeconomic (SES) factors associated with race and the outcomes such as 
family income, parental education and family structure. To determine whether 
blacks fare worse on the outcomes as they move closer to adulthood relative to 
whites, the final model includes interaction terms between grade level and race. 
Because an individual can enter the sample three times, robust standard errors 
are used to account for correlation across individuals. 

Table 1 contains detailed information on all outcomes. Constructs comprised of 
multiple items are weighted sums; responses were added and the sums divided 
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by the number of items. It is important to note not all resistance measures were 
collected in each wave. Therefore, the number of observations varies across 
outcomes. Item non-response rate is .05 or less for outcomes measured in all 
waves, .06 or less for outcomes measured in only two waves (i.e., grades 7 or 
8 and 11), and less than .02 for outcomes measured only in grade 7. Although 
the within grade racial differences in means in Table 1 appear similar, models 
discussed above allow for systematic racial comparisons to be made after 
adjusting for SES. 


The Oppositional Culture Model: Five Hypotheses and Results 


Hypothesis 1: B/ack children perceive fewer returns to 
education and more limited opportunities for upward social 
mobility than white children. 


Ogbu (1978) notes the expected gains from education held by children and their 
parents or community serve as an important determinant of school performance. 
Furthermore, Ogbu (2003:154) notes older black students become discouraged 
from striving academically in part because of their greater perceptions of limited 
chances in the opportunity structure. Hypothesis 1 is tested by determining 
whether black children perceive fewer returns to education, have lower 
educational aspirations, and lower educational expectations than white children. 
Also, the extent to which blacks perceive more limited opportunities for upward 
social mobility than whites is examined by determining whether there is greater 
discrepancy between their aspired and expected educational attainment. 
Presumably, how far students believe they will go in school (educational 
expectations) should be similar to the educational level they wish to attain (i.e., 
educational aspiration). However, because blacks are posited to perceive more 
limited opportunities with regard to upward mobility, their expected educational 
attainment should increasingly diverge from their educational aspirations as they 
move closer to adulthood. 

Table 2 shows no support for the first part of Hypothesis 1. Specifically, while 
black students report greater returns to education and have higher educational 
aspirations than whites (6 = .091 and .123 for each outcome in Model 1, 
respectively), the bottom panel of Table 2 shows a non-significant difference 
between whites and blacks in educational expectations. Furthermore, when 
background factors are controlled in Model 3, blacks’ advantage on perceived 
returns to education increases by 27 percent (to 6 = .116), their advantage 
on educational aspirations nearly doubles (6 = .239), and they have greater 
educational expectations than whites (b — .123). In addition, while Model 3 shows 
perceived returns to education decreases for both groups over time, Model 4 for 
these outcomes indicates group differences remain constant over time. 

Studies using NELS also find that blacks perceive greater returns to education 
than whites and that no differences exist in educational expectations between 
these groups (Ainsworth-Darnell and Downey 1998; Voelkl 1993; Cook and 
Ludwig 1997). Blacks' perceptions are consistent with analyses that show each 
step along the educational ladder yields an increase in earnings (Thernstrom 
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and Thernstrom 2003). In fact, blacks actually earn more than whites when 
achievement is held constant (Farkas and Vicknair 1996). 

Analysis for the final outcome in Table 2, however, shows blacks perceive 
greater limited educational opportunities than whites (b = .193 in Model 1), 41 
percent of which is due to differences in SES (b declines to .114 in Model 3). The 
discrepancy between the educational attainment students would like to reach 
and the educational attainment they believe they will reach is greater for blacks, 
even after controlling for SES; something other than SES contributes to their 
greater belief that they will not reach their educational goals. The racial difference 
in this belief remains constant over time. 


Hypothesis 2: Black children have less favorable affect 
toward school than whites. 


It is unclear whether black students differ from whites on attitudes toward 
education. Although studies show they have more favorable schooling attitudes 
than whites (Ainsworth-Darnell and Downey 1998; Coleman et al. 1966), 
Mickelson (1990) notes attitudes toward education are multidimensional. She 
notes students have abstract attitudes (e.g., education is important) that reflect 
dominant ideology regarding the ideal role of education and concrete attitudes 
rooted in life experiences. She finds that concrete attitudes predict achievement 
behavior and that blacks hold less positive concrete attitudes than whites, 
which she attributes to the material realities they experience that "challenge the 
rhetoric of the American Dream." (p. 59) However, these studies assess schooling 
attitudes by asking youths to estimate future educational returns. 

To assess Hypothesis 2, the previous analyses are repeated for affect toward 
school — students' liking of school and schooling activities. According to Ogbu 
(1978), black children should have lower affect toward schooling than whites, 
and this difference should increase as they get older. Racial differences are also 
assessed on go to school because enjoy classes and go to school because have 
to. The aim here is to determine if mandatory school attendance becomes a 
greater reason for blacks' school attendance rather than enjoyment of classes 
relative to whites. These measures further tap into students' affect toward school 
without asking them to estimate the future payoff of schooling. 

Table 3 shows blacks have greater affect toward school (b — .104 in Model 
1 and .118 in Model 3) and rate enjoyment of classes as more important for 
their school attendance (b — .308 in Model 1 and .320 in Model 3) than whites. 
While blacks maintain greater affect toward school than whites over time, the 
difference in the importance of class enjoyment for school attendance narrows by 
grade 11, although blacks still maintain a minimal advantage (b — -.278 for black 
x G11 in Model 4, but only marginally significant). There is no racial difference 
on the importance students attribute to mandatory school attendance for their 
attending school. 

The current findings show both groups experience similar rates of decline 
over time in school affect and feeling that class enjoyment is important for their 
school attendance, and an increase in feeling they attend school because it is 


804 ° Social Forces Volume 85, Number 2 ° December 2006 


`o] ƏABU 








(Loz) (voc) (Qro) (voc) (zz) (goz)  uoseerjueuodur fuen = / | esneoaq jooyos 0] 06 | :¡ooyos Buipuaye | oj e^eH esne»eg 
= ER 00'S 06'y 19v UoSe91 juejioduil 3ON - | Joj uosea1 Buiwojoj eui jo eoueyiodu| |J00U3S 0] 09 
"sessejo Áui Áoluə Sasse|o 
(021) (c91) (cr) (ozi) (pg8)) (241)  uoseajuejodu Ai = / | esneoaq jooyos 0] 06 | jooyos Buipuaye  Aofuy esne»eg 
= Orr gZr gr G6'€ Cor bp UOSe9J jueyoduil JON - |. J0J uoseoJ Buiwoj[oj euj jo aoueyodu] ¡O0Y9S 0] 0€) 
'noÁ oj jueuoduii fe eJe sepeuc) (p) 
0] e [oouos eyij noÁ 'jejeuo6 uj (2) 
(9) (vL) (c9) (e9) eejby Duong - G awy zo asem e sı yroməwoy (q) ,'5uuoq [0049S 
G9 Lge Gre — — GLE 69'€ ooJDesiq Ajbuoyjs = | eJe sjoeíqns 1o sessejo Áu jo }Sow (e) pJeMo | 109 Y 
(c siseyjodA}) joouos piewo | 32e] v 
(07V) (v6) (oc) (99) (er) (cp 'suonejoedxe jeuoneonpe pue suogeudse ^ senunyoddo 
cM UD 9 PE; v9 06 69 Jeuoneonpe ueewgeq Áouedəsosıig — `'jnp3 pəƏluu 
¿Jooyos ul 06 
(e91) (891) (pol) (ec) Uzi) (een 'aW'qud"dfrz6 Im Aene nof WOU) DO op Je} MOH “op suo ejoadx3 
— £69 989 669 elZ 8/9 689 sso|J0 epeJb ug - | 0j jUeM jsou! 9^ jeuM op sÁeWe 1,ueo ƏM Jeuoneonp3 
vi) (eu (Qr) (ec (651) (Gr GiW'"qud"df-6 &éoouos ur o oj exi NOÁ pinom ej moy suoyeJdsy 
EA VS LES TITLE 79:29 17672 sse|Jo epejb ujg - | — 'pejuew noÁ jeu Ápoexe op pjnoo nof Ji Jeuoneonp3 
"pooyJoqubiau 
Áu U! Spi} ay) 10, aj] ul peaye 
196 0) Áem 1s9q eu si uoneonpe poof e 
Bumao (9) "eui exi| spbj 10) jueyoduii os uoneonp3 
(erys Qr) ser)-^() ps9) (29) eelby ADuong -G joust Buroouos (q) *ejij ui sseoons e eq 0} sunay 
O Or Clr] Olt aY eejDesig Á[Duong = | 0} JUEM | J! jooyos ul [jam op o) aney | (8) DƏAI9919d 
(y siseyjodÁAH) seniunuoddQ eunjn j jo suondeojed 
yoelg ƏJUA x»elg HUM JJ SIUM 
eud|y | | epeuc) 9 epeuc) 1 9peJ9 IJAN uonduoseg eueN 9|qeueA 





S9]QPLIPA 103 SUONALIISI(] PUR SUONPIAI( PIPPUEIS “SUBIJA P9ISNÍPPU() :T AQEL 


l Culture Theory * 805 


lona. 


. 


Opposit 


ting 


isi 


Rev: 


—— S C 


( (OU 


01) 
= i072 


ci) 
906 ger 
(6S |) 
— Cb 


(ezz) 
z 


(el) 
09° 287 


er (ve V) (16) 


ev 


(901) 
907 


GN 
0€€ 
91) 
9€'y 


(Uz 
gcc 


(91) 
692 


S9 


(001) 
191 


61) 
09'| 


(4) 
LEZ 


SC 


(cg) 
c8 


81) 
981 


GN 
LL 


(£01) 
897 
(cc V) 
ZER 


(co'z) 
evi 


HN 
HE 


(96) 

v9 
(911) 
4 7 


(c6 1) 
br! 


(c4) 
10€ 


yjuou e souo uey} sse7 = | 


"(Aenyoadsa, 
*| 1 pue g sapelb ul payse) jooyos 
yBbly pue ¡ooyas ajppitu WO! papuadsns 


alow 10 Q} O} uəəq sey y}noÁ səw jo Jaquiny 


USHO = y ¿SOSSB|O NI JO Joouos 
0] 'JƏAƏN = | paddiys noA aaey ‘Jae 4I ‘UƏYO MOH 


 Ə|qno1 ul 


Jooy9s dhis 


Áouenbuijeg pue Áoueni | 


¿siededsmau pea 

(9) ¿eunseajd Jo} seulzeBew Jo syoog pear 

(q) ¿AL uo SMoYs jenjno 10 Jeuoreonpe 

Jų; ugy} aow ‘Ajeg = 9 “smau Bulyojem (e) noÁ pip uayo 
J9A9N = | MOY MNOQE ‘saam omy, ]Se] ay) ¡oqe xuly | 


Jų; uey} aow ‘Ajeg = 9 ¿yoməwoy Op noÁ pip uəyo 
JƏAƏN = |, MOU JNoqe “SIM OM) SE] ay) 3noqe xulu | 
¿(uawularob apres 10 epp ep Une) 

App Age Mgnern 9 ` song ous ag Aug uo aun pueds 


SOLANO 
SuluJes7] 


JIOMƏWOH 


sqnid 


noÁ pip ugyo Mou JeeK 1se| eu; 8uung senno y joouogs 


seniAnoy omuepeov/Buijoouog uo aui | 


¿diay Jo, spualy anoÁ (a) ¿day 

JO (s)juased nof (p) ¿djay 104 sjuapn]s 

Jeujo (9) ¿djay Jo, 'Jo]n] e ay] jooyos ay, 

ul s]inpe Jayjo (q) ¿djay Joy siayoea] moÁ 

SÁeM|y 1S0UJ|V - G (e) 0] 06 noÁ op ueyo Mou ^uiowoouos 
J9A9N JSOWY = | uo ə|qno1) Bui^eu ay noÁ uaym 


dleH 19898 


(g siseuyodÁH) Ájpeoruiepeoy e^oJdui| oj uojj3 
——————'-—————————  ——— d i i i i: ii 


806 » Social Forces Volume 85, Number 2 * December 2006 


— 5 5 


(p) (ov) (v) (vv) 
— OL 6 SZ GL’ — 


(9c) (oz) (Qr) (et) (e91) 
— 10S mp 96v Gët Ob 


(or) 
7 -= z — 96 | 


(cv V) 
E = em s ess 
(ze) 
= = = — 25 


(0601) (pet) (Leb) (vel)  (88'1) 
—  |4Vv EC Op Cp Sr 


(c) (ev) (ec) (ec) Ge» 
jg ewa seh e El E 


(cc |) 
Ivy 
(Lot) 
66 | 


(6v |) 
60'S 


(S6 |) 
9€ y 


(sc |) 
LS 


spueu jo Ajue|d e^eH - |, 


spueuj Áue eAeu JOU 'Seu eu/s spueuj ,SpueluJ 

op Jo ajo pey | ySIM = 0 jo Jequinu euj jo jueuissesse sujnoA jo jequinN 
Jejndod 9Jou yan = / ¿noÁ ase Jejndod 

Je|ndod ssə| uonl = | moy 'eBe JnoÁ spi 1eujo oj pejeduio? Ñue|jndoq 


Shem }sowly = ç 
JƏAƏN 1SOUI|V = | 


spy 


¿SDI Jeu uj Buoje sp! uJ Buo|y 
Bume6 ajqnoyy aney noÁ op uayo MOH Buməo əlqnoil 


Jayjo Uey) Jayeq yonu = 4 .SJ99d 01 

sply "spualy Bueu ul op noÁ op anejay spueu 

au UEL asjom yonu = | ¡am moy 'aBe mol sp Jauyjo 0] pesedwog — exe O Aug 
¿HUM 

ON = 0 'S9A— = | Bunoe, jo ued sepe16 poob Buyəf s| SUM PY 


(p siseujodAH) seunseey Ajuejndog pue ejiuM Bunoy sy eoueuuojed pooo 


sBulu) Jeuyo ueu eui 

0j jueuoduul eJoul uonjN - / 
sBulu) Jong uey} əw 

o) )ueluoduii ssə| u9nl, = | 


¿no o) (suods) ae jueuoduul suods 
moy “op noÁ sBuly, sayjo 0] pajedwog ^ jo eoueyoduJ| 


sBuly) Jeuyo ueu eui 

0] jueyioduui eJoul uonjy - / 
suly} Jeujo ueuj eui 

0] jueyioduii ssa| uon = | 


¿sjoelqns ¡ooyos 

1900 (q) ¿yen (e) ¿noÁ o) saniAnoe 
Buiwo||o) au Jo uoee ee jueuoduui ONWIPeoy 
moy “op noÁ sBuly, sayjo 0] pajedwog jo adueylodu] 
Soiulepeoy-uow 0) eAnejos soruepeoy jo eouejioduj| 


saat 


gd 


poanu13uo» | 9[qeL 


Oppositional Culture Theory * 807 


ting 


Revisi 


¿pea]sul asja Bulyjauos 

Op 0] juem spual JIeu JI YIOMEWOY 1194} 
op 0} jou exo Sj! yulyy (8) ¿Jooyos ul sy 
Sunye6 uey) jueyoduu! aso si SpUaL} YIM 
Je|ndod Buləq yulu! (p) ¿Ioouos o) Əuloo 
03 Bui^eu Əyl| 1,uop (9) ¿s1sə) joou5s uo 


Jeayo (q) ¿aun jo ejse e si sepeJ6 poof  Bulloouos 
(99) (89) (02) 69° weyl Joly =SG je 0} prey Burjsom yuly, (e) uw euin inoÁ uo senjeA 
99° eG IEZ cZ 8772 > — Ueu|Jjo9SuoN-zj| JO }SOW pueds noÁ spuall] BU} JO Auew MOH 1994 Sauna 


¿sBulu) snojəB5uep pue Áysu pip 

(J) Zem joouos e Buryeeuq jueeui ji JI ue^e 
Op 0j pe1ueA noÁ jeuM pip (9) siayoea) 0} 
xoeq pexje1 (p) zjoouos uiu ur ejuw Áqeq 
e pejeujej Jo Áqeq e peu (9) ;s8nup jebal 
Jayjo 10 euenfueui 3od pesn (q) /1onbi| 


(62) (91) (c9) (og) 1009 Map = G J0 ‘SUM '198Q ueJp (e) noÁ ji joooun SN 
ig (Que ¿AZ Oe E e = joooun Ala/\ = | JO [009 Sem}! YUIL] spueujjnoÁ pino ` 1e9d eAneBeN 
¿sBnup asn 


0} NOA uo aunssed znd (9) ¿09$ uey} asou 
yuom Buryyaulos uajo]s aaey (q) ¿s6ueb 
(9) (er) (69) (es) (pz) (ci uleu| jo Iv - G jeejs 40 unoÁ ui ese (e) uy eun anoÁ sIO/Neyag 
69' al BEA LE en, RE 80+ 6071 WAY] JO SUON = | jojsoui pueds noÁ spueuj əy} zo Jup MOH} Jeeg ago 


(c siseuyodÁH) seunseey aunyno 1994 
A 7772-7717 ERNREEE CEN UU eS Ee O 


808 ° Social Forces Volume 85, Number 2 + December 2006 


(ordures 1230} 29} 7O %7'98) T067 = SƏSLI PILA p 


'(s[dures ope18 q/ [e303 21 Jo 1972313 10 %86) 86ET 01 08€ T uro1j ogue1 sose pr[eA. 's1ope18 q1/ Suoure srsA[eue 10j ÁpUO posn oq [[IM samnseour ƏSL ə» 


*(sdures ope18 Y)1] Pue y)8 [830] 9Y} JO 1212213 10 %6) OE6T 0} TEST WO1} ƏSULI 


Sosed pl[eA “TT 01 8 apes woy sodueqo ssasse yey} sasdyeue r—eurpniyi8uo[ ur pəsn əq Ayuo [[IM pue apes 7 Y) UL pə12ə|[05 Jou 91IM SIIMSPIVI JSƏJY L q 


`(əldures apesd UIT T pue Y3 T230} IY} JO 1972213 10 %6) 9££7 0) 9777 WOY ogue1 soseo 


PILA "TT 01 Z 2pei8 u101j soSueup ssasse yey} sasdyeur [eurpnyrguo[ ur posn oq Áuo Tt pue apes qyu8ro ou] ut paj29[[02 oU 212M soJnseaui osa, x 


'(spdures 


[£303 21] JO 19]e218 IO 9656) LSEE OL PSTE Wor ISUBI SISPI pI[£A 'SuonerAop paepuejs 21€ sasoqjuo1ed ur si9qumNs "pepo» 2$12A2J JTE SWI. x ‘SOJON 


———— M — —— e — M— ——— M —— — HM —ə — 


(c1) 
— 2927 


(81) 
(E oce 


uonnqujsip 
vd9 9ujjo ejuenb uogoq eujulsyde ^ eoueuuojleg 
— = — -— — sof = | ‘OU=0 UJIA sjuepnis J0j e|qeueA snouiojoupiq 100d 
"'uonnqujsip 
vd9 eujjoe[ienb doy egy uisydo ^ eoueuuojed 
— — -— — — SsƏÁ = | 'ou = 0 UJIM Sjuepnis J0j ejqeueA snouiojoupIq poog 


'peAie291 juepnjs 

səpe! 1ejjo| Jo Jequunu [630] əu) Aq papinip 

(0X 8,3) + (1 X8/0) + (2 x 5,9) * (exs.) 

+ (y X Sy) :Jauueu Bulmo]oj ay] Ul peo 
(673. e Ue). sl (ce) uodeijse|uo sepeJ6 peuode.-jes Sjuepnjs ^ ,9oueuuopeg 
vocc A IA IES 0'y0)0 wo pejejnojeo (ydo) abesene julod apero) l00U9S 
seJnsee|y eoueuiojieg [euorjeonp3 


¿MJOM-[OOY9S UO 
puey yom oj jueuodui si 3 yutu) (p) ¿noÁ 
UjIM sBuiuy jenjoej[ejur Jeujo Jo 3yoMoou»s 


ssnosip oi eyi| (2) ¿ebajjo9 0} 06 oj ue|d Buijoouos 
(09) e OL EL) X02) WAY] JO ly = (q) ¿¡ooyos ur jjam op (e) yum awy JnoÁ uo sanje/ 
UPC 76:6 SE LE way] JO SUN =| Jo ]sou pueds noÁ spualy ay) jo Áueui MoH 1994 IMHSOH 


gd 


panu13uo) | 31q8L 


Revisiting Oppositional Culture Theory ° 809 


required. Whites' greater school achievement leads researchers to overlook this 
phenomenon. Ogbu (2003) emphasizes that affect toward school declines over 
time for blacks. However, he builds a theory of group differences by observing 
one group. The current findings suggest white-black achievement differences 
results from factors other than affective schooling attitudes. 


Hypothesis 3: Black children exhibit greater resistance to 
school than whites. 


Ogbu(1978) contends that because blacks do not share the premise that education 
leads to success, they resist school. He notes “[Is] it logical to expect that blacks 
and whites would exert the same energy and perform alike in school when the 
caste system, through the job ceiling, consistently underutilizes black training and 
ability and underrewards blacks for their education?” (Ogbu 1978:195) According 
to Ogbu, blacks’ repudiation of schooling is marked by truancy, delinquency and 
lack of serious effort. 

Ogbu (1991:444) notes using students’ attitudes toward school to measure 
school resistance is inadequate because “direct questions will generally elicit 
responses similar to those given by white Americans.” School resistance is 
better gauged by assessing students’ behavior; what they do or fail to do to 
improve academically (Ogbu 1991). As such, Hypothesis 3 is tested by assessing 
whether blacks seek he/p, and spend time on schoo/ activities/clubs, homework, 
and /earning activities outside school less than whites. 

The analyses in Table 4a do not support the claim that blacks put forth less 
effort to improve academically than whites. Findings show blacks seek help more 
and spend the same amount of time on homework and educational activities than 
whites. Also, both groups spend a similar amount of time on school activities/clubs 
after adjusting for socioeconomic differences. The lack of significant interaction 
terms indicates these patterns remain constant over time. These results do not 
contradict Ainsworth-Darnell and Downey's (1998) finding that black students 
put forth less effort in school than whites. Whereas their measure of effort was 
a scale of teacher assessments regarding student effort, this study measures 
effort as the things students do to improve academically. This measure makes 
issues of teacher bias in evaluations of minority students, which has been well 
documented (Alexander, Entwisle and Thompson 1987; Ferguson 1998; Leacock 
1985), less important. 

The finding of no difference in time on homework is consistent with other 
nationally representative studies (Cook and Ludwig 1997; Thernstrom and 
Thernstrom 2003). Studies finding group differences generally show only a modest 
black disadvantage (Ainsworth-Darnell and Downey 1998; Cook and Ludwig 
1998). This does not mean blacks are efficient with their time on homework, 
however. For example, in an analysis of 15 affluent school districts, Ferguson 
found black students were 20 percent less likely to complete their homework 
than whites (also, Thernstrom and Thernstrom 2003:143). 
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Hypothesis 3 is further tested by repeating the analyses for skip schoo/ and 
in trouble. Group differences on the importance students attribute to academic 


and non-academic activities are also examined. Ogbu (1978) notes the American 


caste system encourages a dual system of social/status mobility leading ethnic 


minorities to disinvest from academics and focus on perceived non-white domains 
such as sports, which they perceive will be more rewarding. In his recent study of 
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Shaker Heights, he notes (2003:28), “one major issue in prioritizing was deciding 


" 


between sports and academics." Therefore, blacks should find academic activities 


less important and value sports relative to whites, particularly as they get older. 


Table 4b indicates that although blacks do not skip school/cut classes more 
than whites, they are in trouble (i.e., suspended) more often than whites. Only a 


little more than one-third of their higher suspension levels can be attributed to 
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socioeconomic differences (b — .375 in Model 1 and .232 in Model 3). Findings 
also show while all students skip or cut classes more as they matriculate through 
the school system (b = .487 for grade 8 and .899 for grade 11 in Model 2), blacks 
report being truant slightly more than whites in the 8th grade (b — .154 for black 
x G8 in Model 4). However, their truancy levels are similar to those of whites by 
grade 11. The similarity between blacks and whites in frequency of skipping class 
has also been found among 10th graders in the NELS (Cook and Ludwig 1997). 

It is important to note “in trouble” (and skipping school) is not necessarily 
equivalent to school resistance. Several studies on blacks' differential disciplinary 
rates suggest school practices are partially to blame (e.g., Delpit 1995; Lewis 
2003; Ferguson 2000; Tyson 2002, 2003; Morris 2005). Specifically, these 
studies find cultural discontinuity between black families and the institutionalized 
structure of schools, which value cultural norms and standards of “mainstream” 
white middle-class society, results in school personnel placing greater emphasis 
on black children’s behavior. For example, Ferguson (2000) finds school personnel 
view the dress and behavior of black males as recalcitrant and oppositional and 
exert strict control over them. She notes culturally based assumptions about 
black males lead them to face constant regulation of their dress, behavior, and 
speech. Similarly, Morris (2005) finds whereas white and Asian American children 
are viewed as non-threatening, black (and Latino/a) children are considered 
dangerous and therefore face constant surveillance and greater discipline for 
behavioral infractions. In sum, these studies show “schools react to students 
based on perceptions of race and gender and use these concepts as a basis for 
specific patterns of regulation.” (Morris 2005:28) Thus, “in trouble” might reflect 
differential treatment black children experience in schools. The causal ordering 
might be reversed; being “in trouble” might be an effect of being a target of 
school personnel. 

The bottom panel of Table 4b shows blacks place greater importance on 
academic activities than whites (6 = .416 in both unadjusted and adjusted 
models). Furthermore, although the importance placed on academics declines 
over time for both groups (b = -.352 and -.600 in Model 2 for grades 8 and 
11, respectively), the non-significant interactions indicate blacks maintain this 
advantage as they matriculate through school. The findings also show blacks 
place greater importance on sports than whites (b = .178 in Model 1). However, 
this difference disappears after adjusting for social class. Interestingly, sports 
increase in importance for both groups in the 8th grade (6 = .267 in Model 2) and 
become least important for both groups in grade 11 (b = -.283 in Model 2). 


Hypothesis 4: High-achieving black children are negatively 
sanctioned by their peers to a greater degree than high- 
achieving white children: The “acting white” hypothesis. 


The most widely known tenet of the oppositional culture theory is the “acting 
white” hypothesis (Fordham and Ogbu 1986). Ogbu (1994) notes subordinate 
groups define certain behaviors, in this case achievement, as inappropriate for 
them because they are the domain of their oppressors. Therefore, academic 
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success is equated to "acting white." (Fordham and Ogbu 1986) There is empirical 


" 


n 


evidence showing that school resistance manifests as a fear of "acting white. 
(Fordham and Ogbu 1986) However, no study has systematically compared 


blacks with low achievement to those with high achievement on whether they 


" 


regard education as a "white" domain. Ainsworth-Darnell and Downey (1998) 


do show high-achieving blacks are more popular than their white counterparts 
regardless of whether achievement is measured as standardized test scores or 


grades. Furthermore, Cook and Ludwig (1997, 1998) find blacks do not experience 
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greater social cost for high achievement than whites. Unfortunately, these studies 
were unable to tap into students' perceptions regarding whether good school 
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“ 


performance is “acting white. 


Table 5a shows whether low-achieving blacks differ from those with high 


n 


“ 


achievement in the belief that getting good grades is a part of “acting white. 


In addition to this direct test of the “acting white 


hypothesis, findings are 


" 


reported for whether high-achieving blacks receive greater negative sanctioning 


from their peers than white students. Specifically, this study assesses whether 
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Findings indicate an equal proportion (only about 17 percent) of poor- 
distribution) believe good school performance is "acting white. 


high-achieving blacks and high-achieving whites. Because MADICS does not 
contain these measures in multiple waves, these analyses are conducted only 
performing blacks (those in the bottom quartile of the 7th grade GPA distribution) 
and good-performing blacks (those in the top quartile of the 7th grade GPA 


ability to make friends and trouble getting along with other kids differ between 
among /th graders. 
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the best illustration of whether good performance is negatively sanctioned is 


the fear of appearing white for performing well in school is not the reason for 
differences in school performance between these groups. The findings also 
show students’ level of school performance is not related to their ability to make 
friends and blacks find it easier to make friends than whites at all levels of school 
contained in the last outcome of Table 5a. These models show good-performing 
students have less trouble getting along with their peers (b = -298 in Model 


performance (note the non-significant interaction terms in Model 4 
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Notes: Numbers in parentheses are standard errors. White is the omitted category for race. Models 3 and 4 also include controls for sex and family structure. 


Chi-Square 





* Logistic Regression Models. Good performance, which is the omitted category, is measured as the top quarter of the grade 7 GPA distribution among Afri- 


can American students (GPAs 3.49, mean 


2.44). Valid cases 


3.71) while poor performance consists of the bottom quarter of the GPA distribution (GPAs < 2.83, mean 


241 (second and third quartiles of the grade 7 GPA distribution for African Americans are not included in the analysis). 


3.76) while poor perfor- 


2.44). Students in the second and third quartile of the GPA distribution 


3.21), which represent the students with average achievement, is the omitted category for performance. N for models 


predicting making friends and trouble getting along with peers are 1133 and 1144, respectively. 


Models. Good performance is measured as the top quartile of the grade 7 GPA distribution (GPAs > 3.54, mean 


> OLS Regression 


mance consists of the bottom quartile of the GPA distribution (GPAs < 2.84, mean 


(GPAs between 2.84 and 3.54, mean 
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2 and -.243 in Model 3) than average 
and poor-performing students (note 
the non-significant but positive 
coefficient for poor performance in 
Models 2 and 3). More importantly, 
Model 4 shows good-performing 
blacks do not have significantly more 
trouble getting along with their peers 
than good-performing whites. 

The outcomes for the analyses 
in Table 5b assess whether good- 
performing blacks are rewarded less (a 
social cost) by their peers than whites 
as they matriculate through the school 
system. Findings show popularity is 
constant across school performance 
(all years pooled) and grade levels, 
as illustrated by the non-significant 
interaction terms in Model 4. Table 5b 
also contains analysis on white-black 
differences in the likelihood of students 
reporting they have plenty of friends 
vs. not having any friends/wishing they 
had more friends. Blacks are about 
42 percent more likely than whites 
to report they have plenty of friends 
(b — .350 in Model 1). The analysis 
also shows good performing students 
(top quarter of the GPA distribution 
pooled across grades 8 and 11) are 
only 75 percent as likely as students 
with average school performance to 
report they have plenty of friends (b — 
-.285 in Model 3). However, although 
the effect is only marginal, the greater 
odds of having plenty of friends for 
good-performing blacks than good- 
performing whites shown in Model 4 
(black x GP) is opposite to what the 
resistance model predicts. 

The finding that black students 
receive no greater social cost for good 
school performance than whites is 
consistent with studies that use NELS 
data (Ainsworth-Darnell and Downey 
1998; Cook and Ludwig 1998). In 
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Black * G8 * GP 
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Black * G11 * GP 
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Chi-Square 9.14** 
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Notes: Numbers in parentheses are robust standard errors. White is the omitted category for race. Models 3 and 4 also include controls for sex and family structure. 
* OLS Regression Models. Grade 7 is the omitted category for grade and average-performing students is the omitted category for performance (students in 


the second and third quartiles of the GPA distribution, which are GPAs between 2.71 and 3.50, mean = 3.15). Poor performance are GPAs < 2.71 (mean 


2.21) and good performance are GPAs > 3.50 (mean 


3.76). N = 2862. 


° Logistic Regression Models. Grade 8 is the omitted category for grade and average-performing students is the omitted category for performance (GPAs 


3.77). N = 1716 


3.10). Poor performance are GPAs < 2.61 (mean = 2.08) and good performance are GPAs > 3.50 (mean 


between 2.61 and 3.50, mean 


(88% of the 8th and 11th grade sample). 


tp <.10 
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***p < .001 (two-tailed tests) 
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fact, differences between these 
groups favor blacks. Also, school 
matriculation has no effect. Horvat 
and Lewis (2003) introduce a 
framework that helps reconcile 
Fordham and Ogbu's findings with 
those of this and other studies 
that show no support for the 
"acting white" hypothesis. They 
found students "navigated and 
negotiated multiple friendship 
groups within the African 
American peer group. There were 
some cases of students modifying 
or downplaying their academic 
success .. . [and also] instances 
in which students share their 
academic success with peers who 
are supportive of their academic 
endeavors.” (Horvat and Lewis 
2003:266) Thus, the resistance 
displayed by some may be 
conditional on the peers present 
within a particular setting; they 
can “camouflage” their success 
around poor achieving blacks as 
Fordham and Ogbu (1986) observe, 
and they can also “discuss freely 
and receive applause for their 
academic achievements and 
aspirations” among high-achieving 
blacks as Horvat and Lewis find 
(2003:266). 


Hypothesis 5: The peer groups 
of black children have a greater 
counter-educational culture than 
those of white children. 


The hypothesis that blacks 
view academic success as "acting 
white" and negatively sanction 
high-achieving students suggests 
they negotiate a social space with 
a culture counterproductive to 
academic pursuits. Hypothesis 5 


is examined by first assessing whether black students’ peers have a negative 
culture in general (i.e., negative peer behaviors and negative peer values), 


which might lead to more opportunities for resisting school. Finally, the extent 
to which the peers of blacks place a lower value on academics than the peers 
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Table 6 illustrates black students do not have peers that engage in negative 
behaviors more than whites after controlling for differences in SES. Furthermore, 


negative behaviors among peers increase for both groups as they get older (b — 
.255 and .325 in Model 3 for grades 8 and 11, respectively), though the increase 
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is slightly larger among the peers of blacks (5 — .086 for black x G8 in Model 4). 
However, negative peer behaviors are similar for the groups by grade 11. Negative 
peer values also increase over time, though there are no group differences. 

The findings in the bottom panel of Table 6 are consistent with those in the 
top panel. Positive peer values on school decline for both groups toward the end 
of middle school (b = -.125 for grade 8 in Model 3) and in grade 11 are still lower 
than they were in grade 7 (b = -.118 for grade 11 in Model 3). Although Model 
3 shows blacks’ peers have modestly higher positive values toward school after 
adjusting for SES (b = .089), the findings in Model 4 are consistent with the 
resistance model; the positive values on school decline more over time for the 
peers of blacks than those of whites (b — -.128 for black x G11 in Model 4). 
However, results for the final outcome suggest this decline is not synonymous 
with an increase in negative peer values toward schooling. In addition to having 
less negative values toward school than the peers of white students (6 = -.137 
in Model 1 and -.143 in Model 3), the non-significant interaction term in Model 4 
indicates blacks' peers maintain lower levels of negative values on schooling than 
whites' peers over time. This is consistent with Ferguson's (2001) findings that 
black students' peers are not more oppositional to achievement than the peers of 
whites in Shaker Heights, the same community studied by Ogbu (2003). 


Discussion 


The goal of this study was to provide an extensive quantitative test of Ogbu's 
oppositional culture theory. Two major findings emerged that have several 
implications for the study of race and school achievement. First, major tenets 
of the theory were not supported, which challenges the existence of a pervasive 
oppositional culture among blacks. It seems the resistance model fails to account 
for the high degree of variability within this population. Second, maturation after 
grade 7 had minimal impact on group differences in the outcomes. While Ogbu's 
theory may hold true for some subgroups of the population, these findings suggest 
the extent to which the oppositional culture theory explains racial differences in 
achievement is limited. This raises two primary questions. 


1. /s there an alternative interpretation for the 
counterproductive schooling behavior by some blacks 
documented by Ogbu and his colleagues? 


The oppositional culture observed by Ogbu and others (e.g., Fordham and 
Ogbu 1986; Ogbu and Simons 1998; Ogbu 2003) among some blacks might 
not result from students' perceptions of their prospects within the opportunity 
structure or from their resentment of the dominant group. Tyson (2002) finds 
younger children are capable of having deep ambivalence about their relationship 
to school independent from beliefs about the opportunity structure. In a study 
of two all-black elementary schools, Tyson (2002:1184) finds "[C]hildren who 
were experiencing academic failure were more likely to express negative school- 
related attitudes than children who were not . . . children's negative statements 
about school reflected a desire to avoid further experiences of failure." 
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The same might be true regarding schooling behaviors. Congressional 
testimony by Braddock (1990) warned that children who "suffer a crisis in their 
academic abilities . . . will begin adopting counterproductive, effort-avoidant 
strategies." The narratives of some children in Tyson's study provide evidence 
of the onset of this process. Therefore, poor performance early in the schooling 
process might be partially to blame for the counterproductive schooling behavior 
observed among some blacks during adolescence, a reverse order of causation 
than the resistance model suggests. Tyson (2002) raises the question of whether 
negative (or oppositional) schooling attitudes and behaviors among some blacks 
reflect the masking of feelings such as fear, hurt or embarrassment resulting 
from prior academic failure and constant negative feedback from school officials 
about their cultural norms. Furthermore, numerous studies show despite having 
poor school performance and ineffective schooling behaviors, many black youths 
desire to learn and become upwardly mobile (e.g., Tyson 2002; Tyson, Darity 
and Castellino 2005; Carter 2005). Whereas negative attitudes of some blacks 
might reflect desire to avoid future experiences of failure, ineffective schooling 
behaviors might reflect deficiencies in skills necessary for academic success 
rather than lack of desire to learn. The divergence between the current findings 
and those of Ogbu and his colleagues highlight the importance of separating 
students' academic behaviors/results from their academic intent. 


2. How can future research add to the existing literature on 
the resistance model? 


Future studies should assess the resistance model among preadolescent 
children. Most prior studies focus on adolescents because counterproductive 
schooling behaviors are posited to be heightened during this stage, presumably 
because adolescents have more developed perceptions of the opportunity 
structure than younger children. However, although previous quantitative 
(Ainsworth-Darnell and Downey 1998; Cook and Ludwig 1997; 1998) and 
qualitative studies (O'Connor 1997; Akom 2003; Tyson, Darity, and Castellino 
2005: Carter 2005) do not find an oppositional culture to be pervasive among 
blacks during adolescence, it is possible lack of academic success early in the 
schooling process leads to poor educational practices prior to adolescence that 
compromises children’s achievement throughout their academic careers. 

Future efforts of data collection should focus on improving the breadth and 
quality of cultural measures in national datasets. Although the current study 
quantitatively assesses some nuances of the oppositional culture theory, it should 
be regarded only as a starting point. Ogbu's work tended to involve qualitative 
methods, which perhaps allowed him to solicit experiences and processes current 
quantitative data cannot. The lack of strong support for Ogbu's theory in this study 
suggests perhaps more dynamic quantitative techniques are necessary to find 
greater explanatory contributions of this theory. Perhaps quantitative researchers 
can better capture how adolescents process ordinary daily experiences by using 
more open-ended questions or obtaining respondents’ reactions to descriptions 
of scenarios they might encounter. Whereas nothing can substitute for the rich 
information obtained through participant observation, coding such information 
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would approximate the data gathered via qualitative methods." Also, further 
collaboration between quantitative and qualitative researchers is necessary. 

It is important to note the advantage held by blacks relative to whites on most 
measures in this study (even prior to adjusting for SES) should not be considered 
solely positive bias in their response patterns. As Ainsworth-Darnell and Downey 
(1998) eloquently note, determining whether blacks are overestimating or whites 
are underestimating the value of schooling is a non-resolvable debate. It is quite 
possible blacks’ attribute greater value to schooling than whites because they 
base their assessments on a more disadvantaged reference group than whites. 
Furthermore, findings from this study are consistent with other studies that 
use nationally representative samples to assess racial differences in school 
resistance (Ainsworth-Darnell and Downey 1998; Cook and Ludwig 1998), 
suggesting the patterns observed in this study might be systematic. The decline 
in youths’ favorable attitudes for academics during adolescence found in this 
study — regardless of race — was also found four decades ago by Coleman (1961) 
and more recently by Steinberg (1996). 


Conclusion 


Theories address why and how things occur and guide social science literature 
and policy-making. The way problems and their causes are perceived is equally 
important as (if not more important than) proposed policies. The implications of 
policy based on theories lacking widespread empirical support can be staggering. 
For example, in the late 1960s and 70s Jensen suggested a series of proposals 
for improving black achievement directly following from his theory of black 
genetic inferiority. He suggested since the basis for the gap was genetic, the 
gap in inherent intelligence could not be closed. Instead, Jensen (1969:112-17) 
proposed black children should be educated through special forms of classroom 
instructional techniques that emphasize associative learning (e€.g., memorization 
or rote learning, trial-and-error learning). 

Despite work by numerous scholars challenging the notion that an oppositional 
culture is prevalent among blacks (O'Connor 1997; Ainsworth-Darnell and 
Downey 1998; Cook and Ludwig 1998; Akom 2003; Tyson 2002; Tyson, Darity 
and Castellino 2005; Carter 2005), the theory remains popular among researchers, 
educational practitioners, and the general public. For example, in 7he Content 
of our Character (1990:51) Shelby Steele sternly asserts even in the very worst 
schools "there are accredited teachers who teach the basics, but too often to 
students who shun those among them who do well, who see studying as a 
sucker's game and school itself as a waste of time. One sees in many of these 
children almost a determination not to learn, a suppression of the natural impulse 
to understand, which cannot be entirely explained by the determinism of poverty." 
More recently, John McWhorter (2000:28) argued blacks suffer from a culture 
of “self sabotage” that “condones weakness and failure." However, treating 
the theory as an overriding explanation for racial differences in achievement 
might lead policymakers to narrowly conceive of the gap's causes and assume 
convergence is unlikely because blacks refuse to learn. 

The current study's lack of support for many of the theory's major tenets 
suggests blacks’ lower school performance should be open to alternative 
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explanations. Given the current findings (and those reported by O'Connor 1997; 
Ainsworth-Darnell and Downey 1998; Cook and Ludwig 1998; Akom 2003; Tyson 
2002; Tyson, Darity, and Castellino 2005; Carter 2005), researchers and policy 
makers should reconsider the extent to which they attribute blacks' lower school 
achievement to lower levels of desire for success. Evidence suggests black 
youths want to learn; they simply are not acquiring skills necessary for academic 
success. 


Notes 


d 


| use the term black to refer to people of African Diaspora, and to such 
populations residing within the United States. 


Previous studies employ the term "black" when racial comparisons are 
made on academic outcomes within the United States (e.g., black-white 
achievement gap). However, not all blacks have the same minority status 
within Ogbu's minority classification scheme (Ogbu and Simons 1998). 
Whereas African Americans are involuntary minorities, immigrant blacks (€.g., 
Caribbean Americans) are voluntary minorities — groups who willingly move 
to the United States seeking better opportunities (e.g., employment, greater 
political or religious freedom). They have no history of being oppressed by 
white Americans and are more trusting of them and their institutions for 
upward mobility. For further discussion on characteristics of black immigrants, 
see Waters (1999), who documents differences between African Americans 
and immigrant blacks from the West Indies. 


Although Ogbu (1978) extends the resistance model to Latino/as, they 
are excluded from this study. Ogbu claims Mexican Americans — who 
comprise nearly two-thirds of the Hispanics living in the U.S. (U.S. Census 
Bureau 2000) — also feel alienated from American society because of 
bitter memories of their incorporation into the United States via American 
imperialistic expansion in the 1840s. However, roughly 50,000 Mexican 
nationals remained within the newly acquired U.S. territory, a small fraction 
of the more than 20 million people of Mexican Ancestry currently living in 
the United States: most Mexican Americans are immigrants or descendants 
of immigrants who arrived after the Mexican revolution of 1910 (see Jaffe, 
Cullen and Boswell 1980). Despite often being regarded as the largest 
oppressed minority group within the United States, almost all Latino/a 
children in American schools derive from voluntary immigration. Ogbu's 
classification of this group as involuntary minorities is highly implausible 
(Thernstrom and Thernstrom 2003). 


It is important to note Downey and Ainsworth-Darnell (2002) were replying to 
Farkas, Lleras and Maczuga (2002), who provided a retest of the "acting white" 
hypothesis conducted earlier by Ainsworth-Darnell and Downey (1998). 


The resistance model was developed with reference to high school 
students. A key component of the theory is that school resistance 
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results from perceptions of an unfair opportunity structure? Connections 
between societal conditions (e.g., structure of opportunities and system 
of social mobility) and individual-level characteristics (e.g., race) are more 
developed during adolescence. Nevertheless, Tyson's (2002) work informs 
us younger children are capable of having deep ambivalence about their 
relationship to school. 


6. Table 7 serves as a further check for attrition bias. The first three columns 
for each group shows the seventh grade mean on the 24 outcomes in this 
study for the full sample, the retained sample (i.e., students present in the 
final wave), and the attrited sample (i.e., students not in the final wave). 
The difference between the retained and full sample is shown in Column 
2-1. There are no significant differences on any of the outcomes between 
the retained and full samples for both groups. Furthermore, Column 3- 
2 — which is informative in determining who attrites — indicates students 
who attrited from the sample do not differ from those who did not attrite 
on most outcomes. Relative to blacks retained in the sample, those who 
attrited had lower educational aspirations and expectations, affect toward 
school, and greater negative behaviors among their peers. However, with 
the exception of affect toward school, these differences also exist among 
whites, indicating group differences on these outcomes over time are not 
likely due to attrition bias. With regard to white students, those who attrited 
had lower perceptions of educational returns, spent less time on educational 
activities, and skip school and were in trouble more than those who did not 
attrite. The bias on these outcomes favors whites; in the current analyses, 
the retained black sample — which does not differ on these outcomes from 
their attrited counterparts — is being compared to the retained white sample 
which fairs better on these outcomes than their attrited counterparts. 
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Education and its Discontents: 
Overqualification in America, 1972-2002 


Stephen Vaisey, University of North Carolina at Chapel Hill 


The study of education-occupation mismatch, once central to the 
sociologicalinvestigationofthelabormarket, hasbeenlargelyabandoned. 
While labor economists and scholars in other nations continue to 
investigate overqualification, it has been more than two decades since its 
last sociological assessment in the United States. Drawing on previous 
work and guided by Bourdieu’s concept of habitus, I hypothesize that 
workers who have more educational attainments than needed for their 
jobs will be less satisfied with their jobs, be more politically liberal, and 
be less likely to endorse an effort-based achievement ideology. Using 
the 1972-2002 General Social Survey, I find that overqualification has 
increased substantially, that the expected effects are generally found, 
and that these effects remain relatively stable over time. I discuss the 
implications of these findings for understanding the persistence of 
existing stratification hierarchies. 


The study of mismatch between educational and occupational attainment, once 
central to the sociological investigation of the labor market, has now been largely 
abandoned. Burris (2005) recently noted that while interest in this phenomenon 
has not waned among all social scientists, sociologists (and American sociologists 
in particular) have contributed little to our current understanding. Of 42 articles 
published on overqualification' in the previous six years (as listed in the Social 
Science Citation Index), only four have been by sociologists and only three have 
treated the United States. For interesting international work on the subject see, 
Batenburg and de Witte 2001; Brynin 2002; Chauvel 2002; de Grip, Heijke and 
Willems 1998; Livingstone 1998; Lowe 2001). 

Overqualification — which occurs when a worker has more education than is 
required for the performance of his or her job — is an important social problem 
for several reasons. At a macro level, it deals with the intersection of two of 
the major institutions of modernity — education and the labor market. Though 
in recent years this terrain has been largely ceded to economists and human 
capital theorists (Burris 2005), sociologists have intellectual resources that make 
them well suited to understanding these processes as well (Gibbons 2005). 
Rather than assuming an unproblematic or automatic link between educational 
institutions and the labor market, overqualification research makes the relationship 
between them an empirical problem. As Berg (1971) argued in his seminal (and 
newly reissued [2003]) book, Education and Jobs: The Great Training Robbery, 
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educational expansion may have unintended and negative conSequences for 
society. To take one example, Berg (1971) argued that employers' increasing 
reliance on educational credentials for hiring can place severe limitations on the 
achievement of disadvantaged groups (see also, Burris 2005). At a micro level, 
overqualification is important because it matters for the way individual actors 
experience their lives. Work is not only about obtaining money or other material 
goods, but also about creating and sustaining worldviews and identities (Akerlof 
and Kranton 2005; Sayer 2005). 

Though labor economists have done excellent work developing more precise 
models to estimate the effects of overqualification on economic outcomes 
(Sicherman 1991; Cohn and Kahn 1995), there is much we do not know about its 
effect on social and political outcomes. By contrast, Berg (1971) looked at a host 
of worker outcomes, such as job satisfaction and political orientations. Burris 
(1983) continued this multidimensional approach with an assessment of the social 
consequences of overqualification, but his work is now more than two decades 
old. The time has come for an update. This paper therefore has two objectives: 
first, to assess the prevalence and social distribution of overqualification in the 
U.S. labor market between 1972 and 2002; and second, to explore the effects of 
overqualification on several economic, social and political outcomes. Based on 
these analyses, | conclude that the incidence of overqualification has substantially 
increased among all social groups since 1972, that it has significant effects on 
diverse outcomes, and that these effects have not diminished with time. 


Theory and Previous Research 


Many theorists have considered the question: "Where does overqualification 
come from?" This question has been intensely debated among and between 
advocates of human capital theory (Becker 1993; Freeman 1976) and those 
favoring a credentialist view of the labor market (Berg 1971; Brown 2001; Collins 
1971, 1979; Livingstone 1998, 1999, 2001). The intractability of this debate 
stems from the fact that both major schools of thought — human capital theory 
and credentialism — make similar macro-level predictions (i.e., a strong link 
between educational and occupational attainment) but rely on vastly different 
explanatory mechanisms. | focus here on the distribution and consequences of 
overqualification in the labor force. 

Though overqualification was once the object of intense interest among 
sociologists, one probable reason for declining interest is that it failed to live up 
to the theoretical predictions originally associated with it (Burris 2005). From the 
late 1960s, social observers from radical Marxists to government planners linked 
the coming glut of college graduates to predictions of wholesale dissatisfaction 
and revolt against the system that blocked their advancement (Berg 1971; 
Bowles and Gintis 1976; Carnegie Commission on Higher Education 1973; Gorz 
1967; see also Burris 1983). Blumberg and Murtha (1977:46) went so far as to 
call it "the central dynamic for social discontent emerging in America." While 
early research seemed to support the alarmist view, subsequent findings (not 
to mention relative political calm) slammed the door on it. Burris' (1983) analysis 
yielded little evidence for the radicalizing effects of overqualification (see also 
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Glenn and Weaver 1982). A few years later, Smith (1986) showed that the fear of 
declining economic returns to education (advanced primarily by Freeman [1976]) 
was also unfounded. After a brief downward blip, the economic returns to a 
college degree had not only remained stable, but had in fact rísen over time. Thus 
the two main predictions of overeducation's theoretical edifice had crumbled 
under the weight of the evidence. Moreover, these predictions had come with no 
explicit micro-level mechanism that would explain why a discrepancy between 
educational and occupational attainments — net of wage returns — would matter 
to workers in the first place. 

There is no single, unitary framework that will allow us to theorize the 
micro-level effects of overqualification. Yet, where this subject is concerned, 
seemingly disparate theories point in complementary directions. The work of 
Pierre Bourdieu, especially his notion of habitus, can provide a useful framework 
for understanding the how a mismatch between educational and occupational 
attainments might cause subjective problems for actors (Bourdieu 1990, chapter 
3; Sayer 2005, chapter 2). Though habitus can be a slippery concept, recent 
work in the sociology of education has profitably operationalized the habitus as 
occupational aspirations — or more generally, as an active orientation to probable 
future structural position that has been formed out of one's past structural 
position (Dumais 2002; McLelland 1990). What does this mean in practice? It 
means that individuals moving through the educational system come to expect 
an "appropriate" type of work upon completing their education. By this | do not 
mean simply a high paying job, but rather one that will allow the "right" kind of 
income, working conditions, associates and identity, to name a few (see Sayer 
2005, chapter 2; also Akerlof and Kranton 2005). Thus a lack of fit in the labor 
market can lead to cognitive dissonance and dissatisfaction when the worker 
finds himself in a position "beneath" that for which he was prepared (Festinger 
1957). In a similar vein, Merton's (1968) "anticipatory socialization" speaks to the 
general social process of generating occupational expectations. 

Given these considerations, the mechanism of the habitus can serve to explain 
the importance of status consistency to which several theorists have pointed 
(see Lenski 1954 for a related argument). Bourdieu, Merton, Festinger, Lenski 
and many others have approached this phenomenon with different theoretical 
vocabularies, but each points to a similar, basic conclusion: that actors desire to 
achieve consonance between their various social statuses and that social and 
cognitive discomfort that can result when they are not successful. This "drive" 
can perhaps be most profitably seen as grounded in the habitus; that is in one's 
internalized expectations about future trajectories. This theoretical orientation 
would lead us to believe that when actors' expectations are not met, dissonance 
and dissatisfaction will likely result (Festinger 1957). The question remains, 
however, whether workers will adapt to undesirable circumstances perhaps by 
blaming themselves (see Burris 1983; Smith 1986) or whether they will channel 
their dissatisfaction into more collective outlets. With this in mind, the primary 
emphasis here will be on testing whether overqualification is related to outcomes 
such as job dissatisfaction, political liberalism or rejecting the narrative of the 
“American Dream.” All of these represent some degree of the externalization 
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of blame - respectively, toward the employer, the government or widely shared 
views about the nature of success. 


Measurement and Data 


Measuring Mismatch 


Though overqualification is a seemingly simple concept, it has proven difficult to 
measure. The problem is determining how much education a job really requires. 
There are three general approaches to this. The first is to rely on the expert 
assessment of a job analyst, who rates the requirements of a given job (Burris 
1983; Livingstone 1998). The second is to ask workers how much education 
their job requires (Halaby 1994). The third is to estimate the average level of 
education in each occupation, and then to classify those who are (say) more 
than one standard deviation above the mean as overqualified (Clogg and Shockey 
1984; Verdugo and Verdugo 1989). In this paper, | rely on the first method, basing 
my estimates of required education on the General Educational Development 
(GED) scale developed by the U.S. Department of Labor. The GED scale is really 
the maximum value of three scales — reasoning, math and language (Livingstone 
1998). The GED scale does not capture "soft" (e.g., interpersonal) skills, but the 
presence of the language component implies that it does not measure technical 


Table 1: General Educational Development Scale 





Applies common sense understanding to carry out simple one or two step 
instructions. Performs simple counting. Learns job duties from oral instruction or 
demonstration 










Applies common sense understanding to carry out detailed but uninvolved 
instructions. Uses arithmetic with whole numbers. Files and copies data. 






Applies common sense understanding to carry out instructions involving several 
concrete variables. Makes arithmetical calculations involving fractions. Files and 
copies data. 


Applies principles of rational systems (e.g. bookkeeping, electric wiring systems) to 
solve practical problems. Performs algebraic/geometric procedures. Interprets 
technical manuals. 


Applies principles of logical or scientific thinking to define problems and interpret 
extensive technical instructions. Applies advanced mathematical techniques. 
Evaluates technical data and writes reports. 


Applies principles of logical or scientific thinking to a wide range of problems and 
applies them to a variety of abstract and concrete variables. Applies advanced 
mathematical techniques. Evaluates technical data and writes reports. 


Source: Livingstone (1998, Figure 1.1) 
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proficiency alone. Many jobs with a high language requirement necessitate, for 
example, the ability to communicate complex concepts to others. The values and 
definitions of the GED scale are given in Table 1. 

My choice of the job analyst method requires some explication. The general 
consensus in the literature is that the job analyst method and the self-report 
method are superior to the “plus one standard deviation” method in that they are 
less arbitrary and non-dichotomous (Halaby 1994; Cohn and Kahn 1995: van der 
Velden and van Smoorenburg 1997). Analysts have also found that while the job 
analyst and self-report approaches provide different estimates of the overall level 
of overqualification in a sample, they yield very similar coefficients as predictors, 
at least when predicting wages (van der Velden and van Smoorenburg 1997). 
There is therefore no clear advantage to choosing one method over the other. 
The current analysis is constrained by data availability. Very few U.S. surveys 
ask respondents about the educational requirements of their job and about non- 
monetary outcomes of interest. Those that do (notably the 1973 and 1977 Quality 
of Employment Surveys) are three decades old. Other data sources must therefore 
be used if we wish to make inferences about the incidence and consequences of 
overqualification over a longer period. 


Data 


The data come from the 1972-2002 General Social Survey (GSS). The GSS is 
a repeated cross-sectional study conducted by the National Opinion Research 
Center (NORC) that interviews approximately 1,500 non-institutionalized adults 
in the United States about every year. The data collection procedures have 
generally remained the same since 1972, although the 1972-74 surveys used 
a multi-stage area probability sample rather than a full probability sample. This 
may lead to slightly underestimated standard errors in these years, but correcting 
for such differences would not change the results. | exclude the 1982 and 1987 
oversamples from the analyses, and use the unweighted data. Aggregating 
surveys from the entire period presents an unparalleled opportunity to examine 
this issue over a 30-year span. Following previous work in this area (e.g., Burris 
1983; Clogg and Shockey 1984; Halaby 1994) | restrict the sample to full-time 
workers between the ages of 25 and 65. [his is not because the effect of 
mismatch would be less interesting for part-time workers; in fact, a comparison 
between full- and part-time workers would probably be interesting and important. 
Yet to maintain comparability with previous research, it seems that focusing on 
full-time workers is a better choice. With these restrictions, the final sample over 
the 30-year period is 18,172. 

To estimate the educational requirements for each occupation, | make use of 
data compiled by Autor, Levy and Murnane (2003) to measure changes in skill 
requirements over time. Autor and colleagues used information from the Dictionary 
of Occupational Titles (DOT 1977, 1991), in combination with special CPS and 
census samples containing both 1970 and 1980 three-digit census occupational 
codes, as well as more detailed DOT job codes, to estimate the GED values for 
1970 and 1980 census occupations at two time points — 1977 and 1991? These 
values were separately estimated for men and women, which (because men and 
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women tend to have different jobs within a given occupational tithe) reduces the 
jobs-within-occupation distribution bias that comes with operating at the three- 
digit census level (Halaby 1994). | attach these values to the 1970 and 1980 
three-digit occupational codes available in the GSS. (Where both 1970 and 1980 
occupation codes are available [1998-1990], | use the 1980 codes.) 

With this information, | have a reasonable indicator of GED educational 
requirements at two widely-spaced time points. To apply these estimates to 
other years, | use a linear interpolation, extending the observed average yearly 
change between 1977 and 1991 back to 1972 and forward to 2002 for each 
occupation. The rationale behind this is that the change between 1977 and 
1991 for a given occupation represents an educational requirements trajectory 
generalizable to the entire 1972-2002 period.? Though this rationale is debatable, 
it ultimately will have few consequences for two reasons. First, the difference 
between the 1977 and 1991 overall means is slight — only .11 GED units. Though 
perhaps surprising, what this means is that relatively little skill upgrading (within 
or between occupations) took place over that 14-year period (at least according 
to Department of Labor experts). Second, the 1977 and 1991 GED values are 
correlated at .99, which means that what skill upgrading occurred almost entirely 
uniformly across occupations. 

To convert the final GED score into an estimate of required schooling, | opt 
not to use the conversions suggested by Burris, Berg, Eckhaus and others (see 
Livingstone 1998 for an overview). Though they have several commonalities, 
these conversions are based on widely divergent assumptions about what 
kinds of educational distinctions matter and about what level of education is 
required to do certain tasks. Each therefore produces a very different estimate 
of the incidence of overqualification. For example, Livingstone (1998) shows that 
these measures produce estimates of overqualification ranging from 24 to 76 
percent of the full-time workforce in 1990. Rather than rely on unsupported a 
priori assumptions, | derive my GED-to-required education conversion from one 
of the few data sources that contains both three-digit occupation codes and self- 
reported job education requirements — the 1977 Quality of Employment Survey 
(QES). The OES is a national probability sample containing data gathered from 
1,515 workers aged 16 and older who were working for pay for 20 or more hours 
per week in the United States in 1977. To convert GED values to an estimate of 
required schooling, | regressed the respondent's self-estimated required years of 
schooling on the GED values from 1977. | did not assume that this relationship was 
linear and used Bayesian Information Criterion (BIC) comparisons^ and likelihood 
ratio tests to evaluate different polynomial specifications. Both criteria confirmed 
that a third order polynomial offered the best fit (required education = -.52 + 
6.96*GED -1.46*GED* +.14*GED°). Though the correspondence between the 
two is not perfect, the polynomial-transformed GED and self-reported required 
education are relatively highly correlated (.59, A? = .35). 

There are several indications that this conversion establishes a good 
correspondence between GED and required education. First, as Figure 1 
demonstrates, the modal difference between actual and required education is 0, 
centering the conversion on matched workers. Furthermore, half of all workers are 
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Figure 1. Matching between Actual and Required Education 
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classified within one year of a perfect match. Second, the full cross-classification 
of actual and required education (Appendix 1) also shows a significant grouping 
around the diagonal. Of course, the fact that some categories are more populated 
than others prevents a perfect correspondence, but the overall fit between the 
two appears to be a good one. Thus, there is considerable prima facie evidence 
that my estimate of required education is at least reasonably accurate at the 
three-digit census occupational level. 

At this point, readers may raise two objections. First, there is an assumption 
of a uniform level of school quality and student performance, and both may have 
decreased dramatically between 1972 and 2002. Second, because the latest 
estimate of job requirements is from 1991, the data do not consider the influence 
of widespread personal computing or other recent technological advancements 
in the workplace. Both are serious objections discussed in the conclusion where 
their plausibility is assessed in light of empirical results. 


Methods 


The first part of the analysis will describe the extent of overqualification in the 
working population over the 1972-2002 period. In this section, | follow Tsang and 
colleagues (1991) and use two benchmarks - the simply overqualified (at least 
one more year of education than required) and the highly overqualified (three or 
more years of education than required). Because one cannot be sure that those 
with only one or two years of "surplus schooling" are truly overqualified, | will 
focus mainly on the highly overqualified.° | further decompose the incidence of 
overqualification by gender, age, race and type of degree. 
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Thesecond part of the analysis will estimate the effects of education-occu pation 
mismatch on three outcomes predicted to be influenced by overqualification: job 
satisfaction, left political views, and achievement ideology. In addition, | look at 
the effect of overqualification on earnings to see if this method of estimation 
yields similar results to those found by Sicherman (1991) and Cohn and Kahn 
(1995) using a self-estimate of educational requirements. To measure earnings, 
| use the natural log of yearly income, adjusted for inflation to year 2000 dollars. 
To measure job satisfaction, | use the variable SATJOB, and contrast those with 
full job satisfaction to those without full job satisfaction. For left political views, 
| use the variable POLVIEWS, and contrast those who call their views "liberal" 
or "extremely liberal" to those who offer other responses. Finally, to measure 
belief in achievement ideology (i.e., endorsement of the "American Dream”), | 
use the response to the survey question GETAHEAD: "Some people say that 
people get ahead by their own hard work; others say that lucky breaks or help 
from other people are more important. Which do you think is most important?" 
The response is coded 1 if the answer is “hard work” and 0 if another response 
was given. The equation used to estimate mismatch effects has been used for 
both monetary (Sicherman 1991; Cohn and Kahn 1995) and non-monetary (Tsang 
et al. 1991) outcomes and is as follows: 


Y =PX +0E7+E? FORCERE 


where X is a vector of independent variables, E' represents an estimate of the 
required years of education for the respondent's job, E? represents the number 
of years of education beyond the job requirement (or O, if the respondent is 
either matched or underqualified), and E" stands for the number of years of 
- education below what is required for one's job (or 0, if the respondent is matched 
or overqualified). Thus a is a parameter estimating the effect of the education 
that is actually used (i.e., the effect of the job itself), +< is a parameter estimating 
the effect of additional years of education beyond what is required, and 8 is a 
parameter which estimates the effect of the years of underqualification. 

This model assumes that a match has occurred when E? — E" — 0. While 
this assumption is relatively unproblematic when workers are self-reporting 
job requirements, some may object that one cannot be sure that a match has 
occurred when using GED-derived measures. This objection, while of course true, 
should not significantly affect the analysis because in this case, a match simply 
refers to the most //kely point of matching given the GED-based estimate. There 
is no reason to believe that the estimates should be biased in either direction. 
Even so, as a sensitivity test, | conducted all eight analyses again using over- and 
underqualification variables that treat one or two years of mismatch as a perfect 
match; in essence, this specification allows a grace period before classifying a 
worker as mismatched. In every case except one, the substantive results for both 
measurement strategies are identical. For all analyses, | use the model in equation 
(1) (or a binomial logistic version for the non-monetary outcomes). Controls for 
age, age squared, the year of the survey, nonwhite ethnicity, never-married 
status and (for non-monetary outcomes) respondent income are also included.® 
In addition, because it is possible that men and women experience education- 
occupation matching differently, | conduct separate analyses by gender. 
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Whether a genuine mismatch effect can be inferred from these models 
depends on the sign and magnitude of these three parameters, and varies 
depending on the outcome variable under consideration. In the case of 
earnings, for example, an overqualification effect would be inferred if the 
returns to overqualification ("surplus education") were less than the returns to 
the required education (i.e., a >; see Sicherman 1991; Tsang et al. 1991: Cohn 
and Kahn 1995). This is because the model would predict that a worker with 
16 years of education (for example) would earn less as an overqualified worker 
than in a matched job. On the other hana, if the overqualification and required 
education effects were identical (a = q), then this would be the straightforward 
case of job-independent “returns to education.” 

The criteria for evaluating mismatch effects are different for different 
outcomes, depending on theoretical predictions. In the case of job satisfaction 
and achievement ideology, a mismatch effect can be inferred if the 
overqualification effect (7) is negative. This would reflect the fact that educational 
attainments that are matched by corresponding occupational attainments lead 
to social consonance, while educational attainments in excess of occupation 
achievements lead to dissonance and discontent. In the case of liberal political 
views, a genuine overqualification effect could only be inferred if the positive 
effect of overqualification (7) on left political views is significantly greater than 
the effect of required education (a). This is so because the effect of education on 
liberalism is already positive — for mismatch to have an effect beyond the primary 
education effect, there must be an acceleration of liberalism beyond the point of 
required education. 


Results 
The Incidence of Overqualification, 1972-2002 


Figure 2 shows trends in overqualification using both simple (1+ years) and 
highly overqualified (3+ years) benchmarks. Using the simple benchmark, 
overqualification has increased from around 30 percent to near 55 percent of the 
full-time working population. This estimate falls roughly in the middle of those 
that would be generated by other conversion schemes (see Livingstone 1998). 
The more conservative benchmark shows a trend from roughly 10 percent in 
1972 to around 20 percent in 2002. Both trends appear to be approximately 
linear. This visual impression was confirmed with BIC comparisons between 
binomial logistic models using linear and non-linear specifications of the survey 
year as predictors of overqualification. Contrasting linear, quadratic, cubic and 
all possible functional forms specifications of the model, the linear model 
comes out as the best for estimating the yearly trend for the percentage of the 
workforce with three or more years of overqualification. For the measure using 
the simple benchmark, a quadratic model offered the best fit, and showed a 
leveling off in recent years. 

If the measure used here is capturing overqualification as opposed to 
random noise or a measurement artifact, it should behave according to 
plausible predictions. While data limits the ability to test such predictions, it is 
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Figure 2. Overqualification 1972-2002, Full-time Workers Aged 25-65 ` 
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reasonable to infer that the state of the economy should have an impact on 
the incidence of overqualification. During hard economic times (i.e., when loose 
labor markets prevail), workers may be more willing to accept a job whether or 
not it fits perfectly with their level of education. One would therefore expect 
that the unemployment rate should have a positive effect on the incidence of 
overqualification. When lagged one year, the unemployment rate is indeed a 
significant positive predictor of overqualification (highly overqualified only; p = 
.003). The linear year measure is still highly significant net of unemployment, 
however (p « .000), and comparing the range of predicted probabilities for the 
year and unemployment rate demonstrates that the effect of the linear trend is 
approximately four times greater than the effect of the unemployment cycle. 
Thus, while economic cycles appear to play a role in promoting mismatch, they 
cannot account for the large, monotonic rise in its incidence. This result also 
suggests that the "highly overqualified" benchmark is probably a more realistic 
measure of overqualification than the simple benchmark, since only the former 
behaved according to predictions. 

By examining the trends in both educational attainment and educational 
requirements over the same period, the reason for the increase becomes clear 
-the level of education of the average full time U.S. worker has increased by 1.75 
years from 1972-2002, while the educational requirements of the average full-time 
job have increased by only .33 years.” This latter figure represents not only skill 
upgrading and downgrading within occupations, but also reflects changes in the 
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Table 2: Overqualification Among Full-time U.S. Workers by Selected 
Characteristics, 1972-2002 


Percent Overqualified (3 or more years) 


1972-82 1983-92 1993-2002 
Gender 
Men 11.6 16.1 20.3 
Women 94 14.5 19.8 
Age 
25-34 13.9 16.1 20.8 
35-44 11.0 17.8 19.0 
45-54 9.2 13.3 20.1 
55-65 6.6 10.1 18.9 
Race 
White 10.4 15.0 19.4 
Non-white 14.0 17.8 22.6 
Education 
0-11 4 6 123 
12 55 4.7 6.0 
13-15 125 15.0 178 
16 WT 257, 29.8 
17-20 39.7 45.3 51.0 


kinds of jobs workers hold. For example, the proportion of workers in technical 
and managerial work has increased, while the proportion in manufacturing has 
decreased. Changes in the mix of jobs held by workers account for a little more 
than half of the increase, while skill upgrading within occupations accounts for 
the rest. Thus, while there has been some upgrading of educational requirements, 
these increases have been exceeded by the expansion of educational attainments. 
Of course we cannot be sure about exactly what has actually happened on the 
demand side (especially since 1991), so these are rough estimates at best. It 
is noteworthy however, that Department of Labor experts saw little change in 
requirements between 1977 and 1991 — a period of significant computerization 
in the workplace (Autor, Levy and Murnane 2003). It may therefore be the case 
that the massive "upskilling" of jobs so often talked about in the media is less 
common than imagined (Rothstein 1999). 

In order to see more clearly the social patterning of overqualification, a 
number of descriptive statistics are included in Table 2: overqualification by 
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gender, race, age and educational level. The statistics in this table are based on 
the highly overqualified benchmark. Table 2 demonstrates that overqualification 
has increased over time in all categories. Men and women are about equally 
affected, with men slightly more likely to be overqualified. Non-whites are still 
significantly more likely to be overqualified than whites, despite their lower 
average level of education. These findings are not surprising given the numerous 
mechanisms that keep minorities from obtaining high quality work (Tilly 1998). 
Also not surprising is the finding that higher levels of education increase the risk 
of overqualification. 

A significant trend that appears in the table is the decreasing salience of 
age for overqualification; while the youngest workers are more than twice as 
likely as the oldest to be overqualified in the earliest period, this gap shrinks to 
almost nothing in subsequent years. A comparison using simple binary logistic 
regressions of overqualification on age in all three periods confirms this finding. 
The age coefficient in the third period is not statistically significant, while the age 
coefficient in the first period is highly significant (o =.000). This pattern undermines 
the objection that overqualification is simply a life course phenomenon; that is, 
it takes younger workers time to sort themselves into occupations that fit their 
credentials and training (Sicherman 1991). While this "bad-job-as-stepping-stone" 
argument is undoubtedly true as far as it goes (and may once have been truer than 
it is today), the age trends in Table 2 show that a life course view of occupational 
matching cannot explain overa// increases in overqualification. This may reflect 
the fact that younger cohorts are increasingly facing job markets where the 
number of qualified workers is greater than the number of commensurate jobs. 
It may also reflect breakdowns in traditional advancement structures such as the 
firm-internal labor market. Taken as a whole, what Table 2 demonstrates is that, 
while there are systematic differences in the risk of overqualification, its absolute 
incidence has increased among all sociodemographic groups. 


The Effects of Overqualification, 1972-2002 


Table 3 shows the results of the regression analyses. | first examine the income 
model to determine whether or not the measures used here produce findings 
similar to those obtained by Sicherman (1991) and Cohn and Kahn (1995) using 
self-report measures. Indeed, the analysis reveals that the magnitudes of the 
coefficients are similar and that the relationships between the three education 
coefficients are identical (compare Cohn and Kahn 1995:73, Table 2). A test of the 
equality of the education coefficients shows that the returns to required education 
are larger than the returns to overqualification (a > 1) for both men and women 
(F = 119.8 and 160.4, respectively). The same test confirms that the required 
education effect is also stronger in absolute terms that the underqualification 
effect (la | > 18!) for both men and women (F = 72.3 and 74.4). The strong 
consistency between these findings and those obtained using different data and 
measures provides additional evidence that the GED-based measure of required 
schooling is similar to subjective reports, and that these measures should yield 
equally valid results in each of the following analyses. 
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Table 3: Results from the Regression of Educational Variables and Controls on Selected Outcome 
GSS Full-time Workers, 1972-2002 


MEN Income (In) Job Liberal Achievement 
Satisfaction 
Required education (a) 116" .069** .088** -.007 
(36.73) (6.48) (7.42) (.50) 
Overqualification (+) .056** .001 We -.081** 
(10.51) (.08) (6.89) (3.90) 
Underqualification (5) -.065** Ore. -.026 .005 
(12.72) (4.40) (1.32) (0.22) 
Age (25 yrs. = 0) .036** -.015 .002 -.021* 
(14.18) (1.88) (.20) (2.02) 
Age’ -.001** .001** .000 .000 
(9.77) (3.64) (1.18) (1.66) 
Year (1972 = 0) -.005** -.005 -.020** .019** 
(5.85) (1.82) (6.23) (5.32) 
Non-white -.158** -.153* .546** -.364** 
(7.41) (2.25) (7.54) (4.41) 
Never married -.210** -.263** AGA -.100 
(10.23) (4.04) (5.93) (1.23) 
Income (In) s: EE -.144** 114 
(5.86) (3.69) (2.67) 
Constant 1.920** -1.680* -1.541** .301 
(89.15) (9.96) (8.20) (1.45) 
Ri 2il — — — 
m — 294.95 251.91 70.62 
N 8756 8419 8179 5630 
WOMEN 
Required education (a) 44 098" 139% SP: 
(36.85) (7.27) (9.33) (3.96) 
Overqualification (+) 062" -.039" 178% 074% 
(10.18) (2.00) (8.58) (3.00) 
Underqualification (8) 075% 000 -036 084" 
(10.13) (.01) (1.27) (2.66) 
Age (25 yrs. = 0) .030** .006 -.003 -.007 
(10.61) (.67) (.32) (62) 


n” ——————PFFFFF-777-/ 
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Table 3 continued 


.000 


Age’ -.001** .000 .000 
(7.14) (1.38) (.96) (.84) 
Year (1972 = 0) 005 -.016** -.001 .021** 
(3.31) (4.76) (.20) (4.98) 
Non-white -.021 -.447** 2387. -.154 
(1.01) (6.66) (3.27) (1.82) 
Never married 10655. -.246** .162* -.210* 
(2.87) (3.45) (2.14) (2.28) 
Income (In) — db .020 .029 
(3.97) (.47) (.59) 
Constant .902** Së -2.871** 1.445** 
(15.68) (8.36) (13.81) (6.13) 
R? He — — — 
x? — 312.14 221.86 70.92 
N 6941 6648 6295 4388 





Notes: Absolute value of t or z statistics in parentheses. Education measured in years. 
opin 0D epi -0l 


The model for job satisfaction yields the predicted mismatch effect for both 
.men and women. Unexpectedly, however, the mismatch effect appears as an 
overqualification effect for women and an underqualification effect for men. 
For women, each year of non-required education is associated with around a 4 
percent decrease in the probability of full satisfaction. Yet, this is the one analysis 
for which the “grace period” measurement strategy yielded a somewhat different 
result; using this specification, there were no mismatch effects for women. Thus 
this finding should be interpreted cautiously. For men, on the other hand, each 
year below the education requirement is linked to a 7 percent increase in the 
probability of full satisfaction. This finding is consistent with the predictions of 
the "classic" overeducation thesis (i.e., of rising expectations) but is harder to 
explain theoretically. Perhaps just as failing to meet one's expectations seems 
to produce a negative reaction, surpassing one's expectations may produce a 
positive reaction. Perhaps succeeding in an occupation where most other workers 
are more highly educated can lead to pleasingly high estimates of one's "native" 
ability. Without different kinds of data, however, this is pure speculation. 

The next model, which predicts liberal political attitudes, also shows some 
evidence of an overqualification effect. Unsurprisingly, both the required 
education parameter (a) and the overqualification parameter (+) are positive 
and significant, confirming that education tends to lead to liberalism whether 
required by one's job or not. However, a Wald test of the equality of the a and 
t coefficients confirms the "acceleration effect" of overqualification, at least 
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for men (x° = 4.43, p = .04). The effect of a non-required year of schooling on 
the odds of liberal political ideology is approximately 45 percent greater than 
that of a required year of schooling for male workers. For women, however, the 
difference between the coefficients falls short of the conventional benchmark for 
statistical significance (x? = 3.08, p = .08). It seems that more educated women 
are more liberal regardless of their occupation. In any case, the effects here 
are not massive; while an overeducation effect on political views appears to be 
supported, there is little evidence of the revolutionary consequences predicted 
by early theorists. 

The final model, which predicts the belief that hard work is the key to success, 
yields differing results by gender. For men, the overqualification parameter (1) is 
the only one of the educational parameters to be statistically significant. Each year 
of overqualification leads to an 8 percent decrease in male workers' probability 
of endorsing a hard work-only achievement ideology. For female workers, the 
picture is different — it appears that education, required or not, has a negative 
impact on this achievement ideology. A Wald test confirms that a = + = -è in this 
model (y^ «37, p.e .83). 


Changes in Effects over Time 


The models above represent the average effect of mismatch over time, controlling 
for the effect of a possible yearly trend in the dependent variables. This does not 
tell us, however, if the effect of mismatch has varied over the 1972-2002 period. 
Knowing whether and how mismatch effects have changed serves two related 
goals. The first and most obvious is that there is a simple descriptive value in 
discerning trends in mismatch effects — if such trends exist, they represent an 
interesting phenomenon to examine and explain. The second, more important, 
reason to examine these trends is that looking at effects over time can help 
confirm my measurement strategy. Skeptics may argue that the measures used 
here do not adequately track changes In educational requirements on the demand 
side (especially since 1991). | cannot completely refute this objection through 
data and measurement. However, if it is true that educational requirements 
have in fact been increasing more quickly than my measures reflect, we should 
observe an attenuation of overqualification effects over time; this is because 
the "overqualification" measured here would be decreasingly "real" (i.e., more 
and more upwardly biased). | therefore argue that evidence of stable mismatch 
effects over time serve as evidence of the reliability of the measures used in this 
research and the reality of overqualification as a legitimate social phenomenon. 
One way to measure such trends would be to add interactions between 
the survey year and the education measures (required, over and under). One 
limitation of this strategy, however, would be that only linear changes in effects 
could be modeled. What is more, entering three interactions with the survey 
year would introduce unacceptably high levels of collinearity and difficulties 
in interpretation. As an alternative strategy, | conduct separate regressions 
for each year of the survey (using all controls from Table 3) and create a new 
dataset containing the education parameters. | graph the results by year and use 
both visual and statistical means to spot trends. This approach has been used 
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Figure 3a. Trends in Educational Effects for Male Workers, 1974-2002 ` 
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Figure 3a. continued 
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profitably in previous research (e.g., DiMaggio, Evans and Bryson 1996). | present 
these graphs separately for men (Figure 3a) and women (Figure 3b). To facilitate 
discerning changes, | use a third order polynomial to draw trend lines. To test 
whether there are meaningful trends, | conduct regression analyses, modeling 
trends in the estimated coefficients with both linear and quadratic specifications. 
While significance tests of these trends are not strictly applicable, they are used 
here for heuristic value (see DiMaggio, Evans and Bryson 1996). 

Figure 3a shows that, despite some yearly volatility, there are few systematic 
changes in the value of the three educational coefficients or the relationship 
between these coefficients over time for male workers. There are a few 
exceptions that are noteworthy. First, the income premiums for required education 
and overqualification increased in parallel from the earliest time point, before 
leveling off around 1990. Over the same period, the underqualification penalty 
increased as well. These polar trends most likely reflect the increasing size of 
the college/non-college wage gap (Smith 1986). Again, the fact that the required 
education and overqualification coefficients are increasing in parallel implies that 
the measure of required schooling used is fairly accurate. If | were dramatically 
underestimating the increase in educational requirements over time (because 
of computerization or other factors), one would expect a gradual convergence 
between the overqualification coefficient and the required education coefficient. 
Again, this is because an increasing amount of "overqualification" would have 
become required education over this period. The results are consistent with the 
contention that these measures are sufficiently accurate and unbiased. 

There are two other significant trends in Table 3a. The effect of required 
education on job satisfaction has followed a U-shaped path from 1974-2002. 

. Why this should be the case is unclear. The relationship between required 
education and liberal political views has followed the same U-shaped trajectory; 
there has been a strong increase in the educational effect on liberalism in 
recent years. In fact, it appears that the required education effect has caught 
up with the overqualification effect. This means that the acceleration effect of 
overqualification — though true on average between 1972 and 2002 - has largely 
been absorbed by the effect of education in general. Apart from this change, 
however, there is no evidence that the mismatch effects reported in Table 3 have 
Systematically decreased over time. In fact, in the case of hard work ideology, 
the overeducation effect is actually increasing in strength. That is, overqualified 
workers are increasingly likely to disavow the "American Dream" of success 
through hard work alone. 

Figure 3b presents the same analyses for female workers. In contrast to the 
models for male workers, the coefficients here seem to be much more stable 
over time, with both fewer trends and fewer fluctuations. In fact, the only 
significant trend in any of the four models is the increasing income penalty for 
underqualification. Again, this probably reflects the growth of the college/non- 
college wage gap between the early 1970s and the early 1990s. Unlike for men, 
however, this has manifested itself only as a decrease for underqualified female 
workers rather than the polarization seen for male workers. In sum, Figures 3a 
and 3b provide some evidence that the effects of mismatch are relatively stable 
over time, with no clear strengthening or weakening trends across the board. 


Education and Its Discontents ° 853 


Figure 3b. Trends in Educational Effects for Female Workers, 1974-2002 
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Figure 3b. continued E 
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The results of these trend analyses lend credence to the measure of required 
education employed here. If the measure were systematically underestimating 
recentincreases in educational requirements on the demand side, the results over 
time would be less stable than they are. The GED-based measure undoubtedly 
contains substantial error, if for no other reason than it was constructed at the level 
of three-digit census occupational codes, which are somewhat heterogeneous 
with respect to actual jobs. Yet the trend analyses, coupled with the steps taken 
in the previous sections, strongly suggest that the measure is capturing real 
overqualification - a phenomenon with stable and significant effects. 


Discussion and Conclusion 


Within the limitations imposed by the data, the analyses undertaken here yield 
several significant findings. First, there is evidence that a substantial- and growing 
- number of American workers are overqualified for their jobs. Though economic 
cycles do have some impact on this phenomenon, the principal time-trend is 
positive and linear, and appears to be the result of the widening gap between 
a large expansion in educational attainment and only modest increases in job 
educational requirements over the past three decades. Second, overqualification 
is not limited to the young nor to any other particular demographic group, though 
disadvantaged groups do appear to be at higher risk. Third, as predicted by 
the theoretical mechanism of the habitus and the corresponding drive toward 
achieving structurally-conditioned expectations, overqualification is linked to 
psychological, social and political outcomes for full-time workers, although the 
effects are generally stronger for men than for women. Finally, the regression 
and trend analyses show no evidence that the effects of overqualification are 
diminishing, despite its increased prevalence. Within the limitations imposed 
by the data, these findings paint a picture of a growing phenomenon with 
substantively significant consequences. 

The discovery of gender differences in the mismatch effects seems to provide 
further support for the theoretical mechanisms | have argued are related to 
overqualification. Because the structural position of women may diminish the 
occupational aspirations associated with a given level of education (due, for 
example, to a greater anticipated likelihood of leaving the labor force, at least for 
a time), the subjective impact of a social discrepancy may be smaller for women. 
Indeed, work in social psychology (Thoits 1992) has shown that, on average, work 
is a less identity-salient role for women than it is for men, reinforcing the plausibility 
of this explanation. This is not primarily because of individual or natural dispositions, 
but because of our culture's disproportionate focus on paid work in men's lives. 
This difference can perhaps explain the fewer discrepancy effects for female as 
opposed to male workers. (As one reviewer suggested, this fits the general pattern 
of men's greater rebelliousness in crime, violence, political extremism and overall 
alienation from society.) It would clearly take more (and different) research to be 
sure if these mechanisms were indeed responsible for the effects outlined here, 
but the results are at least consistent with such an explanation. 

One interesting question that deserves to be addressed regards changes in 
the levels of the dependent variables over time. If overqualification is increasingly 
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common and it affects such outcomes as job satisfaction and political views, 
how have the aggregate levels of these outcomes changed? In the case of job 
satisfaction, separate analyses show that the percentage of full time workers who 
are fully satisfied has declined from a high of near 60 percent in the mid-1970s 
to around 45 percent in the year 2000 (though this figure rebounded to over 50 
percent in 2002). In this case, overqualification is likely a causal factor in this 
decline. More perplexing are levels of political liberalism, which have remained 
essentially unchanged, and belief in the "American Dream" of hard work, which 
has substantially /ncreased since 1972. Clearly there must be other mechanisms 
at work to offset the changes produced by overqualification. These mechanisms 
are certainly worthy of further research. 

At this point, | return to the two objections to my measurement strategy that | 
mentioned above but did not address. The first objection — purported decreases 
in levels of school quality and/or student performance - should be considered in 
light of the recent review by Michael Handel (2003), which explicitly addresses the 
widespread belief that decreases in school quality and student performance have 
rendered the U.S. labor force unprepared for today's "high tech" labor market. 
Handel demonstrates that there is no evidence of systematic decreases in either 
educational quality or in students' cognitive abilities. For example, reading and 
math test scores from the U.S. Department of Education's National Assessment 
of Educational Progress (NAEP) have been almost entirely stable since 1971 
to the late 1990s. Further, data from the National Adult Literacy Survey (NALS) 
provide no evidence for the common stereotype that younger workers are less 
literate than older workers are. In addition to these empirical refutations, Handel 
relies on historical analysis to show how the skills shortage paradigm emerged 
as labor economists’ response to growing wage inequality; that is, they chose 
to focus on individual workers rather than management strategies or patterns of 
job creation. Little attention has been paid, for example, to dramatic increases in 
"bad jobs" over the same period of increasing wage inequality (Kalleberg, Reskin 
and Hudson 2000). In the same vein, Handel (2003) shows that fear about school 
failure emerged as a "moral panic" in response to the economic threat posed by 
Japan rather than as a rational response to empirical analysis. In summary, then, 
the available evidence does not support the argument that decreases in school 
quality or student performance can explain away observed overqualification. 

The second possible objection is that since the latest job skill estimates in 
these data are from 1991, the measure of required education used here cannot 
capture the impact of widespread personal computing or other technological 
advancements in the workplace. This is a valid criticism, as far as it goes. Those 
who invoke it should remember that computers and other technology had 
significantly entered the workplace between 1977 and 1991 (Autor et al. 2003), 
and that the Department of Labor skill estimates only show modest increases in 
skill requirements during this period (see above). The linear interpolation of the skill 
trajectories should pick up some of any post-1991 increase in any case. Ultimately, 
though, this objection would only be fatal to the argument here if increases in 
skill requirements since 1991 could completely make up for the large increase in 
observed overqualification over the same period. This seems unlikely. Moreover, 
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any actual upgrading of skills that went undetected by the data would only serve 
to diminish the effect of overqualification on the outcome variables used in the 
regression analyses. Yet the supplemental analyses showed that the effects of 
overqualification are stable over time. Taken as a whole, then, these findings 
provide a compelling case for the reality of overqualification as a social fact. 

What are the implications of overqualification for processes of social 
stratification? For one thing, if the level of education is increasing faster than the 
education required to do available jobs, this may have serious consequences for 
individuals, especially in lower income families or among other disadvantaged 
groups. While the cost of higher education has increased dramatically over the 
past 30 years, available funding resources - state and federal grants, family 
incomes — have not kept pace in real terms (National Center for Public Policy 
and Higher Education 2002). Though the relative wage returns to post-secondary 
education remain high, this is in large part due to the fa/ling real wages of those 
without it (Livingstone 1998). Because the same jobs now cost more in educational 
terms than ever before, the families and individuals least able to pay the costs 
are the most likely to be squeezed out of the best sectors of the labor market, 
thus maintaining existing stratification hierarchies (at a higher absolute level). 
As others have argued, post-secondary education is not necessarily the key to 
upward mobility; rather, for many, it may be the only effective safeguard against 
downward mobility (Collins 1979). This explanation of increasing inequality can 
be viewed as a complement (and/or competitor) to the skills-biased economic 
change theory that is currently orthodox, which holds that a skills shortage among 
the disadvantaged is primarily responsible for increasing inequality (Handel 2003). 
It does seem that further research would be warranted to investigate this claim. 

The economic costs of overqualification do not only involve workers, however 
— they also affect employers. Overqualification implies underemployment 
— the underutilization of available skills and abilities. Given that an increasing 
proportion of workers may be underemployed, it is in employers’ interests to 
find new ways to make use of these untapped human resources. While there 
are material and other limits to work design, employers can certainly play a 
role in closing the gap between educational attainment and occupational 
content; in fact, it may be beneficial for them to do so. For example, Tsang 
and colleagues (1991) argue (citing several other studies) that decreases in 
job satisfaction brought about by overqualification can have serious negative 
effects on productivity (see also Petty et al. 1984). 

Finally, | do not mean to imply that education does not confer non-workplace 
benefits or that it is not desirable for its own sake (e.g., as a consumption good). 
Nor am | arguing that education should be made less available or portrayed as 
less valuable than it is today. My goal has been to point out that increases in 
educational attainments have unintended consequences that affect society and 
the economy in ways that are not often recognized. The limitations imposed by 
the available data necessarily restrict the precision of these results, but it does 
seem reasonable to conclude this: that the number of overqualified workers in 
the United States is substantial, that it has grown over the past 30 years, and 
that this phenomenon has social, psychological and political consequences that 
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are both substantively significant and theoretically consistent. Though more 
work is needed, this article provides a needed step in reviving the study of an 
underexamined social phenomenon. 


Notes 


1. This phenomena is also known as “overeducation,” “overschooling,” 
“overtraining” and “underemployment.” | see the terms as interchangeable, 
but generally favor "overqualification" as the least normatively charged. 


2. These data were generously provided by David Autor. 


3. Another possibility would be to use a linear interpolation, but only for the 
years between 1977 and 1991, assigning the 1977 values to the years 1972- 
1977 and the 1991 values to the years between 1991-2002. These different 
measurement strategies did not substantively change the results. 


4. Bayesian Information Criterion (BIC) comparisons are an appropriate method 
for choosing between nested and non-nested models (Raftery 1995). 


5. However, since this first part of the trend analysis deals primarily with 
aggregate levels of overqualification rather than with the classification of 
individual worker, the random measurement error should cancel out, yielding 
an unbiased point estimate of the level of overqualification in a given year. 


6. Descriptive statistics for all variables are given in Appendix 2. Alternative 
specifications of the models using quadratic forms of the survey year 
variable were estimated, but they were rejected in nearly all cases by BIC 
comparisons. In no case were the substantive results affected. 


7. Even if we only consider the change between 1977 and 1991 (the years 
for which specific DOT data exists), the average educational attainment 
increases by 1.14 years and the average educational requirement increases 
by only .26 years. 
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Appendix 2 

Descriptive Statistics for All Variables 
Variable N Mean SD Min Max 
Required education 181124 13.24 2.30 6 20 
Education 18172 13.49 2.96 0 20 
Overqualification (years) 18172 1.07 1.50 0 11 
Underqualification (years) 18172 .82 1.49 0 20 
Income (In) 15697 3.44 19 -.69 5.93 
Full job satisfaction 17463 .50 .50 0 1 
Liberal 15658 .28 45 0 1 
Hard work ideology 11348 .66 .48 0 1 
Female 18172 43 .50 0 1 
Age 19172 3. 41.05 2210.51 25 65 
Non-white 18172 .16 .36 0 1 
Never married 18172 A dl 0 1 


—— n eee 


Work Uncertainty and the Promotion of Professional Women: 
The Case of Law Firm Partnership 


Elizabeth H. Gorman, University of Virginia 


Work uncertainty may affect gender disparities in professionals upward 
mobility in organizational hierarchies. Professional work involves three 
forms of uncertainty — problem variability, strategic indeterminacy 
and dependence on autonomous actors — that weaken the association 
between performance and ability, leading organizational decision- 
makers to weigh gender more heavily in promotion decisions. Strategic 
indeterminacy also increases the need for trust, so decision-makers feel 
more comfortable promoting candidates of their own sex. These ideas 
are explored using data on promotions to partnership in U.S. law firms. 
Findings show that (1) promotions are less likely to go to women when 
work involves greater problem variability and strategic indeterminacy; 
and (2) the negative effect of strategic indeterminacy becomes stronger 
as the proportion of male partners increases. 


Which organizations offer professional women better prospects for career 
success? Organizations vary considerably in the extent to which women achieve 
promotion, rank and authority (Blum et al. 1994; Cohen et al. 1998; Guthrie and 
Roth.1999; Reskin and McBrier 2000). What Reskin (2003) calls "organizational 
mechanisms" may accentuate or diminish the role that gender plays in employers' 
hiring and promotion decisions. Certain organizational policies, such as the 
presence or absence of clear selection criteria or accountability requirements, 
may preclude or permit intentional discrimination on the basis of gender (Reskin 
2001; Reskin and McBrier 2000). Other organizational characteristics may create 
conditions that restrict or facilitate biases in the assessment of men's and 
women's competence (Ridgeway 1997). 

One of the settings where organizational features may sustain gender inequality 
is the legal profession. Since the mid-1990s, women have accounted for nearly 
half of law students and entering law firm associates (Epstein 1993; Epstein 
et al. 1995), but they still represent only 17 percent of partners in large firms 
(National Association for Law Placement 2005). Both social scientists (Epstein 
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et al. 1995; Kay and Hagan 1998, 1999; Spurr and Sueyoshi 1994) and legal 
scholars (Reichman and Sterling 2002, 2004; Rhode 1994, 1996) have shown 
considerable interest in studying women's success in law firms. Recent research 
has examined the effects of firm characteristics such as gender-stereotypical 
hiring criteria (Gorman 2005), client relationships (Beckman and Phillips 2005), 
and the previous experience of firm founders (Phillips 2005). 

An organizational dimension that has been largely overlooked is the level of 
uncertainty involved in an organization's work. Some studies offer hints that 
work uncertainty may put women at a disadvantage. For example, Kanter (1977) 
concluded that the uncertainty involved in managerial work makes organizations 
less willing to promote women into management. Mcllwee and Robinson (1992) 
found that women engineers were less successful in moving up in organizations 
where jobs were unstructured than in bureaucratic, rule-based organizations 
where work was more routinized. More generally, researchers have long 
observed that uncertainty leads decision-makers to place greater weight on social 
characteristics when selecting candidates for promotions and other opportunities 
(Dalton 1951; Pfeffer et al. 1976; Salancik and Pfeffer 1978). Nevertheless, the 
relationship between work uncertainty and gender inequality in organizational 
mobility remains underexplored. 

After outlining three forms of uncertainty that are common in professional 
work, | examine the relationship between uncertainty involved in the nature 
of law firms' practice and firms' propensity to select women for promotion to 
partnership. When firm partners assess candidates' abilities to perform well in 
the partner role, work uncertainty is likely to reduce the weight given to past 
performance and increase the weight given to gender, regardless of the gender 
of existing partners. At the same time, one form of work uncertainty — strategic 


indeterminacy — may be especially disadvantageous to women when decision- 
makers are male. 


Uncertainty in Professional Work 


Many aspects of jobs can be uncertain, including the security of employment 
itself. | focus here on uncertainty in the nature of the work that professionals 
do. For the purposes of this article, | adopt Abbott's (1988:8) broad definition 
of professions as exclusive occupational groups applying abstract knowledge 
to particular cases, a definition that extends beyond the traditional professions. 
Like all production processes, professional work involves the transformation of 
inputs into outputs (Mills and Moberg 1982; Perrow 1967). The "raw materials" 
for professional work are problems — constellations of actors and their different 
goals, resources, opportunities and risks (Abbott 1988:35: Delmestri and 
Walgenbach 2005). Legal work, in particular, is often described as “problem- 
solving.” (Blasi 1995; Weinstein 1998) Problems may involve either troublesome 
situations to be remedied, such as a dispute over the performance of a contract, 
or desirable projects to be completed, such as the negotiation of a corporate 
merger. The outputs of professional work are services aimed at securing 
satisfactory resolutions of problems. 
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For heuristic purposes, the professional production process can be divided 
into three phases that follow each other in a logical sequence (Abbott 1988:35- 
52; Mills and Moberg 1982). In practice, phases may be skipped or repeated. 
In the first phase, a professional obtains information concerning the nature 
of the problem and classifies it in one of the categories recognized by his or 
her professional knowledge base. In the second phase, he or she selects a 
course of action. In the third phase, the professional implements the prescribed 
strategy, either alone or in interaction with others. Uncertainty can arise in each 
of these three phases. 

In the first phase, uncertainty stems from the variability of the problems that 
must be handled (Daft and Macintosh 1981; Perrow 1967). When variability is 
greater, the nature of the work to be performed at any given time usually becomes 
less predictable (Argote 1982; Comstock and Scott 1977). Unpredictability is 
consequential to the extent that different problems require different skills and 
knowledge. In that case, professionals must choose between maintaining forms 
of expertise that may not be used and risking that the skills they lack may be 
needed.' In professional work, problem variability is generally greater when a 
professional's practice encompasses a greater range of substantive fields (Argote 
1982). In the legal profession, problem variability is likely to be greater for lawyers 
who work in several practice areas than for those who are narrowly specialized. 

During the second phase, uncertainty arises if the professional knowledge 
base does not clearly link the relevant category of problems to a specific course 
of action. In that situation, the best strategy is indeterminate (Jamous and Peloille 
1970; Wilensky 1964). Problems are linked to standard solutions when the 
causal processes generating them are perceived to be straightforward and well 
understood (Perrow 1967) or (in professions whose knowledge base consists 
of normative rules for judging human behavior, such as lawyers, accountants 
and clergy) when the relevant rules are clear and easily applied to new sets of 
facts (Farago 1980). Thus, lawyers can rapidly determine whether a claimant 
meets clear-cut requirements for Social Security disability payments (Weinstein 
1998) and can use "boilerplate" wills to address most estate planning needs 
(Van Hoy 1995). In contrast, when legal rules are ambiguous, it is impossible 
for legal analysts to derive strategies through simple logical deduction (Moore 
1981; Stratman 2004). 

In the presence of strategic indeterminacy, a professional must exercise 
judgment in selecting a course of action, generally through reasoning by inference 
(Abbott 1988:48-52). The professional recalls previous problems of the same 
type — either from her or his own experience or from the reported experience of 
others — and identifies strategies that were successful or unsuccessful in those 
cases. The greater the similarity between these previous cases and the present 
one, the stronger will be the professional's inference that the same strategies 
will bring about the same outcomes. Thus, lawyers faced with ambiguous legal 
rules turn to previous cases that presented similar issues, identify factors that 
led to court decisions for one party or the other, and apply those factors to the 
case at hand (Stratman 2004). Whereas information-gathering (first phase) and 
strategy implementation (third phase) require interaction with nonprofessionals, 
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the process of inferential reasoning and judgment is "a purely professional act.” 
(Abbott 1988:40). Conversely, when legal rules are clear and no inference is 
required, lawyers’ work may be delegated to — or appropriated by — non-lawyers 
(Abbott 1988:51; Van Hoy 1995). 

In the third phase, uncertainty is greater to the extent that execution of the 
professional's chosen course of action depends on other actors whose behaviors 
are not fully subject to the professional's control. Most professional strategies 
require cooperation from clients, which clients may or may not provide. The 
provision of legal services involves complex processes of negotiation between 
lawyer and client, in which clients may resist as well as collaborate (Felstiner and 
Sarat 1992; Kritzer 1998; Mather et al. 1995). Corporate clients, who can provide 
continuing flows of legal business, are especially likely to insist on exercising 
control in their relationships with lawyers (Abel 1989: 204; Heinz et al. 2001). 
Moreover, other parties close to the process may interfere. Accountants may 
fail to provide information that a lawyer needs to prepare securities registration 
documents, for example, or litigation opponents may adopt tactics that prevent 
a lawyer from persuading a judge or jury. 


Work Uncertainty and Double Standards for Inferring Ability 


Work uncertainty may affect organizational decision-makers’ assessments 
of candidates’ ability. Decision-makers charged with making promotions try 
to assess whether candidates have the ability to perform the job in question 
successfully (Jackall 1988:42; Pinfield 1995:265-66).? Ability cannot be measured 
directly, however, so decision-makers must infer it from observable evidence. 
In particular, decision-makers are likely to consider candidates’ records of 
performance, especially of tasks similar to those involved in the position to be 
filled (Pinfield 1995:256-60). If a candidate has performed successfully in the 
past, that will tend to support an inference that she or he possesses the requisite 
ability; a poor track record will tend to support the opposite conclusion. 

An alternative approach is to infer ability from social characteristics (Kanter 
1977:61-63; Pfeffer et al. 1976). In particular, decision-makers are likely to 
emphasize gender, race, social class and other dimensions that determine an 
individual's social status (Ridgeway 1997). Theory and research indicate that 
people treat such characteristics as having evidentiary relevance for inferences 
about an individual's ability (Berger et al. 1980; Conway et al. 1996). Relevance 
may be direct or indirect. In the case of gender, if the task at hand is culturally 
sex-typed (for example, child care or automobile mechanics), decision-makers 
perceive a direct association between gender and ability to perform the task. 
Even if the task is not sex-typed, gender still has indirect relevance to ability, 
because men are presumed to be more generally competent than women. In the 
absence of clear evidence to the contrary, this general competence is assumed 
to extend to the task at hand (Dovidio et al. 1988). 

Decision-makers considering candidates for promotion combine performance 
and gender information to reach an aggregate expectation about a candidate's 
ability (Berger et al. 1980; Humphreys and Berger 1981). If a man and a woman 
are perceived to have performed equally well in the past, inferences based on 
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gender will still raise the ability expectation for the man and lower it for the 
woman (Pugh and Wahrman 1983; Wagner et al. 1986). To generate expectations 
of equal ability, then, women must have stronger performance records than men 
do. In effect, women must meet a higher standard of performance - a "double 
standard” — before decision-makers will conclude they are equally competent 
(Foschi 1992, 1996; Foschi et al. 1994). 

Work uncertainty affects this process by creating doubt about the validity of 
ability inferences based on past performance. All three forms of work uncertainty 
should weaken the usefulness of past performance as an indicator of inherent 
ability. First, when there is greater variability in the types of problems that 
professionals encounter, past success in handling problems of one type may say 
little about a candidate's ability to handle different problems in the future. Indeed, 
if a lawyer will be called on to handle matters in multiple areas of practice, that 
lawyer's general problem-solving competence may seem more important than 
past success in handling specific tasks — and general competence is associated 
with the male gender. In contrast, when work involves the predictable recurrence 
of similar problems, repeated successful performance in the past is probably a 
good indicator of ability to perform well in the future. 

Second, when the relevant knowledge base does not establish a clear course 
of action and professional judgment is required, it is also difficult to reach 
conclusions about an individual's ability. In general, uncertainty surrounding the 
best approach to producing a desired outcome is often linked to uncertainty on 
the part of evaluators about the producers' levels of quality (Podolny 2005:227- 
29). Past performance, in particular, may be an unreliable basis for such 
evaluations. When applicable rules are ambiguous and the relevant principles 
of cause and effect are unclear, strategies that work well in one case may not 
work well in the next, even if the problems involved are similar (Perrow 1967; 
Thompson 1967:159). For example, a decision to seek summary judgment may 
rapidly resolve one lawsuit, but antagonize the judge or provoke a response from 
a quiescent opponent in another. In such cases, inferences of competence based 
on past successes may prove ill-founded in the future (Kanter 1977:52-53). 

Finally, when the implementation of a strategy depends on autonomous 
actors, those actors may alter the planned course of events in significant 
ways (Thompson 1967:159). As a result, the causal connection between an 
individual's actions and the ultimate outcome may be questionable (Kanter 
1977:52). For example, if a court hands down a judgment favorable to a 
lawyer's client, it may be difficult to say whether this result was due to the 
lawyer's skill. In the next case, with a different judge or different opposing 
counsel, the outcome might be different. 

Doubt about the usefulness of past performance as an indicator of ability 
is likely to affect decision-makers' reliance on past performance in forming 
ability expectations. Although decision-makers combine inferences from past 
performance and gender, the weights given to each factor may vary. When there 
is no question that past performance reflects the relevant ability, observers base 
inferences of ability on performance alone, ignoring gender (Foddy and Smithson 
1999). However, when a successful or unsuccessful performance may reflect 
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factors other than ability, or when the ability demonstrated in the past may not be 
the ability needed in the future, decision-makers are likely to pay less attention to 
past performance and more to gender. The more tenuous the link between past 
performance and future ability, then, the more lenient the performance standard 
is likely to be for men and the stricter it is likely to be for women, creating a 
greater advantage in performance expectations for men. 


Hypothesis 1. When a firm's work involves higher 
levels of each of the three forms of work uncertainty 
— problem variability, strategic indeterminacy, or 
dependence on autonomous actors — the firm is less 
likely to promote women. 


Work Uncertainty and Preferences for Same-Gender Candidates 


In the case of strategic indeterminacy, the negative effect described above is 
likely to be exacerbated when organizational decision-makers are male. According 
to Kanter's (1977) theory of "homosocial reproduction," people are more 
comfortable with — and more willing to trust and cooperate with — others who 
are similar to themselves along social dimensions such as sex, race and class. 
This view is supported by research in the tradition of social identity theory, which 
finds that positive ingroup bias is especially strong with respect to interpersonal 
dimensions such as friendliness, cooperativeness, loyalty and trustworthiness 
(Brewer and Brown 1998; Insko et al. 1990). Consistent with these arguments, 
female lawyers working in male-dominated settings experience less trusting 
relations with both their colleagues and their firms as institutions than do their 
male peers (Kay and Hagan 2003). 

Kanter (1977:49-53) argues that decision-makers' bias toward socially similar 
others is intensified when employees must work together under uncertain 
conditions that require the exercise of discretion. When this is the case (as it 
was among the managers she studied in a large industrial corporation), each 
individual's boss and coworkers must rely on her or him to judge wisely and 
choose appropriate and effective courses of action. Understanding, trust and 
cooperation become especially important. In promotion decisions, then, 
uncertainty surrounding strategic choices should strengthen male organizational 
decision-makers’ preference for men. 

Research on organizational modes of coordination and control offers a slightly 
different — but entirely compatible — route to the same conclusion. When workers 
must exercise discretion, organizations cannot rely on bureaucratic rules and 
performance programs to control workers’ productivity. Instead, they turn to 
cultural modes of control. Employees who have internalized organizational values 
and norms push themselves to perform well (Kunda 1992; O'Reilly and Chatman 
1996). Normative control is easier when employees are socially homogeneous 
and already share similar cultural understandings (Collins 1979). Thus, this 
argument too implies that strategic indeterminacy will intensify male decision- 
makers’ preference for male candidates. 
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Hypothesis 2. 7he effect of strategic indeterminacy will be 
stronger when more of a firm's decision-makers are men. 


Data and Methods 


| apply these ideas to the case of United States law firms that serve corporate 
clients.* Such firms are typically organized as partnerships (or substantively 
equivalent forms). Firms employ junior lawyers as "associates" during a 
"partnership track" period that typically lasts between six and ten years. At 
the end of this period, the firm's partners consider the remaining members 
of an associate class for promotion to partnership (Galanter and Palay 1991). 
Although firms traditionally maintained an "up-or-out" policy, many firms have 
now begun to employ experienced lawyers in non-partner positions on a 
more or less permanent basis, and these lawyers are sometimes promoted to 
partnership (Gorman 1999).5 

The unit of analysis in this study is the event of promotion to law firm 
partnership. | examine a nationwide sample of such events occurring between 
the late summer or fall of 1994 and the corresponding period in 1995 within large 
U.S. law firms. The sample is based on the 1995 and 1996-1997 editions of the 
National Directory of Legal Employers (the "NALP Directory") prepared by the 
National Association for Law Placement (1995; 1996), a nonprofit organization 
established to provide information about legal employment to law schools 
and their students. The NALP conducts an annual survey of several hundred 
law firms, asking for quantitative and qualitative information, and compiles the 
results in annual editions of the NALP Directory. Firms report information as of 
February 1 of the publication year. Law firms with multiple offices usually provide 
information specific to each establishment. After excluding establishments 
where no promotion event occurred or the number of promotion events could 
not be determined, the sample included 503 establishments. 

In these establishments, 1,450 promotion events took place. Information on 
promotions comes from the 1995 and 1996 editions of the Martindale-Hubbell 
Law Directory (“Martindale-Hubbell”), a widely-used directory listing virtually every 
law office in the United States, together with names, titles and brief biographies of 
individual attorneys (Martindale-Hubbell Inc. 1995, 1996). Information published 
in each edition of Martindale-Hubbell is collected during the second half of the 
prior calendar year. A promotion event was deemed to have occurred when a 
lawyer was listed as an associate or another non-partner title in the 1995 edition 
of Martindale-Hubbell and as a partner in the same firm in the 1996 edition. 


Dependent Variable 


The dependent variable measures whether the promotion event involved the 
promotion of a man or a woman (1 = female, O = male). Coding was based on 


lawyers’ first names.’ 
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Explanatory Variables 


To create measures of the three forms of work uncertainty identified above, | 
began with establishments' lists of their practice areas, provided in the NALP 
Directory. | collected data on up to 20 practice areas per establishment and 
coded each practice area by matching it with one of the fields of practice 
identified by Heinz, Laumann and their colleagues (Heinz and Laumann 1982; 
Heinz et al. 1998). Measures of the problem variability, strategic indeterminacy 
and dependence on autonomous actors experienced by lawyers working in 
each field were derived from questions in a 1995 survey of a random sample 
of 788 lawyers practicing in the Chicago area sponsored by the American Bar 
Foundation (Heinz et al. 2001; Heinz et al. 1998)? To create establishment-level 
measures for each field within each establishment, | weighted each of the three 
field-level measures by the proportion of establishment lawyers working in that 
field, obtained from the NALP Directory. For each establishment, | then created 
three weighted-average measures - one for each of the three forms of work 
uncertainty — by summing the weighted field-level measures of each form of 
uncertainty for that establishment. 

The question used as the basis for the field-level measure of problem variability 
asked lawyers who practiced in each field to assess the extent to which they 
specialized in that field or maintained a more heterogeneous practice, using a five- 
point response scale ranging from “The area of law in which | work is so highly 
specialized that it demands | concentrate in just this one area,” to “The nature of 
my legal practice is such that | can handle a range of problems covering quite a 
number of different areas of legal practice.” The percentage of respondents in 
each field who chose the three response points closest to the “quite a number of 
different areas” pole measures the extent to which lawyers working in that field 
also work in other fields and thus experience variability in their practices.? 

The question used as the basis for the field-level measure of strategic 
indeterminacy asked lawyers who practiced in each field to assess the extent 
to which practice in that field was routine or required professional inference and 
judgment, using a five-point response scale ranging from "A para-professional 
could be trained to handle many of the procedures and documents in my area 
of law," to "The type and content of my practice is such that even an educated 
layman couldn't really understand or prepare the documents." The percentage of 
respondents who chose the two response points closest to the latter pole captures 
the strategic indeterminacy encountered by lawyers working in that field. 

The question used as the basis for the field-level measure of dependence on 
autonomous actors asked lawyers who practiced in each field to assess the extent 
to which practice in that field involved isolated analysis or required interaction 
with clients and other parties, using a five-point scale ranging from "My area 
demands skills in handling technical procedures rather than skills in negotiating 
and advising clients," to "My specialty and type of practice require skills in 
negotiating and advising clients, rather than detailed concern with technical 
rules." The percentage of respondents who chose the two response points 


closest to the "negotiating and advising" pole taps the extent of dependence on 
autonomous actors in that field. 


Uncertainty, Gender and Law Firm Promotion * 873 


For each establishment, | averaged the counts of male partners and of all 
partners from the NALP Directory (as of Feb. 1, 1995) and Martindale-Hubbell (as 
of fall 1994) (Cronbach's alphas were .998 and .986, respectively), and calculated 
the proportion of men among establishment partners. 


Control Variables 


Unless another source is specified, measures of control variables were obtained 
from the 1995 edition of NALP Directory. | calculated the proportion of women 
among non-partner lawyers from counts of lawyers. Although sex segregation 
across fields of legal practice is low (Epstein et al. 1995; Gorman 2005; Jacobs 
1989), | controlled for the proportion of each establishment's lawyers working in 
each of three broad areas that could be perceived as gender-typed, which might 
influence either lawyers' preferences or firms' promotion decisions: /itigation 
(masculine), business transactions (masculine), and people-oriented fields, such 
as family law, trusts and estates law, and employment law (feminine) (see Epstein 
1993; Pierce 1995). 

| also controlled for the rapidity of legal change in the establishment's practice 
areas. Rapidity of change in the laws may help women's promotion chances, 
at least when uncertainty in the nature of work is controlled. When the law is 
Subject to rapid change, yet each change is relatively straightforward (for example, 
changes in the number of parts per billion of a chemical allowed in factory 
emissions or in the dollar amount subject to estate tax), strategic indeterminacy 
is low, practitioners are likely to be viewed as technical experts, and gender 
should seem less important. This variable was constructed in the same way as 
the measures of work uncertainty, based on a question that asked respondents 
practicing in each field to rate that field on a five-point scale ranging from “Things 
don't change too rapidly in my area of the law, so there is little need for constant 
revision of my knowledge and activities,” to “My area requires a great deal of 
reading of legal material in order to keep abreast of new developments.” 

Establishment size is measured by the count of lawyers in the establishment. 
To capture the size and scope of the establishment's firm, a binary variable 
was coded "1" if the geographic dispersion of the firm's offices was national or 
international and "0" otherwise.'? Controls were also included for the average 
number of lawyers promoted to partnership over the years 1991-1996 and /ocation 
in the South, where attitudes on many dimensions are more conservative.'' As a 
rough measure of the economic rewards offered by an establishment, | controlled 
for the 1995 starting salary offered to entry-level associates. Dummy variables 
represent minimum billable hours requirements (i.e., hours billed to clients) of up 
to 1,800 hours per year, 1,801 to 1,900 hours per year, and more than 1,900 hours 
per year (the reference category is no minimum hours requirement). Other dummy 
variables indicate whether an establishment's part-time work policy permitted 
part-time work only on a case-by-case basis or made it available to all lawyers 
(establishments in the reference category did not allow part-time work). Family 
leave policy is coded "1" if the establishment's benefits include parental or other 
family-related leave (other than pregnancy disability leave), and “0” otherwise. 
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Table 1: Descriptive Statistics for Variables Used in the Analysis 


A a ee 
Variable Mean S.D. Min. Max. 
Promotion given to a woman Zo 43 0 1 
Problem variability 58.23 7.54 19.00 76.24 
Strategic indeterminacy 62.59 5.13 38.65 85.71 
Dependence on autonomous actors 46.20 5.74 19.05 62.00 
Proportion of men among establishment partners .87 .05 45 1 
Proportion of women among non-partner lawyers .36 .09 0 1 
Extent of establishment practice in: 

Litigation .34 19 0 1 

Business transactions law 139 .16 0 .96 

People-oriented law .10 .10 0 1 
Rapidity of legal change TT.18 4.75 48.18 88.79 
Establishment size 132.67 83.87 6 546 
National or international firm .61 49 0 1 
Mean number of lawyers promoted, 1991-1996 3.81 2.49 .20 14 
Part-time work allowed case-by-case 30 48 0 1 
Part-time work available to all .58 49 0 1 
Family leave policy .64 48 0 1 
Minimum billable hrs requirement = 1800 22 EN 0 1 
Minimum billable hrs requirement — 1800 / « 1900 .21 41 0 1 
Minimum billable hours requirement > 1900 10 .29 0 1 
Location in South .20 40 0 1 
Starting salary (in thousands of U.S. dollars) 64.83 10.43 SED 90 
Number of promotion events 1450 


Note: These statistics describe data on promotion events in large U.S. law firms between late summe; 
or fall 1994 and late summer or fall 1995. 


Results 


Table 1 reports means, standard deviations, and ranges for the variables used 
in the analysis, based on the sample of 1,450 promotion events.'? The average 
promotion occurred in an establishment where 87 percent of existing partners 
were men. It is challenging for anyone, male or female, to attain partnership 
in these firms. Although the typical establishment employed 133 lawyers, it 
promoted only four new partners per year, on average, from 1991 to 1996. Still, 
advancement appears to be more difficult for women than for men. Overall, only 
25 percent of promotions were given to women, although women represented, 
on average, 36 percent of associates and other non-partner lawyers. 

| used multivariate logistic regression to model the probability that a promotion 
event involved the promotion of a woman. Because promotion events in the 
same establishment are not independent of one another, Huber-White standard 
errors are calculated. Table 2 presents the results, including odds ratios to aid 
interpretation. One-tailed tests of statistical significance are used where they are 
justified by theory; otherwise, two-tailed tests are used. 

In Model 1, both problem variability and strategic indeterminacy have the 
expected significant negative effect on the probability that a promotion will go to 
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Table 2: Logit Coefficients from the Regression of Female Promotion on 


Organizational Characteristics 


Variable 
Problem variability 


Strategic indeterminacy 
Dependence on autonomous actors 


Proportion of men among 
establishment partners 

Strategic Indeterminacy x proportion 
of men among partners 

Proportion of women among non- 
partner lawyers 


Extent of Establishment Practice in: 


Litigation 

Business transactions law 

People-oriented law 

Rapidity of legal change 

Establishment size 

National or international firm 

Mean number of lawyers promoted, 
1991-1996 

Part-time work allowed case by case 

Part-time work available to all 

Family leave policy 

Minimum billable hrs requirement 
«1800 

Minimum billable hrs requirement 
- 1800/ «1900 


Minimum billable hrs requirement 
1900 


[AAA A .___eaeC——Q---1>xxəÑəəƏRlp 


Model 1 Model 2 
Coeff. Odds Ratio Coeff. Odds Ratio 
-.033** .968 -.036** .965 
(.014) (.014) 
-.075** .927 -.080**** .923 
(.026) (.026) 
-.019 .981 -.020 .981 
(.019) (.019) 
-2.966** .052 231035 .045 
(1.091) (1.077) 
-.559* 572 
(.294) 
2.674*** 14.504 2:53] 12:572 
(.623) (.630) 
.703 2.021 .659 1.933 
(.611) (.611) 
1.136 3415 1.057 2.877 
(.658) (.664) 
-1.089 397 -1.099 333 
(.691) (.700) 
0517 1.053 .048* 1.050 
(.023) (.024) 
-.002 .998 -.002 .998 
(.001) (.001) 
-.082 922 -.092 912 
(.134) (.135) 
.099*** 1.104 2100757 1.105 
(.028) (.028) 
241 T272 .201 1:223 
(.281) (.279) 
5611 1.752 att 1.705 
(.276) (.276) 
.090 1.095 .097 1.102 
(.123) (.123) 
3321 1.394 3064 1.358 
(.156) (.157) 
-.246 182 -.277 .158 
(.159) (.160) 
-.316 129 -.291 148 
(.215) (.213) 
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Table 2 continued , 
Location in South -.3444 .709 -.346T 107 
(.154) (.155) 
Starting salary (in thousands of -.004 .996 -.005 ¿995 
U.S. dollars) (.008) (.008) 
Constant -3.207 -9.439 
(2.147) (1.870) 
Log Likelihood -784.03 -782.27 
x 108.95 109.08 
Number of promotion events 1450 1450 


*p<.05 *p<.01 ***p< .001 (one-tailed tests); !p < .05 (two-tailed test). 
Huber-White standard errors are in parentheses. 
“Variable is centered. 


a woman. When the problem variability score increases by 1 point (over a range 
of 19 to 76), the odds that the firm will promote a woman decrease by about 
3 percent. When the strategic indeterminacy score increases by 1 point (over 
a range of 39 to 86), the odds that a woman is selected decrease by about 7 
percent. These results support Hypothesis 1. The point estimate for the effect of 
dependence on autonomous actors has the anticipated negative sign, but it falls 
short of statistical significance at conventional levels (p < .16, one-tailed). 

The proportion of existing partners who are male depresses the odds that 
the firm will promote a woman. An increase in the proportion of men from 0 to 
1 would reduce the odds that a promotion is given to a woman by roughly 95 
percent. This result is consistent with other research finding that the gender 
composition of high-level employees affects the probability that positions will be 
filled by women (Chambliss and Uggen 2000; Cohen et al. 1998). 

To test Hypothesis 2, Model 2 introduces an interaction between strategic 
indeterminacy and the proportion of partners who are male. Because both 
variables were centered before the multiplicative interaction term was created, 
the coefficient for strategic indeterminacy reflects that variable's effect when 
the proportion of men is at its mean. Similarly, the coefficient for the proportion 
of male partners reflects that variable's effect when strategic indeterminacy is 
at its mean. Both of these coefficients are negative and statistically significant. 
For an establishment with the mean proportion of male partners, an increase 
of 1 point in the strategic indeterminacy score would reduce the odds that a 
promotion goes to a woman by approximately 8 percent. For an establishment 
with the mean strategic indeterminacy score, an increase in the proportion of 
male partners from 0 to 1 would reduce the odds that a promotion is given to a 
woman by about 95 percent. 

The result of most interest in this model is the coefficient on the interaction 
term, which is negative and significant. The negative effect of strategic 
indeterminacy on the promotion of women is thus especially strong when a 
firm's existing partners are more heavily male. At the same time, the negative 
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effect of a predominantly male partnership is especially strong when a firm's 
work involves more strategic indeterminacy. This result supports Hypothesis 2. 

Several of the other results are interesting as well. The more lawyers an 
establishment typically promotes, controlling for establishment size, the greater 
the probability that a promotion will go to a woman. This result is consistent 
with Phillips’ (2005) finding, based on a sample of Silicon Valley law firms, that 
firms are more likely to make women partners when they experience high 
partnership growth. It is also in harmony with Blum, Fields and Goodman's (1994) 
observation that women’s promotion prospects are better in organizations where 
vacancies occur more frequently. These findings may be explained by the idea 
that employers have “labor queues” (Reskin and Roos 1990) that rank men ahead 
of women, so that women are selected only for vacancies that remain after 
qualified men have been promoted. If so, women should have a better chance 
when the total number of vacancies is higher. 

It is also noteworthy that more promotions go to women in firms that make 
part-time work widely available than in those where part-time work is not allowed. 
(The point estimate for a case-by-case part-time policy is also positive, but falls 
short of statistical significance.) This result is inconsistent with the perceptions 
of many lawyers that part-time work has negative consequences for promotion 
to partnership (Epstein et al. 1995; Ziewacz 1996). Of course, we do not know 
whether the women promoted in firms with generous part-time policies ever 
took advantage of those policies. Still, firms that offer part-time schedules seem 
to be more open to the promotion of women. 

In contrast, policies allowing lawyers to take family-related leave do not 
increase firms' propensity to promote women. It may be that many women who 
take such leaves decide not to return, or that those who return have fallen behind 
their peers in skill development. Leave policies may also tend to reinforce cultural 
schemas of "work devotion" and "family devotion" that portray work and family 
responsibilities as incompatible (Blair-Loy 2003). 

Finally, it is interesting that women's promotion prospects are better in offices 
with relatively low minimum billable hours requirements than in those with no 
hours requirement. The coefficients for higher hours requirements are negative, 
as we might expect, although they do not reach statistical significance. As long 
as hours requirements are not too high, they may benefit women by providing an 
objective standard of productivity and commitment. 

Figure 1 illustrates the effect of problem variability on the predicted probability 
that a promotion will be given to a woman, with other variables fixed at their 
sample means, based on the estimates from Model 1. For a problem variability 
score of 20, near the sample minimum, the predicted probability is .52. The 
predicted probability steadily decreases to .13 in an establishment with a problem 
variability score of 80, just above the sample maximum. 

Figure 2 depicts the effect of strategic indeterminacy on the predicted 
probability that a woman is selected, using the estimates from Model 2, at the 
25th and 75th percentile values of the proportion of men among existing partners 
(.84 and .90, respectively). In an establishment where 84 percent of the partners 
are men, the predicted probability falls from .74 for a strategic indeterminacy 
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Figure 1. Predicted Probability that a Promotion Will Be Given to a Woman at Different 
Levels of Problem Variability 
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Figure 2. Predicted Probability that a Promotion Will be Given to a Woman at Different 
Levels of Strategic Indeterminacy, by Proportion Male Partners 
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score of 30 to .06 for a score of 90. In an establishment where 90 percent of the 
partners are men, the decline in the predicted probability is steeper, dropping 
from .88 to .02. It is interesting that at low levels of strategic indeterminacy, 
predicted probabilities are actually higher in establishments where the partnership 
is more heavily male. This suggests that predominantly male partnerships seek 
to increase the representation of women, but only when the level of discretion 
and the corresponding need for trust are low. 


Discussion and Conclusion 


This study has investigated the impact of the uncertainty involved in an 
organization's work on variation across organizations in the pattern of gender 
inequality in promotion, using empirical data concerning promotion in law firms. 
Professionals encounter three forms of uncertainty in their work. First, the types 
of problems that professionals must solve may vary substantially from one day 
or week to the next. When problem variability and unpredictability are greater, 
it is more difficult to maintain the skills and resources needed to solve different 
problems. Second, the relevant professional knowledge base may not specify 
a clear course of action for the problem at hand. In that case, the professional 
must reason by inference and exercise judgment in choosing a strategy. Third, 
implementation of the chosen strategy may depend on the behavior of other 
actors not subject to the professional's control. 

Among law firms, measures of two forms of work uncertainty — problem 
variability and strategic indeterminacy — have clear negative influences on the 
probability that promotions are offered to women. These findings are consistent 
with the argument that uncertainty heightens the performance standard 
used to infer ability from women's performance, making it less likely that a 
woman will be judged as able. Decision-makers are likely to consider both past 
performance and gender in drawing inferences about a candidate's ability to 
perform a higher-level job. The male gender carries a presumption of general 
competence that extends to the particular ability at hand; the female gender 
carries a corresponding presumption of lack of competence. To offset these 
presumptions, women must perform better than men do. When these forms 
of uncertainty are high, the link between performance and ability is noisy and 
weak, and past performance is an unreliable predictor of future performance. 
Decision-makers may therefore tend to discount evidence of past performance 
and place greater weight on gender, with the result that even more outstanding 
performance is required of women before competence will be inferred, while 
performance standards become more lenient for men. 

The negative impact of strategic indeterminacy on women's promotion 
prospects in law firms is greater when there are more men among the group of 
people making promotion decisions. This finding provides support for Kanter's 
(1977) argument that uncertainty intensifies decision-makers' preference for 
candidates of their own gender, giving rise to "homosocial reproduction." It is 
consistent with the view that people are more willing to trust others who are 
socially similar, rather than those who are socially different, to make the "right" 
decisions. When strategic indeterminacy is high and coworkers must rely on each 
other to exercise discretion wisely, trustworthiness becomes particularly salient. 
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Although the findings are consistent with the theoretical mechanisms 
discussed above, | cannot rule out the possibility that other unobserved processes 
may also link work uncertainty to gender disparities in promotion. A limitation of 
the study is that no individual-level information other than gender is available, so 
it is possible that gender differences in individual characteristics are correlated 
with work uncertainty. In particular, it is possible that women's performance 
under uncertainty is objectively worse than men's, and this weaker performance 
reduces their promotion rates. However, this interpretation of the results seems 
unlikely. An objective difference in the performance of women and men under 
uncertainty cannot explain the interaction between uncertainty and the gender 
composition of the partners making promotion decisions. Presumably female 
partners could perceive such an objective difference as well as male partners 
could. More importantly, no empirical research, to my knowledge, supports the 
suggestion that women's performance under uncertainty is objectively worse 
than men's. In the case of lawyers, in particular, we know that men and women 
enter law firms with essentially the same amounts of education and relevant 
experience, and so presumably have equivalent skills. Indeed, in the 1990s, 
women's performance in law school was actually better than men's (Clydesdale 
2004). Of course, it is possible that their skills diverge over the course of their 
employment, so that male lawyers become more capable of handling uncertainty. 
But such a skill divergence would itself probably be due to firms” differential 
allocation of clients and work assignments to men and women, rather than to 
innate differences between them. 

Although the findings here cannot be statistically generalized beyond law 
firms, they can be theoretically generalized in several directions that future 
research should investigate. First, work uncertainty is likely to affect the gendered 
allocation of other opportunities besides promotion, including hiring, work 
assignments and the formation of mentoring relationships. For example, in law 
firms, challenging work assignments play a crucial role in enabling associates to 
develop the skills and social contacts they need to function effectively in senior 
roles (Wilkins and Gulati 1996; Ziewacz 1996). When work uncertainty is high, 
both a stronger perception of male competence and a greater need for trust are 
likely to lead partners to give the most desirable assignments to men. 

Second, it is probable that the processes linking uncertainty and gender 
inequality in promotion also operate in other professional and managerial 
contexts. Most professional and managerial work involves substantial uncertainty. 
In addition, most professional and managerial promotion decisions are largely 
based on subjective impressions, which are vulnerable to the influence of 
stereotypes and ingroup preferences. Future research should assess whether 
the causal relationships found here in law firms do in fact hold in other settings 
involving professional or knowledge-based work. 

Third, the logic of the theoretical arguments offered here extends beyond 
gender to other socially meaningful dimensions, such as race and ethnicity. 
Gender and race in the workplace have often been studied separately, but it is 
likely that many similar processes operate in both cases (Reskin 2003). Greater 
work uncertainty should mean that members of all lower-status groups will 
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face higher hurdles for proving ability. At the same time, greater uncertainty 
should mean that members of such groups have fewer opportunities to prove 
themselves, because organizational leaders will not feel comfortable entrusting 
socially dissimilar employees with responsibility. Thus, the causal relationships 
found here with respect to gender are likely to hold for race and ethnicity as well. 

Finally, although the arguments made here have focused on professionals, 
the level of work uncertainty in developed nations may be rising over time 
across a wide range of occupations. Increasingly, simpler, more routine jobs 
are being outsourced to lower-wage countries or performed by computer-based 
technology. Many of the remaining manufacturing and service jobs are becoming 
more demanding as organizations adopt more flexible, less hierarchical "high 
performance" structures and practices (Green 2004). If this view is correct, 
relationships among gender, uncertainty and promotion that hold for professionals 
may increasingly apply to other workers as well. Future research could usefully 
explore this issue by investigating the impact of work uncertainty on gender 
disparities in upward mobility in various industrial and occupational settings and 
at various levels of organizational hierarchies. 


Notes 


1. To the extent that variability decreases familiarity with specific types of 
problems, it may also increase the frequency of encountering strategic 
indeterminacy. This has led to confusion in previous attempts to define 
similar concepts (see Fry 1982; Withey et al. 1983). 


2. Most professional positions call for multiple abilities and skills. For simplicity, 
the term "ability" is used here to refer to the entire set of abilities and skills 
needed for successful performance. 


3. Asaseparate matter, gender bias may also influence performance evaluations, 
so that men's performance is evaluated more highly than women's (Goldberg 
1968), although evidence on this point is mixed (see Bowen et al. 2000; Swim 
et al. 1989). Gender bias in the evaluation of performance is distinct from 
gender bias in the inference of ability from performance after an evaluation 
has been made (Foschi 1992). 


4. There is little overlap between the “hemispheres” of the bar that serve 
corporate and individual clients (Heinz and Laumann 1982:127-75; 
Heinz et al. 1998). 


5. Recently, a growing number of firms have added a second tier of 
salaried non-equity partners who may or may not be eligible to move to 
equity-owning status. In supplemental analyses, | estimated the logistic 
regression models presented below on single-tier and two-tier firms 
separately. Because results were highly similar for both, the two groups 
were combined for the final analyses. 


6. Personal communication from Martindale-Hubbell Customer Relations 
(Nov. 21, 1996). 
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7. In the case of androgynous names, | examined the information provided 
for clues as to gender. For example, a lawyer named "Chris" who was a 
member of a state women's bar association would be coded as female. In 
the absence of useful information, | followed previous studies (Beckman and 
Phillips 2005; Cohen et al. 1998; Gorman 2005) in coding gender based on 
whether the name was more frequently given to girls or boys at the time of 
the individual's birth. 


8. Because fields of practice were classified by the type of client served, 
respondents who answered questions about "corporate-client" fields were 
largely lawyers who worked only for corporate clients. Analyzing the 1995 
Chicago survey data, Heinz, Laumann and their colleagues (Heinz et al. 1998) 
found that only a very small proportion of lawyers devoted a substantial 
amount of time to work for both corporate and individual clients. 


9. The data were provided to me in this form. Equivalently the inverse 
percentage - the percentage of respondents who chose the two response 
points closest to the specialization pole — can be viewed as a measure of the 
specialization of lawyers who work in that field. 


10. In supplemental analyses, the total number of lawyers in the firm (i.e., in 
all firm offices) was also included as a control variable. Because it had no 
significant effect, this variable was eliminated from the final models. 

11. | included dummy variables for other regions in supplemental models, but 
eliminated them from the final models because they never came close to 
statistical significance. 

12. Bivariate correlations are presented in the Appendix. 
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Coping with Rural Poverty: 
Economic Survival and Moral Capitalin Rural America 


Jennifer Sherman, University of California, Berkeley 


The experience of rural poverty is in many ways unique from that of 
urban poverty. In the rural setting, social cohesion creates pressure on 
the poor to behave in ways that are consistent with local values. This 
paper, based on qualitative research done in an isolated, rural Northern 
California community, argues that in this setting the survival strategies 
of the poor are chosen because they are socially rational rather than 
economically optimal. The choice of socially acceptable coping strategies 
is ultimately beneficial because it creates “moral capital,” which can be 
traded for both social capital in the form of community support, and 
economic capital in the form of job opportunities. 


Introduction 


Poverty in the United States is often conceptualized as an urban minority problem 
(Dehan and Deal 2001). Thus, much of what we know about the poor is based 
in this setting. Yet rural poverty rates have been higher than urban poverty 
rates since the 1960s (Albrecht, Albrecht and Albrecht 2000; Flora, Flora and 
Fey 2004; Struthers and Bokemeier 2000; Summers 1995). The isolated town of 
Golden Valley, California! is a high-poverty rural community. Due to its location 
in a remote, forested region of Northern California, the forest sector was the 
dominant industry and employer there for nearly a century. Beginning with the 
spotted owl ruling in the early 1990s, the timber industry rapidly abandoned the 
area.? Today, Golden Valley's 2,000 residents struggle and compete with each 
other in order to secure the limited jobs that remain. Those who are unable to 
obtain or do the available work must either move away or find ways to survive 
grinding poverty. For many, the strategies they employ to survive on low incomes 
greatly affect their chances for ultimately escaping poverty. 

Early culture of poverty theorists postulated that the poor share a single culture 
that "cuts across regional, rural-urban, and even national boundaries." (Lewis 
1959:2) They further contended that this culture is both a product and cause 
of poverty (Harrington 1962). However, within the United States there is ample 
evidence to suggest much diversity in experiences and cultural norms between 
different poverty settings. Most previous literature on poverty coping strategies 
lacks an understanding of the ways in which the community limits opportunities 
for survival and affects the meanings of available strategies. | argue that within a 
small, tightly-knit, rural community, the choices that the poor make influence not 
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only their material survival but also their community standing through the creation 
or diminution of “moral capital,” a form of symbolic capital (Bourdieu 1984) based 
on perceived moral worth. Coping strategies that may be commonly used in 
the urban setting and seem financially rational, can be socially and economically 
irrational in the rural setting. 


Theoretical Background: Surviving Poverty 


Scholars of poverty in both the urban and rural setting have described the various 
ways in which the poor make do (Edin and Lein 1997; Ehrenreich 2001; Jensen 
and Eggebeen 1994; Nelson and Smith [1998] 1999; Newman 1999; Stack 1974; 
Tickamyer and Wood 1998; Venkatesh 2000). In urban settings, these coping 
strategies range from those that are consistent with mainstream cultural norms 
to those that explicitly resist the mainstream.? Researchers who study urban drug 
dealers find that their behaviors and coping strategies are often based within 
a "backlash" culture that eschews white mainstream values in favor of street 
toughness and focuses on the perceived financial rewards of illegal activities 
such as the drug trade (Anderson [1989] 1990; Bourgois 1995; Venkatesh 2000). 
Economically, these strategies may be sensible, promising potentially higher 
earnings than most low-skill jobs in the legal workforce. The existence of the 
resistance culture allows individuals to preserve self-esteem and social standing 
while pursuing economic gain through ethically and legally questionable means. 

In urban communities, heterogeneity in cultures, values and activities allows 
for competing options for surviving poverty while retaining dignity. For some, the 
frequency and proximity of illegal activities allows them to be viewed as morally 
appropriate (Wilson 1996). For others, morality continues to be constructed in 
opposition to illegal pursuits. As Lamont (2000) argues, adherence to mainstream 
moral standards often allows individuals to maintain self-worth when they are 
unable to achieve success in economic terms. Thus, despite the lure of illegal 
opportunities, many individuals in the urban setting prioritize mainstream 
morality over economic maximization. Edin and Lein (1997: 146) and Newman 
(1999) found that poor single mothers and low-wage workers tended to choose 
the most legal and morally accepted coping strategies over more economically 
lucrative but illegal ones, because these activities provided them with greater 
“self-respect.” Gowan (2003) and Duneier (1999) found that even among homeless 
populations, some individuals chose less profitable informal work activities over 
drug dealing or panhandling because they provided greater self-esteem through 
the connection to mainstream American morality and work ethic. 

As these diverse studies suggest, in the urban setting poor individuals are 
often able to choose between economic maximization and cultural optimization 
(Wilson 1996). The same does not necessarily hold true in rural areas, in which 
mainstream American culture is often more pervasive and hegemonic (Larson 
1978; Snyder and McLaughlin 2004), and alternative lifestyles such as illegal 
activities are less plentiful.^ In rural communities, survival strategies tend to 
be heavily influenced by local cultural and gender norms (Davis 2000; Fitchen 
1991; Nelson and Smith 1998; Norem and Blundall 1988), which often dictate 
a preference for informal work and self-provisioning (Jensen, McLaughlin and 
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Slack 2003; Nelson and Smith [1998] 1999; Tickamyer and Wood 1998). Such 
culturally-appropriate provisioning activities as gardening, wood cutting, hunting 
and fishing are often supplemented by barter and trade in rural communities 
(England and Brown 2003; Moore 2001). Furthermore, struggling families will 
often choose only those coping activities that are consistent with local cultural 
ideals such as self-sufficiency, even when this means cutting back on what 
they consume (Fitchen 1991; Struthers and Bokemeier 2000). Additionally, it is 
generally found that public assistance use is less common among the rural poor 
than the urban poor (Jensen and Eggebeen 1994; Lichter, Roscigno and Condron 
2003), and often carries with it a powerful stigma (Bokemeier and Garkovich 
1991; Duncan 1999; Struthers and Bokemeier 2000). 

Within the literature that looks at informal work activities and poverty survival, 
rural subsistence activities are relatively understudied (McGranahan 2003), as 
are the ways in which these culturally appropriate adaptations are encouraged 
and enforced through social support and sanction (Sayer 2004; Wilson 1996). 
Wilson (1996) suggests that small rural communities are more cohesive than 
many urban communities. Golden Valley is also much less diverse than most 
urban areas, both ethnically and economically. Class distinctions are minimal in 
this setting, as opportunities for economic success are scarce. But as previous 
researchers have noted, even in the absence of clear class differences, social 
distinctions may arise based on non-economic factors such as behavioral and 
moral norms (Duncan 1999; Lamont 2000). 

According to Lamont (2000), morality can allow those of low social and 
economic status to locate themselves above others of similar or even higher 
status. Sayer ([2000] 2004) similarly argues that moral criteria are used by social 
groups to distinguish between themselves, and that moral considerations often 
constrain economic activities. In Golden Valley, moral worth has evolved into a 
form of symbolic capital. This "moral capital" allows the poor to create distinctions 
among themselves in the absence of significant economic capital (Bourdieu 
1991). Perceptions of individuals' moral worth are often based on their coping 
behaviors, including how much they do or do not work, and their involvement 
in illegal activities. A person's moral status contributes to more than just his or 
her reputation, however. Those who are perceived as having lower moral worth 
are often denied access to the community’s increasingly rare jobs, as well as 
to many forms of community-level charity. Thus moral capital can be traded for 
economic capital in the form of job opportunities and charity, or social capital in 
the form of community ties and social support (Coleman 1988; Putnam 2000). 

This paper investigates the importance of setting to survival strategies. 
Findings suggest that the small size, cultural homogeneity and lack of anonymity 
in a small rural community can create greater social pressure on the poor to be 
culturally acceptable according to the existing local standards. For those whose 
coping strategies are not morally adequate according to local norms, the result 
is often not only diminished self-respect, but also community-level censure that 
further affects their quality of life and chances for eventually escaping poverty. On 
the other hand, those who have higher moral capital generally have more access 
to social capital as well as economic capital in the form of job opportunities and 
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community support. In this way, the rural setting differs substantially from the 
urban, which allows for a greater range of survival strategies that are acceptable 
within separate subcultural spheres (Fischer 1975). 


Research Methods 


Golden Valley was chosen as a field site because of its isolation and historical 
dependence on a single industry, now in decline. The community's economic 
and social life has been severely injured by job loss and restructuring in the 
timber industry, much of it directly related to the spotted owl ruling. In 1990, 
environmentalists won a two-decade long battle to preserve the habitat of 
the northern spotted owl, with its listing as threatened under the Endangered 
Species Act. Subsequently, a 1991 court ruling placed injunctions on timber sales 
on public land in the owl's territory, which included more than 80 percent of the 
forested land in and around Golden Valley (Watershed Research and Training 
Center 2006). Although the restrictions were later lessened to some degree, large 
scale logging has never returned to the region. Since the ruling, local logging and 
sawmill jobs have evaporated almost completely (Wilson 2002). According to one 
estimate, 40 percent of the community's payroll disappeared during the decade 
of the 1990s (Watershed Research and Training Center 2006). Golden Valley's last 
sawmill closed in 1996. It was the town's largest employer, providing 150 (mostly 
men's) jobs (Howard 1996). The mill closure left a large sector of the population 
with little prospect for employment, particularly anything that paid comparably to 
the jobs lost. Although many chose to remain, there was a large out-migration at 
that time, and the population has still not recovered. From 1990 to 2000, Golden 
Valley lost approximately 300 residents (U.S. Census 1990, 2000). 
|». | moved to Golden Valley in the summer of 2003. Over the year that | lived 
there, | completed 55 tape-recorded, open-ended, in-depth interviews with native 
and longtime members of the community, as well as a full year of ethnographic 
fieldwork. | interviewed about equal numbers of men and women (25 women 
and 30 men), singly and in couples, depending on the subjects' preferences. 
The average interview lasted between one and a half and two hours, with the 
possibility of a second follow-up interview. The subjects ranged in age from 23 
to 60 years old, and the average age of the sample was about 39. The majority 
(92 percent) of subjects were white, which reflects the dominant racial group in 
Golden Valley (85 percent, according to the 2000 Census). The remaining 8 percent 
reported some Native American heritage, which is about the same percentage of 
Golden Valley residents who reported this race on the 2000 Census. With only a 
few exceptions, most subjects had low incomes that placed them below or close 
to the poverty line, which also reflects the class makeup of the community itself. 
Two-thirds of households have incomes under $30,000, according to the 2000 
Census. Subjects were recruited through contacts in the community, including 
several in organizations that focus on job creation and retraining, and a snowball 
sample was used to find subsequent interviewees. | further engaged in volunteer 
and outreach work in the community, particularly related to the poor and the 
parents of small children, in order to meet and recruit subjects from as many 
different social networks as possible. 
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Ethnographic methods were employed in order to enhance the understanding 
gleaned through the interviews, to identify inconsistencies, and to observe social 
practices in their natural environments. | attended community events such as 
the County Fair and local festivals, school and bar dances, community meetings, 
Sports events and school activities. | took part in daily life in many ways, including 
volunteering with several local organizations, teaching an adult education class, 
regularly observing at local businesses and frequenting public spaces. Basing my 
own life in the community provided me with a broader perspective on life there, 
and helped me to gain a fuller understanding of Golden Valley and its unique set 
of norms, customs and concerns. 


Findings from Golden Valley 
Poverty Then and Now 


Table 1 compares Golden Valley with the state of California on several quality of 
life measures. In 2000 Golden Valley had significantly higher poverty rates than 
the state as a whole, and its median household income was less than half the 
state average. It also had three times the unemployment rate and a considerably 
lower labor force participation rate. Meanwhile, 16 percent of Golden Valley 
households lacked telephones, compared to just 2 percent in the state as a whole 
— an indication of the kinds of amenities the rural poor sometimes go without.* 


Table 1: Quality of Life Measures 


Golden Valley California 


Individual Poverty 24% 14% 
Child Poverty 31% 20% 
Unemployment 21% 1% 
Labor Force Participation 45% 62% 
Lacking Telephones 16% 2% 
Median Household Income $22,024 $47,493 


Source: U.S. Census Bureau 2000 


Table 2 shows basic economic statistics for Golden Valley before and after the 
1991 spotted owl ruling and subsequent mill closures. Despite the loss of jobs 
over the decade, poverty rates were lower in 2000 than they were in 1990. This is 
not actually due to people moving out of poverty, but rather to the out-migration 
of people who could not find work, as well as women's increased workforce 
participation and some in-migration by wealthier retirees (Wilson 2002). But 
poverty has long been endemic in Golden Valley, which has always had its share 
of people who lived marginally. Many of my subjects grew up with fathers who 
were only weakly attached to the workforce. Yet most believe that the since the 
closing of the woods more people do not work, and more receive welfare than ever 
before. This perception is only partially correct; while unemployment has doubled 
over the 10-year period, public assistance receipt dropped by almost half, due at 
least in some part to welfare reform. According to many of my respondents, the 
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Table 2: Economic Changes in Golden Valley 


———————— S 


1990 2000 
Poverty 27% 24% 
Welfare Use 19% 9.5% 
Unemployment 11% 21% 
Men in Labor Force 5696 4896 
Women in Labor Force 36% 41% 


Source: U.S. Census Bureau 1990, 2000 


stigma attached to welfare receipt has grown over the years however, and this 
increased social disdain may help account for both the perception of growing 
receipt as well as some of the actual decline in receipt. ° 

While welfare receipt has always been looked down upon, working poverty 
has historically had a very different social interpretation. Many of my subjects 
grew up with a working father whose salary barely kept the family afloat, and the 
lack of electricity and telephones was nearly as common for them as it was for 
those whose fathers did not work. Women were less likely to work outside the 
home while the mill and woods were open, particularly among married-couple 
families. Without a second earner, even a mill salary afforded only the barest 
of middle-class lifestyles. As 48-year-old Eric McCloud recalled, "When | was 
growing up, this was a community of poor, working people. They were poor, and 
some of them stayed poor, and some of them saved every penny they made and 
ended up with a fairly nice retirement." 

The key to respectability in the community was not necessarily wealth, but 
rather having a male earner who provided as much as he could through his work. 
. There was little shame in living on meager wages, and subsistence activities 
were also highly valued as work. The masculine culture of Golden Valley was 
built around the value of hard labor for its own sake. Thus, the biggest difference 
between the past and the present is not the existence of poverty, but the gendered 
nature of work. In my sample, 41 percent of all respondents had grown up with a 
mother who worked at least parttime. However, of the women in my sample who 
were either married or parents, 83 percent were currently working. The ability 
of the male breadwinner to support the family is the most noticeable change 
since the closure of the woods. During the forest products’ heyday, a solid lower- 
middle-class lifestyle was more commonly achievable without a working wife. 

Since 1990, jobs that men are willing to take have become sparse and 
unreliable. Women's workforce participation has increased significantly, as they 
have gone to work in order to compensate for the lost male earnings. Women 
are now in the workforce in numbers about equal to men, and their median 
earnings are actually higher than men's for full-time, year-round workers ($29,306 
vs. $23,594 according to the 2000 Census). However, most women's jobs do 
not pay comparably to the forest jobs that have been lost nor are the hours 
equivalent. While mill and logging jobs generally were fulltime and paid more 
than minimum wage, the majority of women's current jobs are in the service 
and sales sectors (U.S. Census 2000; Wilson 2002), characterized by part-time 
work and low wages. In 1999, only 11 percent of working-age women worked 
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fulltime and year-round compared to 27 percent of men. Thus, women’s overall 
median earnings — including both full- and part-time work — were $13,000 in 1999, 


compared to $21,000 for men (U.S. Census 2000). 


Meanwhile, unemployment and lack of workforce participation have become 
much more common for men over the decade. Given the cultural context, this 


presents a difficult challenge for the men of Golden 
Valley. Not only must they fight for material survival, 
but they must also fight harder than ever before just 
to retain respectability within the community. It is 
not easy for men to manifest their work ethic in the 
absence of work. However, the need to demonstrate 
the proper work ethic is greater than ever before, as 
the pool of unemployed men from which employers 
may choose has grown substantially. 

As the jobs that men are willing to take have 
disappeared, reputations have become extremely 
important in deciding who gets the remaining jobs. 
Paramount to maintaining a decent reputation is the 
ability to manifest accordance with Golden Valley's 
cultural norms and moral values, particularly the 
importance of self-sufficiency through hard, physical 
labor and a strong work ethic. A poor work history 
that includes welfare receipt can be very damaging 
to a person's reputation, and a bad reputation will 
often cause employers to pass up a job seeker.’ Thus, 
the survival strategies that the unemployed choose 
not only affect their current financial situations, but 
also their future social and economic standing in the 
community. In a community as small as Golden Valley, 
few are invisible. 


Coping With Poverty 


Those who lack jobs or whose incomes are inadequate 
must choose between a number of available survival 
options in Golden Valley. The different coping 
strategies carry with them varying degrees of moral 
capital, as illustrated in Figure 1. By far the most 
popular and respected of coping strategies are those 
related to subsistence food provision. These include 
hunting and fishing, as well as growing gardens and 
raising livestock. The first two options are almost 
universally practiced as hobbies by Golden Valley's 
men whether or not they rely on them for sustenance. 
Hunting and fishing are vitally important to Golden 
Valley's culture. They represent a man's first line of 
defense against economic problems as well as his 


Figure 1. Moral Value of Coping Strategies 
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tie to the land and to historical forms of the male provider role. Being the most 
highly valued of the local subsistence options, they are also the most acceptable 
way for a man to contribute non-monetarily to his family's basic needs. 

For the truly poor, however, hunting and fishing are nonetheless insufficient to 
meet all of their daily needs. Thus, subsistence activities are generally combined 
with a number of other possible coping strategies, whose social interpretations 
cover the spectrum from acceptability to contempt. Second to subsistence 
activities in moral acceptability is receiving help from relatives, including wives. 
This includes cash loans, help with child care and sending wives into the workforce 
who would otherwise have been at home. Poor families in Golden Valley often 
create networks of support quite similar to those found in urban communities 
(Stack 1974; Newman 1999), relying on one another for money, child care and 
temporary housing. 

Also generally accepted and very often utilized are opportunities for living rent 
free or at a very low rent. These opportunities differ substantially from options 
found in urban areas. Cheap housing is generally available only to those with 
longer histories and/or extensive social ties in the valley. Common options include 
the inheritance of family homes, as well as the custom of "caretaking," in which 
tenants provide basic maintenance and upkeep on a property in return for greatly 
reduced or nonexistent rent. 

For those who still cannot manage to make ends meet, or who are unable 
to take advantage of other coping strategies, government assistance is the last 
resort. In Golden Valley there is a clear moral hierarchy imposed on the various 
income maintenance programs. The most acceptable of these is unemployment 
insurance, which a large proportion of local men receive at some point during 
the year, due to the seasonality and instability of most remaining jobs. For men 
who are either seasonally unemployed or laid-off, unemployment insurance is an 
acceptable way to survive until work reappears. It is rarely even conceived of as 
government assistance, but rather is viewed as earned income. 

After unemployment insurance, the most acceptable form of government 
assistance is Supplemental Security Income (SSI — locally known as "disability", 
whose receipt is very common in Golden Valley. Logging and sawmill jobs 
are exceedingly dangerous in nature, and almost all men who had worked in 
either had been seriously injured at some point. One 30-year-old former logger 
described it this way: "Everything out there is tryin' to kill you — everything." 
According to the Census, nearly 40 percent of Golden Valley's working-age men 
are disabled, and 76 percent of the disabled are not employed. This compares 
to 21 percent disabled and 40 percent of the disabled being unemployed in the 
state as a whole (U.S. Census Bureau 2000). It is difficult to discern how many of 
the disabled might be able to work; while some of my disabled subjects admitted 
that they could work but chose to receive SSI instead, others had made many 
attempts to continue working despite serious injuries. Although most recipients 
of SSI do have some sort of valid claim, it is likely that many of them would still 
work if there were more jobs available. Fifty-year-old Susan Elders, frustrated 
by her difficulties in finding reliable workers to do maintenance on her rental 
properties, explained, “[Since] they can only stay on welfare a couple of years, 
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what l'm seeing now is all of a sudden there's an increase in disability. People are 
trying to find whatever excuses to get on disability." 

Despite insinuations of its abuse, SSI receipt is still much more acceptable than 
the receipt of welfare because it carries with it the assumption of a dangerous, 
hardworking past. But as Elders suggests, in a number of families it does seem 
as if disability assistance has evolved into a new form of welfare, suited to 
the conditions of Golden Valley's current economy. While women's workforce 
participation is becoming widespread and necessary, men are increasingly falling 
out of the workforce and remaining unattached to it. For many of these men, 
disability assistance has become their main tactic for contributing monetarily to 
the household without holding a job. 

Means-tested welfare options are considered by most to be the very last 
resort. More than a third of my respondents admitted to having received cash 
assistance and/or food stamps at some point in their lives, yet most considered 
the experience to be humiliating. Golden Valley's small size and insularity make 
it difficult to receive welfare anonymously. It was considered so shameful that 
several respondents confessed to having traveled an hour or more in order to 
spend their food stamps in other communities and avoid being seen by people 
they knew. As argued by Fraser (1989:153), means-tested welfare programs 
position the recipient as "dependent clients" vs. the "rights bearing beneficiaries" 
who receive unemployment or disability assistance. In Golden Valley, while these 
latter forms of government assistance are deemed acceptable and "earned," 
welfare is almost always discussed with either disdain or shame, and welfare 
recipients are often characterized not just as deadbeats but as alcoholics and 
drug addicts as well. Those who receive it for any length of time are quickly 
labeled and shunned. More than once | witnessed food stamps users being 
ridiculed by check-out clerks within moments of exiting the grocery store. 

Finally, drug dealing is the most socially reprehensible of coping strategies 
in Golden Valley today, particularly if the drug being pedaled is crystal meth 
(methamphetamines), the most commonly abused local substance. Those who 
sell drugs on a large scale are mostly isolated from the community, both as a 
form of self-protection as well as due to social censure around their lifestyle 
choice. Drug dealers are generally absent from most conventional social settings 
and personal networks. Unlike in the urban setting, in Golden Valley there are 
no competing cultural norms that sanction the drug trade. Being associated 
with drugs in any way is hugely stigmatized in Golden Valley and considered 
antithetical to its work ethic. While it is not uncommon for teenagers to party 
during high school, adults are expected to quickly end this behavior. Continued 
use, and in particular any association with selling drugs, is highly damaging to 
an adult's reputation. Thus, most poor individuals who are integrated into the 
community do not consider drug dealing to be among their possible strategies 
for surviving poverty. 

Most people's coping activities are known in Golden Valley and substantially 
influence their status as upstanding citizens versus deadbeats. Maintaining 
decent reputations can be immensely important to the poor, not just because it 
raises their social status but because status is tradable for social and economic 
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capital. Those who are believed to be morally upright are much more likely to be 
beneficiaries of ad hoc community charity. Very commonly, when asked why they 
stayed in Golden Valley, subjects talked about feeling secure in the knowledge 
that, "the whole town will stand behind you in a time of need." Twenty-five-year- 
old Nicole Goodman, whose husband is employed by a local business and has 
longstanding ties within the community, explained: 


If your house burns down, like somebody's did recently, 
everybody's like, ‘What can we do, what can we do?’ 
They come right over and start donating and helping. 
There’s not money here, but everybody teams together 
to take care of everybody. 


But such charity is not equally available to everyone. Julie Mitchell, a 26-year-old 
daycare provider and married mother of four, talked at length about the community 
organizing pie sales and spaghetti feeds to raise money to help residents with 
serious health problems. But her husband interrupted her to explain, “But see, 
that's for only the known people in town... You want to be known in a good way." 
In Golden Valley, social capital and moral capital can be traded for economic 
capital when a poor family or individual is truly in need. But the lack of either can - 
close these doors to the community's generosity. 

Similarly, reputations and perceptions of moral worth often play a significant 
role in deciding who gets jobs in Golden Valley, particularly among the low- and 
semi-skilled. Employers frequently complain about the lack of hardworking 
applicants who can be trusted to come to work each day. Those who are 
considered less than hardworking are seldom taken seriously. During my year in 
Golden Valley, | spent time observing at businesses in town. In multiple cases, 
employees commented to me on the likelihood of the applicant receiving a job. 
His chances generally rested on the impression they had of him, including his 
known history as a drug user or being from a “troubled family” that was known 
for long-term welfare receipt. For example, my field notes from April 5, 2004 
contain the following passage regarding a job applicant for seasonal field and 
brush crew work. He was known to accept welfare and food stamps, as well as 
to have drug and alcohol problems: 


When Jeremy left the office, Amy laughed. She explained 
to me that they would never hire someone like him. She 
told me that he had done jail time for marijuana possession 
and explained that he was a bad egg and not trustworthy 
as a worker. 


Generally, only those individuals who had better reputations and higher moral 
capital were considered for jobs in Golden Valley, regardless of how sincere they 
might be in their desires to work. For those with low moral capital, the stigma of 
their pasts and presents became barriers to their futures. 
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The following three case studies illuminate the ways in which the poor struggle 
to survive and how the strategies they choose determine their moral capital in 
the community. All three are married or cohabitating couples, the still-dominant 
family form in Golden Valley, and they are presented hierarchically from most to 
least moral capital. The three cases were chosen from among 29 couples in the 
sample because they represent typical patterns that were found throughout my 
interview sample and ethnographic work in Golden Valley. 


Not Real Into Handouts: Liza Wright and Tommy Patterson 


Liza Wright and Tommy Patterson are a cohabitating couple in their late 20s, 
who have a 3-year-old son together. Despite not being officially married, they 
refer to one another as husband and wife? Their names were provided to me 
by a number of different informants, who described them as nice people, 
good role models and hard-working employees. Although both drank and used 
drugs during high school, they cleaned up as soon as Liza became pregnant, 
positioning themselves for responsible adulthood. This transition away from 
substance abuse occurred early and completely; by the time Liza was ready to 
look for work, she was known as a responsible and stably "married" mother, 
rather than a partying teenager. They are now considered to be among the pillars 
of the community's younger generation, and their participation is often sought 
for community-organized events. Both have lived in other places at times, but 
want to stay in Golden Valley. As Liza explained it, "Yeah, as long as we can 
survive here, we will." 

Liza and Tommy survive through a combination of paid work and informal 
strategies that mostly excludes government assistance. Liza works parttime 
as a receptionist in the local doctor's office and parttime as a checker at the 
grocery store. Tommy worked in logging until the bottom fell out. Now he does 
construction work locally, which is seasonal and insecure but provides enough 
for them to survive most of the year when combined with occasional side jobs 
and Liza's income. According to Tommy, the trick to survival in Golden Valley is 
a strong work ethic and taking whatever (men's) jobs are available. "Somebody 
always needs their driveway shoveled or some firewood, somethin' like that. You 
just gotta' do it." 

For Tommy and Liza, there is usually just enough work to get by. This does not 
mean that they can always afford to pay their bills, however. Seasonal work makes 
it difficult to cover all their expenses. As Liza put it, “unemployment [insurance] 
will never pay what your wages are gonna be like." Luckily, they have their 
families to fall back on. Both Liza's mother and Tommy's parents help with daily 
expenses and emergencies. Tommy and Liza are grateful for this kind of help, but 
nonetheless they prefer to think of themselves as self-sufficient adults. 


Liza: His parents pulled through for us... But for the 
most part, we just kind of made it on our own. And 
yeah, you know, every now and then you miss a bill 
and the collector will call. But we always catch back up 
eventually on our own. 
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For Tommy and Liza, there is a good deal of pride wrapped up in providing 


for themselves despite all the challenges that they face. The only government 
assistance that they regularly accept is unemployment insurance, which in their 
minds is earned and deserved. They have never accepted welfare, disability or 
food stamps, and only briefly received Medi-Cal when Liza was pregnant. As 
soon as they could afford it, they purchased private health insurance for their 


Figure 2. Tommy and Liza’s Strategies: High Moral Capital 
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son. The same set of beliefs that leads Tommy to take 
any job he can find also makes him wary of accepting 
government aid of any kind: 


It makes me feel better that we're buying 
insurance. I’m not real into handouts. If 
my family needs ‘em, I'll take ‘em. But I'd 
rather not. l'd rather work an extra job or 
work a couple extra days or somethin' and 
take care of 'em myself. But that's just the 
upbringing | have. 


Living near the edge is a source of some stress, 
but not a source of humiliation for Tommy and Liza. By . 
focusing only on survival strategies that are consistent 
with Golden Valley's ideal work ethics, they are able 
to provide for themselves not a life of affluence, but a 
comfortable life, rich in self-respect and social standing. 
Tommy proudly told me, "We got it good." Their 
strategies tend toward the most morally acceptable 
along the continuum (Figure 2); they combine seasonal 
work and family support with unemployment insurance 
and Liza's full-time work. The result for them is access 
to both economic capital in the form of decent jobs and 
social capital in the form of community ties and support. 
They still have trouble making ends meet. But because 
they stay firmly tied to the paid labor market and avoid 
the more stigmatized coping strategies, they are able 
to maintain good reputations and high self-esteem, as 
well as high moral capital within the community. They 
are proud of themselves and happy with their lot, and 
are well respected and supported in Golden Valley. 
Because they are so well liked and their work ethics 
are so admired within the community, it is unlikely that 
either will ever be out of work for very long. 


You Worked for What You Got: 
Joyce and George Woodhouse 


Joyce and George Woodhouse are in their early 50s and 
have lived most of their lives in the small, run-down cabin 
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that they inherited from her parents. They consider themselves to be among the 
respectable and hard-working poor. When they were younger, Joyce worked as 
a waitress and George worked in the mill. Eventually a series of accidents forced 
Joyce out of the workforce and onto disability. Without her working, his mill salary 
was just barely enough to keep their ever-expanding family afloat. Like many 
other poor couples, George and Joyce regularly took in children whose parents 
were physically abusive or drug and alcohol addicted. The question, "How many 
kids did you have?" elicited the following response: 


Joyce: | had two kids, a girl and a boy. They were four years 
apart. How many kids did we raise? About 200 or more. 


George: That's no exaggeration. We didn't drink and do 
drugs and stuff like that. So everybody that was havin’ drug 
problems and alcohol problems [at home], and domestic 
violence, stuff like that, would come to our house because 
they could sleep here and eat. 


The Woodhouse home became known as a safe place for kids to go to in order 
to escape from troubled families. However, providing food for so many people 
put a strain on them financially, to which Joyce responded by increasing their 
subsistence activities. Without a job, she contributed to the family's diet by 
catching as many fish as she could, even when it meant exceeding the legal 
catch limit: 


George: | would go to work at the mill. And most all of my 
paycheck was going for groceries and stuff to feed all of ‘em. 
Well, Joyce would go up to the reservoir, the lake, and fish. 
If she caught fish, then we'd have fish and like, potatoes and 
stuff like that. If she didn't fish, then we'd have potatoes. 


As they got older and their own children moved out, the Woodhouses stopped 
taking in other children as regularly. But even with fewer mouths to feed, their 
financial situation has not improved. George lost his job due to injury the year 
before the mill closed. He then invested in construction equipment and tried to 
start his own business. While he enjoyed some success at first, by 2003 he had 
little work. Although he looked for other jobs, he was unable to find anything locally, 
and complained that no one would hire him to do the kind of work that he wanted. 
Although they worry about how they will survive retirement, the Woodhouses 
continue to endure as they always have. Their house is paid for. George still makes 
a little money with his business here and there. And for food, they rely heavily on 
deer they shoot in their yard and fish they catch. Particularly for Joyce, fishing for 
food is an integral part of the only lifestyle she has ever known: 


Now when | was a kid, | was raised on salmon and deer 
meat... And last year we brought home 20 [salmon], and 
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one of ‘em was 54 pounds. So we had about 600 pounds. 
But that was why we went, was to smoke it and can it so we 
had it for the winter, to help, you know, with things. 


George and Joyce survive poverty through a strategy that focuses on the more 
highly regarded and traditional of Golden Valley's options (Figure 3). They rely 
on subsistence food, low-cost housing and disability assistance in addition to 
sporadic work — strategies that even when combined provide barely enough to 
survive. Joyce and George remain highly regarded in Golden Valley, and are active 
members of several local organizations. In Golden Valley, manifesting this kind of 
work ethic, even while living in poverty, is enough to be considered productive 
community members, although they do not receive quite the same respect as 
more stably employed couples like Tommy and Liza, nor does George have the 
same faith in his own ability to be hired locally. His age, combined with his recent 
time out of the workforce, makes him less attractive to local employers. And 
while they may contribute to self-esteem and social standing, informal charity 
activities are not considered on par with work activities. Paid work still remits the 
highest amount of exchangeable moral capital, and the lack of it is stigmatized 
by Golden Valley's choosy employers. 

Yet despite their obvious poverty and need, their pride and loyalty to local ` 
cultural norms compels the Woodhouses to avoid the least acceptable types 
of government assistance, even when it means going without basic necessities. 


George: We don't try to get food stamps or welfare or 
anything like that. | mean, basically, we probably could. But 
/ don't know, we were always brought up that you worked 
for what you got, you didn’t have welfare and stuff like that. 
Ifyou didn't work, then you cut back on what you was eatin’ 
until you got a better job. 


The Woodhouse's strategy for survival is socially rational despite seeming 
economically irrational. While they could live more comfortably on welfare, they 
believe — with some reason - that to do so would lessen their worth as individuals 
and community members. It is important to George and Joyce that they are 
perceived as hard-working people, even in the absence of paid work. Although 
they are short on cash, their strategy does pay off to some degree in the form 
of social capital. They know they can rely on help from the community in times 
of need, whether it is the local deputies helping them to deal with troublesome 
neighbors or friends helping out when Joyce is ill or injured. For Joyce and 
George, being fully accepted members of the community, and the self-esteem 
that comes from being self-sufficient, outweigh the benefits that welfare income 
or food stamps might bring. 


The Taste of Poverty: Angelica Finch 


Thirty-eight-year-old Angelica Finch has lived on both sides of the social 
acceptability line and has experienced firsthand the ways in which welfare use 
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and substance abuse are often intertwined. She met me alone, as her husband 
did not want to be interviewed. Jim Finch is 27 years her senior, and although 
he worked in the woods for years, by the time she met him he had permanently 
gone on disability. All income beyond the disability check has been earned by 
Angelica. Because the couple has three children together, a disability check 
alone was never enough money for them. They also 
had serious drug and alcohol habits to support for their 
first decade together, which added further expense to 
their strained budget. 

Despite her substance abuse problems, Angelica is 
no stranger to work. But as a number of researchers 
have found, low-wage work often is not enough to 
support an individual, let alone a family (Edin and Lein 
1997; Ehrenreich 2001). For years she struggled to 
find a decent job, particularly because she had not 
finished high school. After a short time with the U.S. 
Forest Service, she began working at Valley Value, a 
convenience store where she remained for nearly a 
decade. While working, she also attended community 
college in the city of Hillview, more than an hour away. 
The eventual receipt of her associates degree boosted 
both her self-esteem and her earning potential, and 
helped her to finally leave Valley Value and land her 
current job as a lower-level administrator at Golden 
Valley Elementary School. With the higher paying job, 
she was able to pull her family out of intense poverty. 
The worst of her experiences are now behind her, but 
she is still haunted by the memories of her days at 
rock bottom. 

While Jim never worked in the formal sector, he did 
contribute to the household in the form of subsistence 
activities, although not all of them were legal, let alone 
respectable. According to Angelica, his contributions 
included gathering wood for heat and providing them 
with stolen electricity: 
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Figure 3. Joyce and George’s Strategies: Medium Moral Capital 
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food stamps, and | don't remember, like $100 in cash*aid 
or whatever. | was trying to go to school in Hillview. | would 
hit the top of Gold Mountain and turn my engine off and just 
coast down into the valley to save — | mean, we were that 
poor, you know. It was that desperate. lf it hadn't been for 
food programs, the WIC program, food stamps, Medi-Cal, 
we wouldn't have made it at all. Or | wouldn't have been 
able to get my education. 


In addition to these less socially sanctioned survival strategies, Angelica and Jim 
also augmented their diet through hunting. Although it was a source of less shame 
and humiliation, she nevertheless recalled their subsistence diet with revulsion. 


One of our other survival tricks was that deer out the back 
door... In most places, venison is like a delicacy, a treat. Up 
here — and l'm not the only one who feels this way — venison 
is what you eat when there's nothing else to eat, and then 
that's all you eat. It gets real old, real fast. 


For Angelica, deer meat would always be a symbol of poverty, and she hoped to 
never have to taste it again. 

During those hard times, Jim and Angelica's drug and alcohol problems both 
resulted from and contributed to their misery. When asked if being poor had been 
stressful for them as a couple, she replied, "Yeah, yeah, and it just leads to that 
desire to go medicate, because you can't see any way out. You're so destitute 
and struggling, and it's such a struggle." She described feeling marginalized by 
the community, and internalizing a self-hatred so intense that she did not want 
mirrors in her house, because she could not stand to look at herself. In order to 
eventually improve their lives, Angelica insisted that they focus on overcoming 
both poverty and addiction simultaneously. She described the journey out of 
addiction and dependent poverty as what "brought us up into society instead 
of being down pretty much in the gutter.” The combination of her education 
with the changes in their lifestyle meant Angelica and Jim could move from the 
margins of Golden Valley society into the mainstream. 

Unlike the Woodhouses, Tommy and Liza, the Finches had relied on strategies 
that are not socially accepted in Golden Valley (Figure 4). Although like the 
Woodhouses, they had relied on subsistence food and disability assistance, and 
like Tommy and Liza, they had relied on unemployment insurance and Angelica 
working, they had also relied heavily on illegal strategies and means-tested welfare 
programs. With such low moral capital, they were denied access to both living- 
wage job opportunities and social capital within the community. Changing her 
lifestyle meant that Angelica could eventually find some acceptance and respect 
within the community. Had she not invested in her moral capital, she likely would 
have faced more barriers to cashing in on her upgraded human capital as well. 
Despite her schooling and her many years as a reliable worker for Valley Value, 
she was unable to find a better paying job while still using drugs. It was only after 
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clearly establishing herself as clean and sober for several years, and cutting most 
ties to her drug-addicted friends, that Angelica was able to find a more respectable 
job. Although she described the process as lonely and difficult, she believed 
that it was worth it; while she and Jim now have few friends, they have greater 
access to economic and social capital than ever before. Over time Angelica has 
felt increasing acceptance within community settings and is now less afraid to 


appear in public. Nonetheless, occasional sideways 
glances or pointed comments still provide subtle 
reminders that her past is a dirty secret. Angelica and 
Jim's history remain a scar on their reputations, but 
Angelica has at least finally proved herself worthy of a 
decent job in Golden Valley. 


Conclusion: The Importance of Moral Values 


Unlike in the urban setting, within Golden Valley there 
are few competing cultural repertoires from which to 
choose, and the moral focus on hard work remains 
hegemonic. Golden Valley's small size and high 
cohesion compel the poor to act in mainstream ways. 
For Golden Valley residents, their culture is reproduced 
both through intergenerational transmission as well 
as through social pressure to adhere to its norms. 
Failure to do so can result in shame, marginalization 
and decreased social support, as well as reduced 
employment opportunities. 

Furthermore, in a community as racially and 
economically homogenous as Golden Valley, there 
are few other means available by which to create 
distinction amongst members of the community. In 
this case culture and its moral norms have produced 
a moral form of capital that functions in much the 
same ways as do other forms of symbolic capital 
(Bourdieu 1986). It separates the deserving from the 
undeserving and creates social boundaries that can be 
policed and controlled. For employers, it can be called 
upon to help distinguish between similar applicants 
(Burt 1992). An individual's perceived moral worth 
can thus be traded for social and economic capital. 
Unlike in the urban setting, economic maximization 
is seldom the only factor driving people's survival 
decisions in Golden Valley. The choice of culturally 
and morally appropriate coping strategies is ultimately 
both socially and economically beneficial. 

These findings suggest that community setting 
can affect the behavior of the poor in a number of 
ways, including dictating appropriate means for 


Figure 4. Angelica and Jim's Strategies: Low Moral Capital 
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surviving economic hardships. Although we are long past accepting culture 
of poverty theories that blame the cultural characteristics of the poor for their 
own poverty (Harrington 1962), we have not invested much into newer theories 
to explain how culture and setting interact with poverty, particularly in the rural 
setting. The evidence from Golden Valley suggests that rural areas may operate 
according to very different social rules than urban areas. It further suggests 
that in order to alleviate poverty, we must first understand the different social 
milieux in which poverty is prevalent, and the ways in which setting interacts 
with culture and behavior. 

The preoccupation of the rural poor with such issues as moral and family values 
often seems perplexing to both academics and political analysts (Frank 2004; 
Stacey 1996). Popular mainstream books like What's the Matter with Kansas? 
(Frank 2004) question why working-class and poor populations vote against what 
we perceive as their interests, and describe middle-American voters as ignorant 
victims of a false consciousness imposed upon them by conservative elites. 
When the rural poor prioritize issues such as gun control over economic concerns, 
they are often dismissed as uneducated and backwards. But in marginalized 
places like Golden Valley, economic issues often seem distant and untouchable. 
Rightfully so, most Golden Valley residents feel that their economic concerns 
are largely ignored by politicians on both sides of the fence. They see that the 
political right prioritizes the interests of corporate industries, such as the large 
logging companies that clear-cut the woods in previous decades. On the other 
hand, the political left prioritizes the interests of urban liberals, such as the reviled 
environmentalists who are blamed for Golden Valley's current impoverishment. 
Most people in Golden Valley feel that neither side has much interest in their 
material well-being and economic sustenance. However, the right-wing agenda, 
with its focus on moral and personal issues, hits much closer to home. When 
your hunting rifle is a major source of your ability to engage with your children in 
a masculine activity, reproduce your unique culture, and provide for your family, 
gun control will likely seem a serious threat. Similarly, when moral capital is one 
of your main sources of tradable capital, the agenda that stresses moral issues 
will be the one that resonates better. 


Notes 


1. All identifying details, including the names of people and places, have been 
changed in order to preserve anonymity of subjects. 


2. The 1990 listing of the northern spotted owl as threatened under the 
Endangered Species Act led to federally-enforced reductions of timber 
harvesting through much of the Pacific Northwest in order to preserve the 
owl's habitat. Timber harvest in the region dropped by 80 percent between 
1989 and 1994 as a result (Daniels and Brehm 2003). 


3. Wilson (1996: 69) contrasts “mainstream norms and ideas of acceptability” 
with "ghetto-related" behaviors, which include, "existing without a steady 
job or pursuing illegitimate means of income." (p. 71) Mainstream values, 
according to Wilson, focus on work and morally-sanctioned behaviors, and 
contribute to the creation of social capital. 
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Fischer (1975) argues that in the urban setting, sheer numbers and 
the competition between people create variation and allow for greater 
subcultural variety, intensity and diffusion, as well as generally higher rates 
of "unconventionality" than is found in most rural settings. 


Rural California, as a whole, resembles the state average for most of these 
measures. This is reflective of the great diversity of rural places across the 
state of California, including many wealthier areas, particularly on the coast. 
However, Golden Valley is not unique in the northern forested part of the 
state, which includes many communities with similar economic and labor 
market concerns. 


This social stigma against welfare receipt has been found in small rural 
communities throughout the U.S. According to Bokemeier and Garkovich 
(1991:119), "strong cultural traditions prevent many rural families from 
using public assistance and the intimate nature of social interaction in rural 
communities makes others aware of individual or family decisions to access 
public assistance." 


Duncan (1999) found that in a small Appalachian community, reputations are 
often barriers to job opportunities. 


. According to many of my subjects, this type of common-law marriage is 

very common in Golden Valley and has been for generations. In general 
they are treated as identical to legal marriages. A number of my unmarried, 
cohabitating subjects referred to their partners as husbands or wives, despite 
the lack of official legal status. In many cases, including Tommy and Liza's, 
they planned to eventually get married after they had saved enough money 
for a wedding. 
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Framing the Issue: 
Religion, Secular Ethics and the Case of Animal Rights Mobilization 


Marie Mika, Ohio State University 


This article addresses social movement framing, generally, and within 
contemporary animal rights movements specifically by conducting 
focus group analyses of a non-activist population. This contrasts with 
previous studies of recruitment that have examined the conversion 
process retroactively, culling data from those already involved in 
a cause. By soliciting the thoughts of non-activists, the efficacy of 
recruitment efforts and resonance of movement frames can be gauged 
through the reactions of those whom the movement is attempting to 
reach: the uninitiated. Ultimately, the question is raised: is it preferable 
to get noticed in a negative way or not at all? The findings constitute 
a cautionary tale for social movement organizations that employ 
incendiary language or images in their recruitment efforts. 


Social movement actors commonly attempt to persuade those outside the 
movement to join their cause. They typically do so by framing their criticism 
of the status quo and prescribing a remedy in a manner that resonates with 
potential participants (Snow, Rochford, Warden and Benford 1986). Animal rights 
groups are no exception. PETA (People for the Ethical Treatment of Animals), 
the most visible group promoting animal rights, and one of the most successful 
contemporary social movement organizations, commonly makes use of moral 
claims as a framing technique (Jasper and Nelkin 1992; Snow et 31.1986). | 
examine whether such strategies are effective and what types of frames resonate 
most effectively with the broader public and potential recruits. 

Social movements have multiple audiences, including the devoted, casual 
participants and the media. This article addresses social movement framing, 
generally, and within contemporary animal rights movements specifically, 
by conducting focus group analyses of one pivotal audience: the non-activist 
population. This contrasts with and complements previous studies of recruitment 
in the social movement literature, which tend to examine the conversion process 
retroactively, culling data from those already involved in a cause. By soliciting 
the thoughts of non-activists, the efficacy of recruitment efforts and resonance 
of movement frames can be gauged through the reactions of one critical 
audience the movement is attempting to reach: the uninitiated. The dynamic 
and colloquial nature of focus group interviews further facilitates the ability to 
tap into the target audience's interpretation of the movement. This combined 
theoretical and methodological focus builds on recent theoretical inquiry into 
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the microfoundations of political action. Jasper (2004), in particular, suggests 
that exploration of the choices, or agency, on the part of social movement actors 
offers a promising avenue for social movement scholars, as the relevance of 
structural models — so dominant in recent decades - wanes. 

| begin with a review of animal rights mobilization efforts relative to social 
movement and framing literature, followed by a discussion of methodology. 
| then present extended excerpts from the focus group discussions detailing 
participants' reactions to PETA media. Finally | discuss the results and suggest 
the relevance and application of this research for social movement framing 
specifically and for recruitment efforts generally. 


Animal Rights and Social Movements 


Jasper (1997) observed that "Moral protestors are often sensitive to moral 
dilemmas the rest of us ignore; they sometimes generate new ways of 
understanding the complexities of the human condition." (p.13) PETA and other 
animal rights organizations are prime examples of the "moral protestors" who are 
agents of social movements to which Jasper refers. 

PETA was founded in 1980 and has experienced exponential growth, 
paralleling the arc of the contemporary animal rights movement as a whole. 
PETA's membership exceeds 500,000 worldwide and they have an operating ` 
budget of more than $13.4 million (Peterson 1998). Their mission statement is: 
“Animals are not ours to eat, wear, experiment on, or use for entertainment” 
(PETA website), and their mission is: “to protect animals from exploitation and 
cruelty, and to prompt people to reevaluate their relationship with other species.” 
(Peterson 1998:89) PETA has garnered success in several of their campaigns; for 
` example, stopping major cosmetics companies from testing on animals. They are 
easily the most visible and recognized contemporary animal rights organization 
based on sheer celebrity power alone; among others, Paul McCartney, Pamela 
Anderson and Charlize Theron have been spokespersons for the organization. 

As the best-known animal rights group, PETA sets the tone for the entire 
movement — and that tone is often shrill and fundamentalist (Jasper and Nelkin 
1992). Animal protection groups operate along a philosophical continuum, from 
“reformist to radical.” Fundamentalists or radicals demand “the immediate 
abolition of all exploitation of all animals, on the grounds that animals have 
inherent, inviolable rights.” (Jasper and Nelkin 1992:9) PETA, perhaps the 
ultimate example of a radical animal rights group, has attracted controversy for 
its opposition to any use of animals in biomedical research, including testing 
for AIDS and cancer research (Peterson 1998). Critics have decried their alleged 
attempt to commandeer the entire animal rights movement: “PETA is on a 
Sherman's March through the animal liberation movement, a warpath where 
anything goes if it's “for the animals’ in their view.” (Jasper and Nelkin 1992:31) 

Social movements, including those pertaining to animal rights, often provide 
a medium for moral expression in the increasingly secular and communally 
fragmented Western world (Jasper 1997). Moral claims have particular salience 
for many animal rights activists, who do not typically engage in traditional forms 
of religious practice (Maurer 2002). Social movements have long channeled their 
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moral energy into causes in the name of the dispossessed. According to d'Anjou 
and Van Male (1998:207), “Social movement actors often challenge authorities 
on behalf of people whose needs and interests are not addressed.” They quote 
"the American Quaker genius of anti-slavery," Benezet, who, in working on behalf 
of "the outsiders par excellence," stated: "God gave Man Dominion over the Fish 
and the Sea, and over the Fowls of the Air, and over the Cattle & c., but imposed 
no voluntary subjection of one Man to another." (1998:217) 

Animal rights activists seek to extend the logic of treating all humans with 
dignity to non-human animals as well. Indeed, for animal rights activists, non- 
human animals are the marginalized, powerless other, the new ultimate outsiders. 
The rise of terms such as speciesism — defined as discrimination against all other 
species deriving from the superiority of one's own - and claims that rights apply 
to non-humans indicate a willingness to entertain the idea of animals having 
a moral presence and agency beyond mere utilitarian purposes (Gaard 2002; 
Jasper and Nelkin 1992). 

Taylor (1996) argues that individual choices and activities, and efforts at self- 
improvement, "are linked directly to larger social and cultural changes taking 
place within the postmodern societies of the United States and Western Europe." 
Many of these changes involve efforts to address and eradicate "-isms," with 
animal rights seen as another stream in the confluence of isms — or the “matrix 
of domination" — to combat. Correspondingly, according to Dan Matthews, PETA 
director of campaigns: 


We've established women's rights, children's rights, gay 
rights, and | think the final frontier of social evolution is animal 
rights. | see it as much more a social cause than a political 
cause. It's not a movement you need to join by sending a 
donation. It’s a movement you can join at your next meal. 
(Peterson 1998:87) 


Gaard (2002) similarly notes that: 


„vegetarian feminists have argued for the moral treatment 
of non-human animals on the basis of sympathy, on the 
conceptual linkages among sexism, racism and speciesism; 
on the recognition of flesh-eating as a form of patriarchal 
domination; and on the basis of culturally constructed 
associations among women, animals, people of color and 
nature, that are used to subordinate these groups in Western 
patriarchal thought. (P 127) 


Animal Rights and Vegetarianism 


Of all the movements that are associated with the animal rights movement, 
vegetarianism has, perhaps, the most natural connection. Those who take animal 
rights seriously are likely to correspondingly see that breeding and raising animals 
to slaughter and eat is the ultimate violation of rights. 
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Vegetarians, however, are not necessarily animal rights supporters; concern 
for animals is just one path to vegetarianism. In fact, issues concerning personal 
health are the original motivation for most vegetarians (Maurer 2002). Nevertheless, 
a great degree of overlap exists among animal rights and vegetarian movement 
actors. For example, vegan (a vegetarian diet excluding all animal products) 
cuisine is served at the annual national animal rights convention in Washington, 
and animal rights conventions often share many of the same featured speakers 
with organizations such as the North American Vegetarian Society. 

Many activists are devoted to both movements. The demographic makeup is, 
in fact, strikingly similar. Women comprise 70 percent or more of membership 
in animal rights groups, with membership in some groups reaching almost 100 
percent women (Jasper and Poulsen 1995). PETA's own membership exceeds 
80 percent women: “one of the most consistent findings in studies of attitudes 
about animal rights and animal research has been the gender difference.” 
(Simonson 2001:402) Similarly, vegetarians are most likely to be “white, middle 
class females.” (Maurer 2002:8) There is a congruous joint preponderance of 
college graduates and professionals. Perhaps the most striking commonality 
between the two groups, however, is their lack of traditional religious affiliation. 
Vegetarians are “less likely to participate in conventional religions.” (Maurer 
2002:13) Similarly, 65 percent of animal rights activists claimed to be atheist 
Or agnostic when queried about their religious affiliations (Jasper and Poulsen 
1995:502). “This lack of conventional religious beliefs most sharply distinguishes 
both animal rights activists and vegetarians from the population at large.” (Jasper 
and Poulsen 1995:502) 

Why do these movements attract so many women, particularly agnostics and 
atheists? Eating light has been associated with femininity (Maurer 2002). Women 
have also been socialized to be compassionate to suffering (Adams 1991; Jasper 
1995; Maurer 2002). Being victims of systemic and historic oppression, Adams 
(1991) suggests that it is easier for women to empathize with the plight of animals 
destined for the butcher's block. 

Revisiting Jasper’s conceptualization of social movements as moral 
outlet, and mindful of their unconventional religious orientation, it is easier to 
understand how vegetarianism and animal rights activities could also fulfill the 
spiritual needs of the members of these groups. For many, commitment to 
the principles of vegetarianism and animal rights is part of an individual, even 
communal, spiritual ethic. 

Despite similarities, there are important differences between vegetarians 
and animal rights activists. "Neither vegetarian nor animal rights groups nor 
their constituents are coterminous." (Maurer 2002:59) Not all animal rights 
activists are vegetarians (in a recent study by Maurer 37 percent were not) and, 
as noted above, many vegetarians are not motivated by concerns about animal 
rights (Maurer 2002). The process of coming to identify with these movements 
also frequently differs. Most vegetarians become so gradually, usually after an 
incubation of a few years. The process often entails interaction in intimate social 
networks with other vegetarians, typically including a family member or friend. In 
the absence of either, local vegetarian groups provide this personal support. One 
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organization leader advises, "If you want to see a vegetarian world, be the nicest, 
friendliest person you can be." (Maurer 2002:129) 


Moral Shocks and Condensing Symbols 


Examining recruitment patterns in the animal rights movement, however, Jasper 
and Poulsen (1995) found most members have not been brought in through 
camaraderie and social networks, but rather through exposure to movement 
literature, moral shocks and condensing symbols. Moral shocks are events or 
situations that raise such a sense of outrage that individuals are inclined to react 
politically in response to them. For example, the most significant upward spike 
in the membership of pro-life organizations occurred in the year immediately 
following Roe v. Wade (Jasper and Poulsen 1995). 

Social movements attempt to generate moral claims and shock through 
rhetorical appeals or the use of condensing symbols: "...verbal or visual images 
that neatly capture — both cognitively and emotionally — a range of meanings 
and convey a frame, master frame, or theme. Organizers use such symbols to 
recruit members, especially strangers.” (Jasper and Poulsen 1995:498) Effective 
condensing symbols represent “god terms” or “unquestionable moral absolutes.” 
Shock appeals are no stranger to the animal rights movement; indeed, “Shocking 
visual images are perhaps their most powerful tool.” (Jasper and Nelkin 1992:44) 
Some of the more powerful condensing symbols offered by animal rights social 
movements in recent memory include graphic images of baby seals with heads 
bloodied from being clubbed to death for their fur and “kittens with their eyes 
sewn shut” in experimental laboratories (Jasper and Nelkin 1992:44). 


Framing 


The values that social movements such as PETA promote may not resonate 
with, and may even be antithetical to, popular beliefs (Snow et al 1986). It is 
the defining task of these movements to “[devise] packages that are culturally 
contrary and resonant by using specific strategies." (d'Anjou and Van Male 
1998:212) Accordingly, groups such as PETA must frame their arguments with 
care. Framing refers to the way social movement actors present their arguments 
in a manner that may resonate with those outside the movement. 


Frames are... schemata of interpretation that enable 
individuals to locate, perceive, identify and label occurrences 
within their life space and the world at large. By rendering 
events or occurrences meaningful, frames function to 
organize experience and guide action, whether individual 
or collective. (Snow 1986:464) 


A stark and accessible example of framing is seen in the abortion debate. "Pro- 
life" organizations characterize abortion as the murder of an innocent unborn 
child, and often depict graphic images of aborted fetuses in their literature. The 
pro-choice movement argues for women's rights to autonomy and control of 
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their bodies, free from governmental and societal control, without any equally 
strong, visceral image as a counterpoint. 


Frame alignment refers to the linkage of individual and social 
movement organizations' interpretive orientations, such that 
some set of individual interests, values, and beliefs and social 
movement organization activities, goals and ideologies are 
congruent and complementary. (Snow 1986:464) 


Snow et al. (1986:467-72) discuss four frame alignment techniques: frame 
bridging, amplification, extension and transformation. Frame bridging refers to 
"the linkage of two or more ideologically congruent but structurally unconnected 
frames regarding a particular issue or problem," essentially, drawing in those 
with similar ideological inclinations via mass media, such as mail or email 
solicitation. Frame amplification refers to the "clarification and invigoration of 
an interpretive frame that bears on a particular issue problem or set of events" 
or the psychological unearthing of issues one may have previously considered, 
which are invigorated with a nudge from a social movement organization's 
proselytizing efforts. Frame extension refers to the expansion of a social 
movement organization's "boundaries of its primary framework" so as to address 
or include issues that may be of immediate concern to potential adherents, yet 
of peripheral import to its main objectives. For example, when leaders of the 
Austin Peace and Justice Coalition were concerned that their appeal was mostly 
to "white, middle-class baby-boomers," they attempted to reach out to minority 
groups with the addition of a fourth goal to their promotional literature: "To 
promote social justice by non-violently confronting racism, sexism, and all forms 

of discrimination and oppression." 

Frame transformation is the most extreme and personal form of framing. New 
values are planted, old ones jettisoned, and contrary beliefs reframed, resulting 
in a transformation of frame. A phenomenon is reconceptualized so that what 
was once tolerable is now immoral and unacceptable. Animal rights groups, in 
particular, have often successfully used moral shock tactics to transform frames. 
A practice common at regular public meetings of one animal rights group is 
"giving witness to one's conversion" to the movement, relating the experience or 
stimulus that shocked one into activism (Jasper and Nelkin 1992). 

Another method of framing, pioneered by the vegetarian movement, is to 
attempt to evoke the donor animal from which the food on one's plate originated. 
The goal is to make present the "absent referent," or make one see on the plate 
not "meat" but rather, "dead cow" or "dead pig." (Adams 1991) 


The Risks of Moral Shocks and Condensing Symbols 


Incendiary or extreme arguments, demonstrations, campaigns or literature 
may prove effective at connecting with subjects on an immediate and visceral 
level, engendering an instantaneous frame transformation. Short of this, images 
designed to morally shock might catch the attention of a subject when less 
controversial images might be ignored altogether. While the subject may not be 
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instantly transformed or may even react negatively, the incendiary image may 
represent the beginning of a journey of transformation. In short, an image may 
be effective simply by catching a subject's attention. The success of moral shock 
tactics is well documented in the animal rights movement literature (Jasper and 
Nelkin 1992; Jasper and Poulsen 1995). 

However, it is equally plausible that such an approach may deeply offend, 
resulting in a backlash against the organization, undermining its credibility and 
tainting the movement as a whole. According to Jasper and Nelkin (1992): 


Rhetoric that compares animal suffering with the holocaust, 
that equates speciesism with racism, has emotive power 
for those who blur boundaries between humans and other 
species. For others, these metaphors appear outlandish, 
threatening, dangerously defying accepted categories. (P7) 


The possibility of backlash is particularly salient for a group such as PETA, 
which represents for many the entire animal rights movement. Framing an 
argument in extreme terms carries many risks. As the media often "detextualize" 
events by focusing on sensational aspects of a protest — indeed, are compelled 
to do so by the dynamics of news reportage - a situation may arise in which “the 
actions of a small minority may come to represent, in the eyes of the public, 
an entire movement." (Kruse 2001:69; Gitlin 2003) When demonstrators create 
disturbances or engage in sensational acts, the demonstrators, not the issue 
in which they are engaged, become the focus of attention. This may "isolate 
protestors from the larger public who otherwise might have agreed with their 
cause... this situation also has the potential to reinforce the perception that the 
group is a fringe element of society." (Kruse 2001:69) Of course this affects the 
way the organization is portrayed in the media, and how the public perceives it. 
“If interest groups are portrayed positively, their respectability may be enhanced. 
On the other hand, if they are portrayed in a negative light, their legitimacy may 
be undermined, and the public may be discouraged from identifying with the 
group.” (Kruse 2001:69) Legitimacy then becomes an issue for the success of 
future attempts to frame the group's messages. 


Data and Methods 


Jasper and Poulsen (1995) suggest that one limitation to their research is that it 
worked in reverse, from protestors to their worldviews, derived from surveys and 
interviews with activists after their recruitment into a movement. Most notably, 
this may have led respondents to reconstruct their initiation and construct a linear 
narrative that may not have so neatly existed. Correspondingly, they conclude 
that future research should "examine members of the public and tease out their 
cultural understandings, as Gamson has done through the use of focus groups. 
Then we could see how movement organizers try to find condensing symbols 
that appeal to these understandings." (Jasper and Poulsen 1995:488) Kitzinger 
and Barbour (1999) elucidate the reasons for the applicability and relevance of 
focus groups for harnessing visceral reactions and thoughts to stimuli: 
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` 


Focus groups are ideal for exploring people's experiences, 
opinions, wishes and concerns. The method is particularly 
useful for allowing participants to generate their own 
questions, frames and concepts, and to pursue their own 
priorities on their own terms, in their own vocabulary. (P 5) 


By initiating the discussions with open-ended questions, discussants are 
allowed to express immediate and visceral reactions and introduce language and 
relevant terms untainted by the moderator's preconceived ideas. Consequently, 
focus groups "can also provide insight into the way social movements' targets 
interpret the movement." (Blee and Taylor 2002:107) 

| moderated eight focus group discussions in May 2003. The size of the groups 
ranged in number from 4 to 10, with 52 discussants ultimately participating. 
Discussants were recruited through an open plea in a large introductory sociology 
class at a major university, with approximately 350 students. Compensation 
for participation took the form of exemption from an upcoming paper project 
comprising 5 percent of the overall grade. Enrollment in this class should imply 
neither interest in the course content nor any particular intellectual or academic 
leanings: Sociology 101 is a general education course cutting a wide swath 
across many majors on a campus of tens of thousands. The sample is thus 
representative of the larger student body — not merely sociology majors. Because 
PETA has also actively targeted college campuses in many recent campaigns, the 
participants reflect an audience PETA has explicitly sought to recruit. 

The data are admittedly limited to college students; thus, general conclusions 
need to be tempered. Nevertheless, within the sample there is significant 
heterogeneity reflective of the larger public, specifically with regard to gender, 
religion and political affiliation. This bolsters confidence that the findings may be 
generalized to a larger, more general population, and particularly to the key target 
and recruitment audience to which PETA is directing much of its effort. 

Participants are on average 20 years old, with a range from ages 19 to 27. This 
age group is a prime target audience for such a campaign, as "...college students 
are particularly likely to lean toward vegetarianism; about 15 percent select a 
vegetarian dining hall option on any given day." (Maurer 2002:18) This is far greater 
than the 1 percent to 2.5 percent of the population estimated to be vegetarian 
in the United States. Thirty-one of the 52 (60 percent) participants are women 
and 45 (87 percent) are white, with the remaining seven African-American, Asian 
and Hispanic. In response to a brief demographic survey, 24 responded that they 
are non-partisan, 16 (31 percent) identified themselves as Republican and 12 (23 
percent) as Democrat. When queried about religious affiliation, Christians and 
Catholics predominated (15 and 13, respectively), while eight claim no religion. 
The rest self-identified among these faiths: Buddhism, Lutheran, Judaism, 
Jehovah's Witness, Methodist, Mormon and Greek Orthodox. 

Prior to the focus group discussions, visual materials were obtained from 
PETA itself, descriptions of which are detailed in the following section. The 
images represent a variety of media available for purchase (billboards, print ads, 
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buttons, stickers and literature). Some are lone visuals. Some are part of a larger 
thematic campaign, such as "The Holocaust on Your Plate" or JesusVeg.com. 
According to PETA organizer Noah Cooper, the images first appeared in different 
media outlets, targeting both niche audiences and the general public. PETA 
debuted the "Meat is Murder" image on their teen- oriented website, peta2. 
The Holocaust image was part of a traveling multi-media exhibit erected on 
college campuses and at city halls in major downtown areas. The “Jesus Was 
a Vegetarian" literature was distributed largely at Christian youth conferences 
(personal interview Feb. 3, 2006). 

One copy of each image was passed among the focus group participants. 
Discussion then proceeded in two steps. First, participants were asked to react 
to the images at their own pace and in the order that suited them. This allowed for 
investigator evaluation of transformation, extension and absent referent frames, 
especially the frames that provoked the most visceral reactions. The second step 
involved a more general, probing discussion of the various images taken as a 
whole, and the participants' reactions to them. 

PETA's ad campaign uses various framing techniques. | categorized them in a 
manner consistent with the frame alignment distinctions defined by Snow et al. 
(1986): transformation with special emphasis on moral shock, extension, absent 
referent and bridging. 


Transformation 


Three images utilize transformation framing. One simply states, in bold letters, 
"Meat is Murder." Another, which also has religious overtones, shows a sheep 
suspended by a single back leg, showering blood on a nearby wall, with the 
accompanying text: "Lamb of God. Choose Life! Go Vegetarian." Over the course 
of the focus group discussions what emerged as the most shocking ad is an 
image of emaciated concentration camp victims juxtaposed with chickens in 
factory farm cages with the caption, "To Animals, All People are Nazis." 


Extension 


Seven images use extension to frame appeals to the public. One features, 
simply, a waving American flag, accompanied by the words, "Proud to be a 
Vegetarian American." Two others feature attractive female models, scantily clad 
in garments composed entirely of foodstuffs, with the accompanying text: "Let 
vegetarianism grow on you!" Both ads seem to attempt to appeal to a male 
audience that otherwise might have little interest in pursuing either animal rights 
or vegetarianism. 

The most provocative of the extension frames are the four religiously-oriented 
ads. The "Blessed are the Merciful" ad offers a vision of a Caucasian, blue- 
eyed Jesus with an orange slice for a halo, presenting a lengthy Biblical quote 
beseeching us to care for all animals. The Jesus was a Vegetarian" ad features 
the statue of Jesus with outstretched arms above Rio de Janeiro, accompanied 
by Biblically-based arguments to “Go Vegetarian.” 
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JesusVeg.com 


“Animals are God's creatures, not human 
property, nor utilities, nor resources, nor 
commodities, but precious beings in God's 
sight. ... Christians whose eyes are fixed on 
the awfulness of crucifixion are in a special 
position to understand the awfulness of 
innocent suffering. The Cross of Christ is 
God's absolute identification with the weak, 
the powerless, and the vulnerable, but most 
of all with unprotected, undefended, 


innocent suffering.” 


—REV. ANDREW LINZEY 
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The third religiously based ad features Moses brandishing a bunch of carrots, 
admonishing, "I said, Thou Shalt Not Kill! Go Vegetarian." The "Lamb of God" ad, 
which also represents use of moral shock as a framing technique, is the fourth 
religiously oriented image that utilizes extension. 
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Bruce Friedrich, a senior campaign coordinator for PETA, in defense of the 
"faith-based argument for vegetarianism," asks to "consider, please, that every 
social justice movement has had religious support. Given the level of religious 
commitment in society, it seems unlikely that we will achieve animal liberation 
without it." (Friedrich 2002:36) But to whom will this campaign appeal? After 
all, extension frames resonate with those who are already predisposed to 
accept the movement's worldview or perspective (Jasper and Poulsen 1995). 
Will this argument appeal to those most likely to become vegetarians: atheists 
and agnostics? One should think that most would not care what Jesus would 
do. Will it resonate with Jesus' followers, imbued with the Judeo-Christian 
ethic of man’s dominion over the earth and its creatures? It seems to run 
counter to those beliefs. 


Absent Referent 


An image of a naked woman, sitting with her back to the observer, with different 
cuts of meat demarcated on her body, nicely reflects the absent referent 
technique. The accompanying text, “All animals have the same parts,” considers 
a fellow human (more specifically, for male audiences, a fit and attractive woman) 
in the same terms as we consider the animals we consume, and thereby makes 
present the absent animal. A more benign evocation of the absent referent is the 
image of a smiling cartoon pig, with the accompanying text "Meat's No Treat for 
Those You Eat!" 

Making present the absent referent, or donor animal, is disconcerting, as the 
specific animal that was killed is invoked. "Once meat is recognized as flesh 
like the consumer's own flesh, then the sight of fragments of the dismembered 
animal... represents an all too evocative vision of that consumer's own animal 
origins, and ultimate physical dissolution." (Beardsworth and Keil:286) 


Bridging 


Finally, analysis of bridge framing is possible, at least to a limited degree. We know 
that bridge groups for animal rights organizations include women and political 
liberals. Participants in the focus groups were identified on the basis of their sex, 
and at least in two cases, on the basis of their ideology. We would expect that 
women and liberals would view the ad campaign, generally, and specifically the 
ad that features a peace sign more favorably than other types of respondents. 
This information allows an investigation of whether PETAs ad campaign was 
more effective for members of bridge groups than it was for other participants. 
The literature suggests that each of the framing techniques used by PETA in 
theirad campaign has been effective in recruiting supporters to social movements. 
The questions for this research, then, are whether PETA has effectively used 
these frames, individually and collectively, to recruit new members, whether their 
use has created a backlash, or whether these frames have had little to no impact 
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at all. Here, effectiveness is measured both as the valence of participant reaction 
and the strength of that reaction. 


Findings 


Reactions to PETAs framing techniques were mixed. Reaction was strong 
and negative to ads that either shocked or challenged the participants' deeply 
held cultural beliefs, or those that lacked credibility with the participants. The 
campaign's effort to base its animal rights appeals on vegetarianism seemed 
to be especially problematic. Some ads were viewed more positively, but with 
less interest. Still others generated almost no reaction at all. Below | detail these 
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results and their implications 
for understanding framing and 
the conversion of movement 
participants more generally. 

The following quotes were 
selected as representative of 
consensual responses to the 
imagesacrossgroup discussions. 
Some represent extended 
exchanges among participants; 
most are individual comments. 
Though they may be selected 
excerpts from individuals, the 
statements were made in the 
context of sometimes heated 
debate, as discussants were 
often challenged and pushed to 
better articulate and further hone their arguments; thus the group discussions 
likely engendered clearer and more vivid responses than individual interviews 


would have. 





Transformation 


One purpose of the three moral shock ads used in the campaign (Figures 1-3) 
is to engender a frame transformation in the uninitiated. The reaction to this 
imagery by the participants in this study was visceral, immediate, strong and 
negative. The Nazi image was deeply offensive to many, particularly those of 
the Jewish faith, who were outraged by the comparison of concentration camp 
victims to animals: 


Female (F): l'm Jewish, and that Holocaustthing really bothers 
me. You can't take our biggest tragedy of exterminating six 
million people and say don't Kill an animal or eat a chicken! 
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An unprompted and consistently made comparison was that of this image 
and the bloody lamb to the "abortion trucks," small trucks that roam college 
campuses with billboard-sized graphic images of aborted fetuses: 


Male (MJ: | think it’s almost too much shock value where 
people would just not even, if they had to glance at it, they 
would just sort of turn around in disgust and just want to 
forget about it; whereas, if they did something less shocking, 
you might actually have people consider it maybe or discuss 
it, but when you see something that shocking, you just say, 
"Oh, that's gross, that's wrong," and then you just, you... you 
almost like become more angry at the advertiser than you do 
at people who are aborting babies. 


F: Yeah, like the trucks and whatever, it's like, it's really 
extreme. They be going really extreme like that... Oh, that 
was making me like, oh, that's so disgusting. They're sick. | 
think they're sick. They got sick minds to drive around with a 
dead fetus on the thing. | wouldn't be, "Oh, abortion's bad." 
/ would be like, “Oh, they're (those who display the image) 
sick people." That's what | would think. 


The overwhelmingly negative reaction to the moral shock ads may relate 
to the differences in pathways to protest that characterize animal rights and 
vegetarian groups. It makes sense that PETA, with movement leaders operating 
within the paradigm of moral shocks, would use similar tactics to try to persuade 
people to adopt a vegetarian lifestyle. The animal rights movement has a 
considerable degree of stranger initiation wherein members often are recruited 
through literature and moral shocks (Jasper and Nelkin 1992; Jasper and Poulsen 
1995); whereas vegetarians are usually converted via close contact with other 
vegetarians in personal networks, and the duration of the transformation is 
typically lengthy (Maurer 2002). 

The efficacy of moral shock framing for animal rights groups’ recruitment 
efforts may lie in the fact that the enemy or object of moral outrage is typically 
external to the target audience. It is easy to vilify and speak in morally damning 
terms about another person or entity. After all, itis the other who is clubbing baby 
seals, planting electrodes in cats' heads at research laboratories, and running 
a circus: “Most moral shocks inform the viewer about what others — scientific 
researchers, Circus trainers, and cosmetics companies - do to animals." (Jasper 
and Nelkin 1992:44) 

However, when one is confronted with personal and aggressive attacks 
condemning meat consumption, it is one’s own behavior being condemned: the 
enemy is thyself. The discussants took umbrage at both the content and the 
portrayal of themselves as culprits. Thus, it could be that moral shock campaigns 
are ineffective when promoting vegetarianism, because condemning meat 
consumption (as opposed to other violations of animal rights) inevitably forces 
people to confront their own behavior (as opposed to that of others), and they 
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are less likely to join a cause that requires them to make fundamental changes in 
what is such a deeply ingrained lifestyle. 

Based on what was a nearly unanimous reaction to these images, it is safe to 
conclude that they were ineffective in evoking a positive response - positive in 
the sense that the audience accepted the logic of or were persuaded by PETAs 
efforts. Indeed, more than one discussant expressed the belief that these images 
undermined the credibility of PETA, which clearly is not conducive to recruiting 
members. The only potentially encouraging result for PETA is that the moral shock 
ads caught the attention of nearly every discussant and the responses were notably 
strong. Some of the other ads went unnoticed or elicited only tepid responses. 

This raises the question, is it preferable to get noticed in a negative way or 
not at all? The latter seems to hold virtually no potential for recruiting members. 
The former may not fare much better, but holds at least some potential for being 
part of a long process of conversion and recruitment. Although the initial reaction 
was negative, these images could linger and, combined with other reinforcing 
life experiences, might eventually change one's outlook. Given the fundamental 
personal changes required for identifying with animal rights — specifically, 
vegetarianism — it would hardly be surprising that, for many, it would be a long, 
slow transition. Gamson notes, "...the process of adopting an injustice frame... 
takes time and is rarely compressed into a single encounter." (1992:73) For some, 
attention-grabbing moral shock campaigns may be the first step in a process of 
transformation. 


Extension 


Seven images used a frame extending technique: the ad with a patriotic theme, 
four ads utilizing religious themes and the two ads that relied on sex appeal 
(Figures 4-10). Many took offense at the "Proud to be a Vegetarian American" 
image, angered that their "patriotism was being questioned" if they did not 
adopt a vegetarian diet. Others saw it as a shameless attempt to "jump on 
the bandwagon," a transparent ploy to play on Americans' post-September 11 
emotions. Notably, many of the male participants — and none of the women — 
responded negatively: 


M: What are they doing sticking my flag on their 
vegetarianism? Because l'm not a vegetarian, | don't have 
a problem with vegetarianism but it seems like they're 
saying all Americans are vegetarians. | would hate to have 
that confusion and have to keep working at telling that to 
everybody | meet. 


M: | was just going to say | bet they were jumping on the 
bandwagon and | bet that was made right after 9/11. And 
like everyone had American flags — they were just trying to 
say, “Hey, look, we're patriotic, too!" | think that's what they 
did here. After 9/11 everyone had American flags and they 
were just trying to fit in with everybody else. 
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M: It sorta makes me almost go against what they're saying 
because of some of the messages like that flag there. They 
try to make it sound like if you're not a vegetarian you're 
un-American. 


The opinions regarding the religious themes were consistently strong and 
negative. Similar to the reactions to the moral shock ads, most discussants 
disliked being made to feel guilty and expressed resentment at what they 
considered a personal attack: 


M: OK. | mean, | guess | can just agree with everybody else, 
like if | saw any type of advertisement that was calling me 
a bad person l'd more, like, resent it than go along with it 
or think about it, but if | saw something that was like clever 
and humorous like the “Let Vegetarianism Grow on You,” 
like they're not making a statement that's like giving you 
an ultimatum, they're just giving you another option... than 
feeling trapped or backed into a situation where they feel 
like they have to defend themselves. 


F: | don't like being told what to do. And | don't like being 
told somebody's better than me. ...like [if it's] just worth 
somebody being proud of, like, to me that's more interesting, 
I'd like be a lot more willing to listen to somebody that was 
approaching it in a pride way and telling me it's good to 
be a vegetarian, like, why they're proud to be vegetarians. 
To me that's a lot more reasonable for me to listen to than 
somebody telling me why l'm wrong and why they hate me 
and why they think they're better than me. 


A second common reaction was that the religious messages were factually 
wrong. Many discussants offered an informed rebuttal to the Biblical interpretation 
offered by PETA: 


M: Yeah, | didn’t like the one about Jesus Was a Vegetarian 
and stuff. It’s like... 


M: He was a fisherman. Catching fish. | mean, come 
on now... 


F: I think all the religious ones are kind of offensive because 
/ don't think it says anywhere in the Bible that you can't eat 
meat... l'm a Christian and | think that's kind of dumb saying 
that you can't eat meat. 
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F: Using the Bible to turn people into vegetarians... It's in the 
Bible that, you know, they talk about the feasts they had and 
there was meat there, and they're (PETA) talking about how 
He's a vegetarian. They're, they're lying, and | mean they're 
using religion, they're using Jesus, to say, "Be a vegetarian." 
They're lying about... that's not true, that's wrong. 


F: I think they're taken out of context like a lot of things in 
religion. Some specific ones that | know are out of context, 
the, you know, the animal sacrifice... | know historically and 
religiously about the animal sacrifices in the temple... | mean 
obviously, | mean like the Nazi thing, like wow, | disagree 
with their method, | must disagree with them, same thing. 
If | can prove one of their facts are false then what's to say 
their other facts aren't false? 


With these images PETA apparently has violated a major tenet of frame 
extending, which is to reach out to the target group in a way that is credible with 
its members. "Framing works when the statements are attributed to a credible 
source; framing fails when the same statements are attributed to a non-credible 
source." (Druckman 2002:1059) 

There was a somewhat positive reaction to the sex appeal campaign - though 
probably not what PETA intended. Many men noticed the women, yet said that 
the women were the only things they noticed, and did not or would not take the 
accompanying appeal into account. 


M: (referring to a PETA anti-fur campaign when speaking of 
the sexcappeal ads) l'm partial to the PETA ads with celebrities 
in them where they're just like, "I'd rather not wear anything 
than wear fur.” 


Moderator: And why are you partial to those? 


M: Because they're beautiful celebrities. It doesn't make 
me want to be a vegetarian. It just makes me wanna see 
more posters. 


The mildly positive reaction to the sex appeal ads might suggest nothing 
more than what anyone familiar with Madison Avenue already knows: sex sells. 
However, PETA might well ask in this case, "Sells what?" 


Absent Referent 


Absent referent framing uses imagery that invokes the hidden victim. This 
technique is used in the ad with cuts of meat drawn on a nude woman and in 
the "Meat is No Treat" ads (Figures 11 and 12). The former produced a reaction 
from a number of women, whereas the latter generated almost no response 


936 + Social Forces Volume 85, Number 2 * December 2006 


from the participants. Perhaps it is not surprising that only women mentioned 
being affected by the image of the woman with the various cuts of meat 
demarcated on her body. Typical comments reflected a certain uneasiness that 
was generated by the ad: 


F: The female all marked off as ground and ribs and chuck 
— | don't know. | can't cook meat myself if | know what part 
of the animal | eat, like | can't eat chicken if | was to hold the 
leg of a chicken. | get grossed out by that. It just kinda makes 
me think of the different parts, you know, comparing it. 


F: Ive gotta say the one where the parts are marked 
down only because, you know, when I eat a hamburger, 
it doesn't look like the cow... | dont necessarily think 
about what I’m eating. 


Perhaps significantly, no men expressed an interest in the attractive nude 
model, as some did in the case of the revealing images of the ads that were used 
in the extension frames. It is possible that the image of a beautiful nude model 
whose body is marked for cuts of meat is more disconcerting than scantily clad 
women sporting vegetables for clothes. 


Bridging 


PETA utilized the bridging technique in its “peace sign” ad (Figure 13). More 
generally, we would expect participants who identified with ideologically 
compatible groups to be more receptive to PETAs campaign. Because there 
was no significant reaction to the peace sign ad, most of this analysis is based 
on the more general discussion following participants' review of the individual 
images. Two women who fit the demographic profile for receptivity to animal 
rights (liberal, feminist, intelligent) provided the most trenchant insights. 

First, à woman who had given the matter some prior thought and who was the 
only participant who had previously visited the PETA website: 


F: There is a more informative way to go about it. | don't 
eat... | have always eaten red meat, but | don't eat red meat 
anymore. l took a class at Ohio State, and it was not pushing 
vegetarianism at all. It was an oncology class and it was 
about cancer and like, just by presenting the facts on you 
know, nitrates and stuff that are in red meat, carcinogens, 
when the food is processed, it made me reevaluate my 
own diet for a healthier lifestyle based on facts and not so 
much opinion. So, | think that was a much more persuasive 
method to get someone to see how maybe meat isn't the 
best thing for you because look at what it does to the human 
body from a biological standpoint... | think that would make 
a better argument than saying, well, Jesus was a vegetarian 
because a lot of people say, ‘I don't believe in Jesus, | dont 
care what He was.’ 
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Next, a liberal activist who has considered vegetarianism several times: 


F: Well, I| don't know, it's tough. | think none of them 
are particularly effective... Also the women ones — as a 
feminist, ads that compare women to food, consciously or 
subconsciously, turn me off completely. Especially PETA , 
as a liberal organization, | would expect a lot more from 
them and that would really disappoint me... | think a totally 
different type of advertising might appeal to me better as 
a person who is already liberal and involved in all different 
causes and has considered vegetarianism several times... 
Give me words, but not just pictures of what's wrong with 
animal preparation and how. Give me a reason, not just a 
peace sign or a picture of Jesus or a half-naked woman. So 
/ think none of these are really particularly effective towards 
me. They all kind of turn me against PETA. 


Moderator: What would appeal to you most? 


F: Something more logical and something that sunk in 
with my belief system that | already have and wouldn't 
require an entirely different one. Like for somebody that is 
already liberal and already tries to think in more progressive 
ways. Something that isn't trying to shock me so much as 
encourage me. 


On the basis of comments from at least two participants, it seems that 
members of bridge groups desire more than an ad campaign heavy on imagery 
and shock value to sway them to the animal rights cause. 


Discussion and Conclusion 


At first blush, one might conclude that PETAs recent campaigns have not 
served as an effective recruiting tool for the organization or for the animal rights 
movement. Certainly no participants in this study exhibited the transformational 
epiphany that has been identified in other studies of recruitment to animal rights 
groups (Jasper and Nelkin 1992). Indeed, the images designed to produce moral 
shocks, as well as the religious and patriotic ads, and the ad with the meat cuts 
on a nude woman were met with strong negative reactions. None of the other 
ads were met with either the strength or direction of feelings of these ads. 

Yet, intensely negative reactions had distinct foundations. The patriotic ads, 
it seems, stretch credibility because they tend to be associated with shameless 
opportunism. The religious ads failed, in part, because of what is widely perceived 
as faulty interpretation of the Bible. However, the religious ads also failed to elicit 
a positive reaction because they raise contradictions in participants' patterns 
of thought and force them to confront their own values and behavior as part 
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of the problem identified by the ad campaign. Similarly, the moral shock ads 
and the ad featuring the nude woman with meat cuts drawn on her body were 
viewed negatively because they challenge deeply ingrained cultural beliefs of the 
respondents. Many of the same participants who reacted negatively to the ads 
also struggle with the moral quandary that resulted from the contradictions in 
their beliefs that were exposed by these ads. 

For some, the moral quandary will be resolved simply by not thinking about 
it, much less joining the animal rights movement. For others, however, exposure 
to the more challenging images of the ad campaign could represent the first 
step in a journey that will lead to joining the movement. Working out the moral 
quandary for them could mean the eventual transformation of their beliefs and 
lifestyles to bring them into alignment with the goals and values of the animal 
rights movement. Indeed, it is possible that a viewer may retain in one's mind 
a vague connection between, for example, Nazis and animal consumers, and 
over time forget that they initially rejected the association. Therefore, it may be 
premature to conclude that the ads that produced the most strongly negative 
reactions were ineffective recruitment devices. 

If it is credible to conclude that intensely negative reactions to PETAs ad 
campaign might constitute an initial, albeit shocking, wake-up call for some 
potential recruits who have not considered animal rights or vegetarianism in 
any serious way heretofore, there is little support for the proposition that the ad 
campaign effectively reaches those who are closer to joining the animal rights 
movement. Terse ads and emotional appeals appear not to be enough to convince 
members of bridge groups who are considering joining the movement. At this 
stage of their development they likely require more reasoned factual information 
to make the transition. 

This presents a problem for social movements, such as animal rights groups, 
which seek to expand membership. Potential recruits are diverse and at various 
levels of receptivity to an organization s message. One type of campaign may 
be more effective for some types of potential recruits than others, and some 
campaigns might alienate some potential recruits while effectively reaching 
others. This suggests that groups that engage in broad based campaigns must 
target their audiences carefully, and deliver the appropriate message to each 
target audience. 

Significantly, the four ads that were viewed most positively — the two sex 
appeal ads, the cartoonish "Meat is No Treat" ad, and the peace sign ad 
— also generated the least notice. Not coincidently, they were also the most 
innocuous and least threatening of the ads. This raises another problem facing 
social movement organizations in their efforts to recruit: Is it better to conduct 
strong visceral campaigns that, at least, initially turn people off, or is it better 
to offer more innocuous, less personally threatening campaigns that might not 
produce any significant reaction? If a movement opts for the incendiary route, 
there is a further risk in addition to directly alienating the immediate audience 
for the imagery: the more outrageous campaigns may catch the attention of 
the media, which may report on the specific campaign and larger movement in 
negative terms and reach an audience that far eclipses the outreach efforts of the 
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organization. Examples of this include both civil rights and Jewish organizations 
having publicly taken issue with recent PETA campaigns. 

Animal rights groups such as PETA face a paradox when attempting to recruit 
vegetarians to the cause. Animal rights and vegetarianism are closely linked; we 
would expect the latter to serve as one of the former's closest bridge groups. Yet, 
it is likely that the kinds of recruitment techniques, such as moral shocks, which 
have been documented as effective recruitment tools for animal rights groups, 
are ineffective for vegetarians. The reason could be that moral shock, and similar 
types of visceral framing, are most effective when the "other" is identified as 
the source of the problems. But because vegetarianism is typically an intense 
personal commitment, a vegetarian appeal grounded in traditional animal rights 
rhetoric inevitably implicates the self as, at least, part of the problem. For many, 
the moral quandaries created may result in rejection of the movement, at least 
in the short run. 

When using the extension frame, groups such as PETA run the risk of losing 
credibility when their claims appear to have little relationship to the associated 
message and thus appear opportunistic (as in the case of the patriotic ad) or 
when they are perceived to contain factual errors. Attenuated claims that do not 
ring true with the target audience are not likely to be taken seriously. 

These issues of rhetorical packaging are not specific to PETA; indeed they 
are universal dilemmas faced by social movement organizers for whom there is 
rarely one best solution. Jasper (2004) lists some of the major strategic choices 
with which social movement leaders must reconcile, both implicitly and explicitly, 
and with which PETA leaders have reckoned in their outreach efforts. One is 
the Extension Dilemma: "The further you expand your group (or alliance), the 
less coherent your goals and actions can be." (P7) Negotiating this dilemma 
involves articulating the collective identity of a movement, and the degree to 
which membership should be open or closed. The "Reaching Out or Reaching 
In" Dilemma obliges leaders to choose which audience they wish to reach: those 
already within, or at least sympathetic to, a movement, or the uninitiated: "...two 
very different audiences." (P. 10) 

With finite resources and fragmented media outlets, diverse audiences with 
differing pathways to protest, PETA and other social movement organizations 
must craft their message and target their audience with care, lest they risk 
alienating those most likely to offer active, symbolic and financial support. 
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Tocquevillian Moments: 
Charitable Contributions by Chinese Private Entrepreneurs 


Dali Ma, University of Chicago 
William L. Parish, University of Chicago 


Using a 1995 national survey of 2,870 Chinese private entrepreneurs, 
this article examines collaboration between private business and 
government in times of economic transition. Much as in the late 18th 
century situation in France as described by Tocqueville, special moments 
occur when a newly emerging business class offers monetary payments 
for charity and for honorary offices. In exchange, the government, 
desperate for new sources of revenue, provides social recognition and 
political access. The data suggest that into at least the 1990s Chinese 
private entrepreneurs donated generously to government welfare 
projects, and in exchange gained political access and social status via 
appointment to political councils. 


There can be few, if any, parallels for this intense desire of the middle- 
class Frenchman to cut an official figure; no sooner did he find himself in 
possession of a small capital sum than he expended it on buying an official 
post... (Tocqueville [1858] 1955.97). 


Generally speaking, an investment in office was an investment in standing. 
What made it desirable was the status, the respectability that it conferred. 
(Taylor 1967:478). 


Tocqueville and Taylor were speaking of pre-1789 France, a moment when the 
state and the emerging bourgeoisie compromised because the state needed 
revenue and the bourgeoisie wanted status enhancement and protection from 
onerous bureaucratic fees. In exchange for the purchase of newly created official 
positions, the state conferred status and protection for newly affluent groups 
in society (Garrioch 1996). Similar patterns emerged in other commercializing 
societies in the late 1700s (e.g., England and Russia), with honorific titles and 
noble status granted in exchange for contributions and exceptional service 
by businessmen (Rieber 1982:33-38). The thrust of this paper is that in rare 
moments, when the business class has little access to power or prestige and 
when the government is short of revenue, special exchange relations between 
business and government emerge. 
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Introduction 


Charity links private business, the government and civil society together. Several 
studies of developed Western societies address how social pressure and social 
networks increase business charity to civil society (Galaskiewicz 1985; Useem 
1984), and a larger literature explores lobbying, campaign contributions and other 
forms of political activity by private business (Clawson et al. 1986; Clawson and 
Neustadtl 1989). These two lines of inquiry are typically separate from each other 
(Mitchell 1989; Hansen and Mitchell 2000). Nevertheless, during certain historical 
moments, the two types of business intervention are joined, with business 
charity filling the roles otherwise assumed by campaign contributions and other 
more direct forms of political activity (Mitchell 1989; Sánchez 2000). Top-down 
political control precludes non-governmental organizations that might otherwise 
provide welfare services supported by private charity and an absence of elections 
and electoral campaigns avoids politician reliance on business for campaign 
funds. These patterns leave only the government channel available for business 
influence. France (and other European societies) in the late 1700s and China in 
the 1990s, we argue, provide ideal-typical examples of these conditions. 

In 1990s' China, an emerging private business class began to give generously 
through local governments and quasi-government agencies (Dickson 2003; Fu 
2000; Kahn 2003; Shue 1998; Xin and Zhang 1999; Young 2002). We extend 
existing work on China with a larger sample and with more details about social 
and political ways through which entrepreneurs benefited from contributions. 
We argue that Chinese private entrepreneurs gave considerable charitable 
contributions because these contributions elicited social and political benefits 
in return. This pattern adds new complexity to debates about market and state 
interactions during periods of market transition (Nee 1989; Walder 2002; Xie 
and Hannum 1996; Zhou 2000). Oberschall (1996) and Cao (2001) complain that 
the market transition debate's focus on income and income inequality ignores 
many central aspects of Chinese reform. Our paper suggests that a lingering 
role for government and uncertainty over privileges and government regulation 
caused private entrepreneurs to remain anxious about access to the government 
bureaucracy. Private contributions funneled through government agencies - and 
the political access and social status that these contributions bought — helped 
allay some of this anxiety. 


Business Philanthropy and Political Action 


Some argue (Himmelstein 1997; Mitchell 1989; Neiheisel 1994; Sánchez 2000; 
Useem 1984, 1991) that in moments of a legitimacy crisis, such as in the United 
States before World War Il and in El Salvador in the 1990s, businesses turned 
to highly visible charity as a public relations effort. In countries with weak legal 
regulations, businesses often gain access to government officials through 
corruption (Rose-Ackerman 1978, 1999), but in mature democracies, they 
normally achieve access via campaign contributions and direct lobbying. Net 
of size and visibility, more highly regulated businesses and others with more 
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Figure 1. Exchange Relations in Permeable States with Contested Elections 


Notes: Shaded boxes = actors. Open boxes 


government contact are the more vigorous lobbyists and campaign contributors 
(Grier, Munger and Roberts 1994; Hansen and Mitchell 2000; Mitchell, Hansen 
and Jepsen 1997). 

A more general model of state-business relations and charitable giving is 
summarized in Figure 1. In permeable states with many contested elections 
encouraging campaign donations, private businesses often gain access to 
politicians through campaign donations (Figure 1, top path), and gain legitimacy 
and marketing visibility from a buying public by giving to non-governmental 
organizations (Figure 1, bottom path). In permeable states, businesses face two 
sets of uncertainties — one from government regulation and taxation (top left) 
and another from the public's image of the firm (bottom left). This public image 
is particularly important when the firm is in direct retail sales to consumers (Burt 
1983) and when its pollution activities or simple visibility because of large size 
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subject the firm to public criticism (Grier, Munger and Roberts 1994; Hansen and 
Mitchell 2000; Mitchell, Hansen and Jepsen 1997). Other possible influences 
on the lower path include peer pressure from owners and managers who share 
membership in business associations and from simple emulation of other 
successful entrepreneurs (Galaskiewicz 1985). 

In more closed societies ruled by strong states with few contested elections, 
we suggest that campaign contributions are rare, and non-governmental 
organizations are discouraged by a government jealous of its power (Figure 2). 
Therefore, charitable contributions channeled through government provide both 
the legitimacy and the access functions. Or, both sets of uncertainties can be 
ameliorated only through the government, with the continuing control of non- 
profit organizations (i.e., neo-government organizations) further encouraging 
these tendencies. China in the 1990s epitomized the second pattern. 

In China in the 1990s, one of the primary avenues of access to politicians — the 
funding of electoral campaigns — remained impossible. Corruption was an easy 
alternative, but it was fraught with dangers as more violators began to go to jail. 
Moreover, because corruption was usually dyadic, its benefits withered if the 
government official involved left the government or moved to another position. 
In contrast, an institutionalized government-private business relationship sealed 
by philanthropic contributions was less dangerous and more sustainable. Or, the 
contribution mode of building institutional ties provided an alternative clientalism ` 
(Pearson 1997; Wank 1999). When the government continued to control most 
charitable activities — schools, hospitals, public aid and infrastructure projects 
— both government officials and private business leaders benefited from the 
favorable publicity of their joint magnanimity. Hence, high rates of giving went 
through government-endorsed channels. 


Chinese Private Business and Government Welfare 


state extraction of taxes and relief supplies from the private sector, termed by 
Gates (1996) as the Chinese tributary mode of production (TMP), has had a long 
history in China. Chinese traditional merchants converted wealth into office 
holding to "benefit from TMP economic relations rather than to be exploited by 
them." (Gates 1996:38; also see Bun 2001) Largely banned between 1956 and 
1977, the sometimes faltering reemergence of the private sector in the 1980s 
and 1990s to some extent continued a tributary mode of production. 

Although the Chinese private sector grew rapidly after its 1980s legalization, 
several problems remained. First, state-business interactions remained troubled 
on several dimensions. For example, starting in 2002, the World Bank organized 
surveys of business leaders in some 50 developing countries (Dollar 2003; 
World Bank 2005). In these surveys of both public and private managers, 1,500 
Chinese businesses gave reports that ranked China among the bottom 10th of 
all societies in days spent each year meeting with tax officials (14 days), senior 
managers meeting with regulators (20 days) and inconsistency of interpretation 
of regulations by officials. x 

A second problem was access to sources of power. Until 2001, private 
entrepreneurs outside the Communist Party were not allowed to Join, and there 
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Figure 2: Exchange Relations in Strong States with Few Contested Elections and Few NGOs 


was lingering uncertainty about the legal status of private owners. Although 
private ownership was declared legal in 1988 by a law (not the Constitution), many 
owners continued to worry about possible policy reversals that would once again 
declare them illegal. One sign of this worry was that many of them registered 
as collective firms or as sub-units of state-owned firms, popularly termed "red- 
hatting." (da/ hong maozi, Wank 1999:33) 

Social approval also remained problematic. Even as China experienced rapid 
social change and money fetishism, private entrepreneurs found it difficult to 
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improve their social status. A number of early private entrepreneurs came from 
marginal social positions (ex-criminals and those with troubled political histories) 
faced discrimination. The highly publicized malfeasance of a few entrepreneurs 
tainted the reputation of all entrepreneurs, who also often faced envy from their 
less wealthy neighbors (Gold 1991; Wank 1999). In ways reminiscent of France 
in the 1700s, the mismatch between Chinese private entrepreneurs’ wealth 
and their social and political status motivated them to seek alternative ways to 
improve their status, one of which was to obtain a “signal” of respectable social 
status via membership on an honorific government council (Wank 1999: 193). 

The increasing Tocquevillian need by entrepreneurs to obtain status was 
matched by an increasing government need to fund public goods and welfare 
programs. Before 1978, the government handled urban social welfare via urban 
work units (danwei). In the countryside, except in natural calamity, people 
received almost no social welfare from the government. Lineages, the traditional 
social organization with welfare functions, were eviscerated in the 1950s. Under 
collective farming, the village had modest welfare provisions, but with the demise 
of collective farming in the early 1980s, these functions were often emasculated. 
Because Chinese society lacked non-profit, voluntary organizations to fill the void 
in welfare, society expected the government to continue providing social welfare 
(Croll 1999). ; 

The urgency for private donations was intensified by faltering government 
budgets (Blecher and Shue 1996; Oi 1992; Wong 1997). Local governments 
turned increasingly to blossoming private businesses for additional taxes, fees 
and charitable donations to subsidize their welfare and other responsibilities 
(Wank 1999). We suggest that getting placed on a political council helped hold tax 
and fee collectors at bay. Hence, a new quid-pro-quo between entrepreneurs and 
local government emerged. These tendencies coincided with the government 
officially legitimating private donations in 1994. A new set of tax laws, effective 
in January of that year, allowed tax exemptions for as much as 30 percent of an 
individual's and 3 percent of a business's taxable income (Chinese Corporate 
Income Tax Law 1994; Chinese Individual Income Tax Law 1994; Xin and Zhang 
1999). In Shanghai, as a percentage of total revenue, private firms gave twice as 
much as public firms (Fu 2000). 

The Chinese Constitution stipulates that the legislative People's Congress 
(renda) is the highest authority at each of five administrative levels — the national, 
provincial, city, county/urban district (qu) and township (xiang) levels. The Chinese 
electoral law requires direct elections of congress representatives at county/urban 
district and township levels, with provincial representatives and above chosen by 
members of lower congresses in a series of stepwise elections (O’Brien 1990). 
In practice, the congress's influence is limited (Lieberthal 2004; MacFarquhar 
1998; O'Brien 1990; O'Brien and Luehrmann 1998; Saich 2001; Zheng 1997). For 
example, Congress representatives can make suggestions to the government, 
but are forbidden to “meddle” in government and judicial work (O'Brien 1990). 
The nominating process remains secret, and campaigning is prohibited in most 
elections (O'Brien 1990). Nevertheless, voters at all levels have a choice among 
candidates, implying that candidates favored by both the government and the 
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public are more likely to be elected. Thus, charitable contributions, with their 
visible benefits for both the government and the public, are an ideal vehicle for 
winning a seat in congress. The three-year to five-year term of membership in 
a congress! is an honor (O'Brien 1990:132), not only because the congress is 
putatively the highest authority but also because many elites are representatives, 
including major party and government officials at each level. 


Methods 


The “1995 Survey of Private Firms” was jointly conducted by members of three 
of China's leading sociology groups at the Chinese Academy of Social Sciences, 
the Beijing Academy of Social Sciences and People's University. Respondents 
were owners of private firms and of a few other non-public firms. Chinese 
private firms register with the Industry and Commerce Bureau (gongshang ju). 
Using this register, a proportional quota of firms was allotted to each province. 
Within each province, at least three cities and three counties (for provinces 
with 10,000 or more registered private firms, at least six cities and six counties) 
were probabilistically selected. In each selected city/county, firms were divided 
into clusters according to urban/rural and industry categories. Firms were 
probabilistically selected from each cluster until the quota was met, with at 
least one firm being selected from each cluster. 

The survey was organized by the Industry and Commerce Association 
(gongshang lian), a semi-official organization of private firms, operating at the 
national, provincial, city and county/urban district levels. Following interviewer 
training sessions organized in different locales, staff of the research department 
from the association visited the private firms in the sample to conduct a face- 
to-face structured interview with each private entrepreneur. Interviews for 2,870 
firms were completed in 334 counties and urban districts from all 30 provinces 
and metropolitan areas of China, with a reported response rate of 95 percent. 

Unsurprisingly, given a natural inclination towards local boosterism and their 
link to local government, within their allotted quotas, the local association tended 
to pick their biggest and best firms. Thus, while all registered private firms 
averaged only 15 employees in 1995 (1996 Chinese Statistics Yearbook: 110), 
sample firms averaged 94 employees. The median number of employees in the 
sample was 30, implying that a few extremely large firms — the largest had 6,480 
employees — were pulling up the average for all sampled firms. 

Because our analysis does not focus on description of the national situation 
for private firms but on causal patterns, we see the over-representation of large 
firms as only a minor problem. Ideally, for our purposes, the sample includes a 
wide range of firms. Despite the over-representation of large firms, a full one- 
fourth of the sampled firms have 13 or fewer employees. The sampled firms are 
from both urban and rural areas and from most industry groups. There is, then, 
ample variation for use in the analysis that follows. 

The current survey is consistent with another survey of firms: In 1997-99, 
Dickson (2003) sampled private firms with annual revenue of at least $120,000 
drawn randomly from eight purposively-selected counties in four provinces. In 
his sample of 527 firms, 90 percent had made at least some contribution. This is 
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almost the same as the 89 percent of firms that had made some*contribution in 
the same revenue range in our 1995 national sample. The parallelism in this one 
result provides somewhat more confidence that the 1995 results are not just a 
quirk of the sample. 

We use structural equation modeling (Kaplan 2000) for multivariate analysis. 
In three equations, we analyze the determinants of (1) contributions, (2) political 
representative position, and (3) and perceived social status. The questions of 
particular interest include whether contributions increase an entrepreneur's 
chance of being a representative and, in turn, perceived social status. These 
possible tendencies are examined net of possible feedback effects, including 
the possible peer pressure on representatives to contribute and the possible 
tendency of higher social status individuals to contribute more generously. To 
cope with these possible feedback effects or endogeneity, we use Mplus 3 
(Muthén and Muthén 2004) to compute a mean- and variance- adjusted weighted 
least squares (WLSMV) (Muthén 1979, 1983, 1984) for structural equations in 
which the endogenous variables are continuous (contributions), binary (being a 
representative) and ordinal (perceived status). In the WLSMV estimation, a binary 
or a ordinal dependent variable is treated as a continuous latent response variable 
that exceeds thresholds to give the various outcome categories, such that it can 
be interpreted as a continuous variable. 


Results 


Descriptive Statistics 


Charitable Contributions 

In the survey, respondents reported whether they had made charitable 
contributions and the approximate amount, if they had ever given.? In responses 
to this pair of questions, 87 percent had given at least something in the past. The 
median of the cumulative contributions was $1,163 (U.S. or 10,000RMB), with a 
maximum of $1.3 million. For the average Chinese entrepreneur, whose firm had 
been in existence for only six years, these were not trivial figures. Even among 
larger firms with 50 or more employees, firms averaged only $40,000 in profit and 
4 percent reported income losses in 1994. 

We can only approximate the amount given in any of the prior years. This 
approximation is the average of cumulative giving, which we get by dividing 
cumulative contributions by the number of years in which the firm was registered 
(Table 1). This average annual contribution is used in an analysis of the correlates 
of becoming and remaining a political representative — an event that could have 
occurred at any time during the past three to five years, the usual membership 
term in a People's Congress. 

The survey also asked the entrepreneur what percent of post-tax profit went 
to charitable contributions. By adjusting for reported taxes and fees, we use this 
reported percentage to generate a new percentage of pre-tax profits that went 
to charitable contributions. To make the resulting percentages more comparable 
with U.S. data, we cap these percentages at 10 percent, the federal limit on 
donations that can be deducted from corporate income taxes. Among all sample 
firms, these calculations produce a figure of 1.4 percent of pre-tax profit given 
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Table 1: Descriptive Sample Statistics 


— U U H——á—— 


Mean S.D. Min Max 


Dependent Variables 


Annual contribution (historical average, logged) 1.12 3.32 0 14.22 
Representative position* .10 791 0 1 
Perceived social status 6.70 1.96 1 10 
Personal Characteristics 
Male* .90 .30 0 1 
Average age (17-29)* 12 33 0 1 
(30-39)* .43 .49 0 1 
(40-49)* .32 AT 0 1 
(50-72)* 13 .34 0 1 
Higher education* .18 .38 0 1 
Party member* AT EST 0 1 
Organizational Characteristics 
Revenue (historical average, logged) 14.02 1.64 8.05 19.54 
Personnel (historical average, logged) 3.95 1.24 0 8.79 
Retail sales* .55 .50 0 1 
Firm age (1-5)* AT .50 0 1 
(6-10)* .34 AT 0 1 
(11-13)* .19 .39 0 1 
Regional Characteristics 
Rural area * 42 AQ 0 1 
Income per capita in 1994 (logged) 1:90 Ki 5.48 9.98 
Total firms in the same county (logged 2.66 .94 0 4.19 
Note: (a) * indicates a dummy variable coded 0 or 1. 


(b) Number of observations vary from 2,737 to 2,870. 


to charity in China; for firms with the national mean of 15 employees, giving was 
1.2 percent of pre-tax profit. 

We compare corporate giving in China and the United States at comparable 
asset levels (Figure 3). At low asset levels, Chinese and American giving 
percentages were both more modest. However, at all comparable asset levels, 
Chinese giving was two to four times greater than U.S. giving percentages. With 
increasing asset levels, Chinese giving rose much more rapidly than in the United 
States — eventually exceeding 2 percent while U.S. giving exceeded 1 percent 
only among the very largest U.S. firms. 


Political Representation 
Ten percent of our sample included People's Congress members, nine-tenths of 


whom were at the lowest township or county level. Other, more detailed analyses 


952 » Social Forces Volume 85, Number 2 ° December 2006 


"SUI 'S'() ?u1 tr^ de[r240 3ou soop aut UOISs2821 osoutq^) 21) 10J [fAJ9]UT 
o»uopyuoo Juə21əd c6 aug, 'se3e3ua21od uornqruoo osautq;) Sune[n»[e» 210Joq ju221d QT 01 pepooar 319^ yuaorad QT aaoge sa8eyuasiod asautyD ‘Mery xe} 


‘S'N WIM puodso1100 03 se og 'dno13 ozis jesse teo ut suorje1od105/sun jo Joquinu əy} Áq pə1u8A “ep pədno:9 əu) JO suorssa1821 sa1enbs jsvo| A1eurp1o 


OM} WOI} 31e SIUN PONY AYL 'sojejs pouf) 2t ur suore10d.o» )09S PUR PUTYO UL SULIY MOJ UO pase ore mu əy} 3 SIPI jsoj[eurs YT, *$9I8IS PAYUN] 
ap ur suone1odaoo uormur ge pue eur? ur sur [96 uo poseq oe jjo| aq je So[1D 15281] ou s1oquinu ojnqosqe ug 'dnoJ8 qoeo ur sut jo uonsodosd oy} 
SI}VIIPUT 9ZIS ə|91tO "IDAJIS ənuəAƏəl1 [eur91u] "S IP Áq pox1odoi sau08oje» Suroj[oj *dno18 qvo Jo sjosse uerpour ayeurrxosdde əy} ye are SIPAT SON 

'(F661) 2214198 onuaA2 [eU.I23UT S'N SUNAJ IPBANd JO AdAING $661 :BUTYO :5221nog 


(ejeos peb6o[) (s000)$Sn U! sjessv 


000'00y  000'00) 000'01 000 | 00) 












S]IJ01d X81]-91d JO )u9219d 








əu Den  — — — sn e 
eu fen ------ eui) O 





reet ut sage PONUN əu) pue euruO ur AjLtet) 03 uaAr) sjgoad xeje1q jo ju2o1ed `€ 310314 


Contributions by Chinese Private Entrepreneurs * 953 


suggest that entrepreneurs were picked for higher level congresses when they 
had made very large donations. 


Social Status 

Subjective social status is common in stratification research (Centers 1949; Haug 
1977; Hodge and Treiman 1968). In our survey, entrepreneurs responded to a 
question that asked, "In comparison to other members of society, how do you rank 
on a ladder of social status?" The entrepreneur was given a set of stair-steps of social 
status, with 1 at the top step and 10 at the bottom step, and he/she then checked 
his/her step. In the analysis that follows, perceived social status is treated as an 
ordinal variable, similar to the MacArthur Scale of Subjective Social Status used by 
studies of the impact of status on health (Adler et. al. 2000). Using an inverted scale, 
with 10 indicating highest status, average perceived status was 6.7. 


Personal Characteristics 

Entrepreneurs were 90 percent male and averaged 42 years of age. Because we 
do not know exactly when in the life of the firm contributions were given or when 
the entrepreneur became a representative, we use "average age" in multivariate 
analysis — the average is calculated as the entrepreneur's age at the time the 
firm was registered added to the current age and divided by two. We categorize 
average age into four dummy variables. Of the respondents, 12, 43, 32 and 13 
percent were in the age groups 17-29, 30-39, 40-49 and 50-72 respectively, with 
the group age 17-29 being the comparison baseline in the multivariate analysis. 
Eighteen percent of entrepreneurs had higher education (junior college or above), 
and 17 percent of entrepreneurs were party members, with that membership 
secured before they became private entrepreneurs. 


Organizational Characteristics 

Average revenue (logged) is current revenue in 1994 plus revenue in the first 
year the firm was registered as a private firm, divided by two. Average personnel 
(logged) is the current number of employees plus employees in the first year, also 
divided by two. These average values provide a proxy for the situation in earlier 
years when an entrepreneur might have made a donation or been chosen to be 
a political representative. We add the variable of personnel to the equation of 
contributions because Thompson, Smith and Hood (1993) found that it was not 
gross sales but number of employees that significantly influenced small business 
giving in the United States. Averaging 6.2 years, firm age was categorized into 
three groups: 1-5(47 percent), 6-10 (34 percent), and 11 or more years (19 percent), 
with the group 1-5 years being the comparison baseline in later analysis. Retai/ 
sales typically increase business giving, with the motivation being to increase 
goodwill among potential consumers (Burt 1983). Fifty-five percent of the firms 
had retail sales both in the first year and in 1994, suggesting that it was very likely 
that the firm had retail sales consistently since they were founded. 


Regional Characteristics | | 
A rural location was the business site for 42 percent of all firms. /ncome per capita 


for the county/city averaged about Chinese $1,900 ($220 U.S.) — a high figure 
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because richer counties/cities produced more firms and becausë provinces and 
counties with more firms intentionally had a higher probability of being selected. 
Sampled firms per county ranged from 1 to 66, reflecting the larger number 
of firms per county from which the sample was drawn and the competitive 
environment in which the sample firms operated. Both income per capita and 
the number of competing firms could have affected entrepreneurs' giving levels, 
selection to political positions and perception of their social status. 


Multivariate Analysis 


The fit statistics of the structural equation model (Table 2) indicate that the null 
hypothesis stating that the model fits the data cannot be rejected. Charitable 
contributions had a highly significant effect on becoming a political representative. 
For a one unit increase in logged contributions, the latent continuous variable 
underlying being a representative increased by .16. A one unit increase in 
logged contributions and being a representative significantly increased a 
private entrepreneur's perception of social status by .11 and .15 respectively. 
The consequences of representative position and perceived social status for 
contributions were statistically insignificant. With regard to indirect effects, a 
one unit increase in logged contributions increased perceived social status by .02 
via being a representative (p = .05). None of the other three indirect effects, from 
representative position to perceived status via contributions, from perceived 
social status to representative position via contributions, or from representative 
position to contributions via perceived social status, was significant. 

The model in Table 2 included several other personal, organizational 
and regional characteristics — all designed to control for extraneous noise. 
. Several possible relationships were insignificant: On average, party members 
were unlikely to give large amounts. Net of contributions, average revenue 
increased neither the chances of becoming a representative nor perceived 
social status. Perceived social status was unrelated to gender, age, higher 
education or party membership. 

Several other features were more significant: Entrepreneurs gave more 
when they were in their 30s or older, and they were more likely to become 
a representative when they were at least 40 years of age. Contributions also 
increased when the firm had been operating for more than five years, was in 
retail sales, had ample revenue and employees, and was located in a county with 
fewer private firms. The chances of becoming a representative were greater for 
those who were female, had higher education, were party members, ran a more 
mature firm, and were in a rural area. Perceived social status was elevated not 
only by running a firm for more than five years, but also by being located in a rural 
area or a less affluent area (with less income per capita). 


Discussion 


We began with two sets of assertions: (1) Charitable contributions by Chinese 
private business were unusually generous in the 1990s. (2) Generosity brought 
additional social status with appointments to local political councils. Have these 
two assertions been documented? 


Contributions by Chinese Private Entrepreneurs * 955 


Table 2: Structural Equation Model of Average Annual Contribution, Representative Posi- 
tion and Perceived Social Status 


Model 1a Model 1b Model 1c 
Contributions Representative Perceived 
Status 
Endogenous Variables 
Annual contribution (logged) E X gos 
(.05) (.03) 
Representative position -.61 157 
(.48) (.05) 
Perceived social status -.50 
(.30) 
Personal Characteristics 
Male -.26* -.01 
(.12) (.08) 
Average age” (30-39) Dr 24 -.04 
(.24) (15) (.07) 
(40-49) oM A6** -.07 
(.32) (.15) (.08) 
(50-72) rt 41* .06 
(.33) (.17) (.09) 
Higher education .20* .00 
(.10) (.06) 
Party member .20 ‘ates 10 
(.27) (.09) (.06) 
Organizational Characteristics 
Average revenue (logged) oes 01 .00 
(.08) (.05) (.03) 
Average personnel (logged) inp 
(.09) 
Retail sales .34** 
(.13) 
Firm age ' (6-10 years) .69** CARs SIS 
(.25) (.09) (.06) 
(11-13 years) 140% GEN oe 
(.41) (.11) (.07) 
Regional Characteristics 
Rural area 36" 16 14** 
(.17) (.08) (.05) 
Income per capita (logged) - HT 
(.03) 
Total firms in county (logged) -.36^* -.02 
.09 .04 


Notes: (a) Standard errors in parentheses; *p <.05  "*p —.01  *"*p —.001 (two-tailed 
tests); N = 2,621. 
(b) Chi-square test of model fit p — .44. H, (the model fits the data) cannot be rejected. 
(c) Comparison baseline: age 17-29. 
(d) Comparison baseline: firm age 1-5 years. 
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All but 13 percent of the firms in the Chinese sample had made some charitable 
contributions during the lifetime of the firm. As a percent of pretax profit, the 
Chinese charitable giving during the year of 1.4 percent (1.2 percent when adjusted 
for the mean number of employees in all Chinese private firms) exceeded the 0.9 
percent of U.S. corporations. This high level of giving was among firms much 
smaller than those in developed countries and among firms much younger and 
in a more precarious financial situation — all of which makes the high levels of 
Chinese giving even more remarkable. Comparisons of asset size between the 
United States and China suggested that the higher giving as a percentage of 
profit was common across all size levels (Figure 3). In the United States, charitable 
giving rose only slowly to the levels of the 1990s - starting at about 0.3 percent 
in the 1930s and rising to higher levels only in later years (Johnson 1966:493). 
Because Chinese exempting of charitable donations from personal and business 
taxes only began in 1994, one might argue that the proper comparison between 
China and the United States would be the United States in the 1930s when 
federal income taxes were just beginning. At that time, U.S. giving percentages 
were much lower than in Chinese firms in the 1990s. These comparisons are 
consistent with the beginning argument of the paper that Chinese private firms 
were making charitable donations at exceptionally high levels. 


Rewards from Giving 


Our second argument was that through the 1990s, Chinese charitable giving 
brought distinct rewards in political access and social status. Two sets of 
results were consistent with this argument. The first set of results was for 
political position. In these results, charitable contributions significantly increased 
the chances for selection to a political position. Many of the results for other 
background conditions were also plausible, helping to lend greater credibility to 
the results. Entrepreneurs were most likely to become a political representative 
when they were female, older, more educated, party members, running a firm 
that had existed for several years, and residing in a rural area. That female 
entrepreneurs were more likely to become representatives was consistent with 
the Chinese electoral law, which encouraged the People's Congress at all levels to 
increase the proportion of female representatives. Entrepreneurs who were older 
or whose firms had existed for a longer time had advantages because it took 
some time to be noticed and to establish a network of social and political ties. 
Entrepreneurs with higher education were more likely to be selected because by 
the early 1990s the Chinese government hoped to increase the educational level 
of the representatives in political councils. Party members were advantaged for 
Several reasons, including additional connections to government officials. Firm 
revenue by itself was insignificant in terms of political office, which is consistent 
with the interpretation that contributions were the channel by which potential 
political influence was translated into actual political influence. 

Rural entrepreneurs were somewhat more likelyto be picked as representatives. 
With few competing entrepreneurs or high prestige occupations in a rural 
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locale, rural entrepreneurs were more likely to be seen as “social elites.” This 
interpretation is consistent with the tendency of these two groups (resident in 
poorer locales and in the countryside) to report higher perceived social status 
— suggesting that they felt like “big fish in a small pond.” 

Increase in perceived social status was the second major reward to increased 
giving. In the structural equation model, net of other background conditions, 
both contributions and representative position provided significant boosts to 
perceived social status. Perceived social status was linked to other background 
characteristics in intuitively meaningful ways. Specifically, perceived social status 
was higher when the firm had the added security of longevity. The indirect effect 
of party membership on perceived social status via representative position 
was significant (p < .01), which was consistent with the advantages of party 
membership being translated into higher perceived social status by increasing 
the likelihood of political office. The insignificant effect of revenue was consistent 
with an argument that size alone didn't matter unless this size was expressed 
through contributions. 

Finally, regarding the feedback effect from representative position to giving, 
one might expect a political representative to give more — because of his or her 
visibility and social pressure from peers (Galaskiewicz 1985). In the structural 
equation results, however, representative position failed to significantly increase 
or decrease contributions. This pattern is consistent with several possibilities. 
First, political councils meet as infrequently as once a year, reducing possible 
peer pressure. Second, giving is solely instrumental. After entrepreneurs had 
given enough to be selected to a position, they could give less and were unlikely 
to give more. 


Political Consultative Conference 


The Chinese People's Political Consultative Conference (CPPCC, zhengxie), 
operating at national, provincial, city and county/urban district levels, is the 
other political council that may bring political connections and reputation to its 
members. Founded by the party in 1949 to build alliances with all strata of the 
society (Dreyer 1993:105), the CPPCC advises governments at every level. The 
CPPCC does not have legislative functions, so it is even more symbolic than 
the People's Congress (Saich 2001). CPPCC representatives are selected by a 
standing committee often led by a retired senior official at the corresponding 
level; thus even at county/urban district levels there is no direct CPPCC election. 

Twenty-seven percent of the respondents in this sample were representatives 
in a CPPCC. In the structural equation model in Table 2, when positions in 
the People's Congress (PC) were replaced with positions in the CPPCC, the 
relationships between contributions, representative position and perceived social 
status remained similar — though the model had a poor fit (p ^ .02, meaning 
that the null hypothesis stating that the model fits the data was rejected). For 
a one unit increase in logged contributions, the latent continuous variable 
underlying being a CPPCC representative increased by .22 (p « .001). One unit 
increase in logged contributions and being a CPPCC representative increased 
an entrepreneur's perceived social status by .06 (p « .05) and .23 (p « .001) 
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respectively. The effects of representative position and perceived social status on 
contributions were insignificant. Regarding indirect effects, a one unit increase 
in logged contributions increased perceived social status by .05 via being a 
CPPCC representative (p « 0.001). None of the other three indirect effects, from 
representative position to perceived status via contributions, from perceived 
social status to representative position via contributions, or from representative 
position to contributions via perceived social status, was significant. 


Trends 

What we captured from the 1995 national survey was a historical moment near 
the birth of the Chinese private sector. A major issue of market transition debates 
is whether political power remained central in the Chinese reform era. Extending 
the research scope from income and income inequality to social prestige, our 
research suggests that the government under party control redistributed political 
access and even social prestige via granting positions in political councils. 
Charitable: donations in exchange for representative positions also imply that 
government-business relations were becoming more reciprocal in the mid-1990s. 
Thus, there were some concessions to the business sector in the 1990s. 

These findings suggest several possibilities about long-term developments, 
one of them being that private entrepreneurs might gradually gain political power 
through the “side door.” People’s Congress might not remain merely symbolic. 
Drawing on the history of legislatures in Europe, O’Brien (1994) suggests that 
cooperation with a strong political authority is necessary for early legislative 
development, and that People’s Congress might follow the European trajectory 
— playing a larger role in governance as time goes on. If this comes to pass, 
then private entrepreneurs on political councils might use their positions to 
pursue larger "class" interests (e.g., protecting the economic interests of private 
business) rather than just status enhancement and individual networking with 
government officials. 

Now another decade has passed. A revision to the Chinese Constitution gave 
private ownership constitutional recognition in 1998. Revised party rules let 
private entrepreneurs join the party starting in 2001, and evidence is accumulating 
that the largest business are having some success in lobbying efforts (Kennedy 
2005). With these changes, we expect that the average social status of private 
entrepreneurs has increased. If more and more private entrepreneurs are co- 
opted through membership in the party, the temptation of entrepreneurs to 
pursue their collective interests through autonomous political organizations will 
be reduced and the movement to a plural polity delayed. 

Two events in 2005 suggest that some things remain constant. First, Dickie 
(2005) reported that in China's northern province of Shanxi not only did a Chinese 
court refuse to hear a lawsuit by private entrepreneur and peasant investors 
whose oil wells worth about $800 million were seized by the state, but the 
police also detained their lawyer. This failed initiative led by private entrepreneurs 
suggests that private entrepreneurs had begun to act collectively to protect their 
interests, not through the channels expected by the government such as political 
councils and semi-official business associations, but through legal procedures. 
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Nevertheless, the rebuff to their initiative implies that local governments still 
deal harshly with this type of autonomous action. Had this initiative been led 
by private entrepreneurs who were representatives in political councils, it is 
uncertain whether they would have met the same fate. 

Second, Jian (2005) reported that a mining disaster resulting from illegal 
mining killed 123 miners in Guangdong Province, just north of Hong Kong. The 
owner was an active philanthropist and a political representative in the People's 
Congress at both the local and regional levels with extensive connections to 
government officials. Although his failure to protect the miners was obvious, 
the owner was not arrested until police had obtained approval to do so from the 
political council. The disconnect between his perverse treatment of workers and 
his generous public charity is consistent with the argument that the motivation 
for giving among many entrepreneurs continues to be for political benefit rather 
than social welfare. 

These events also suggest a bifurcation among entrepreneurs: one group 
has begun organizing independently and pursuing collective interests via legal 
channels and another has continued more traditional patronage patterns (see 
Kennedy 2005). A promising study would be to compare these two groups, 
focusing on their interaction with the public and the government. 


Other Societies 


Another issue is whether the patterns found in China (and in Europe during earlier 
periods) might be repeated elsewhere today. If our findings are correct, private 
entrepreneurs will seek additional channels of influence, including corruption and 
charitable contributions, when open elections are absent and when relations with 
the government bureaucracy are uncertain. Many of these situations still occur 
around the world. According to one data set for 2002, a fifth of the world's 150 
independent regimes significantly restricted political participation and scored 
low on measures of democratization (Marshall and Jaggers 2003), implying that 
businesses in these states might pursue alternate channels to gain influence over 
government policy. Many of these societies are similar to China in administrative 
burdens (e.g., days spent in dealing with regulatory officials), which could motivate 
special entrepreneurial efforts to gain political access (World Bank 2005). The 
Chinese case also invites comparison with the transition economies of Eastern 
Europe and the former Soviet Union, where corruption contributes to problems 
of state captured by business (Hellman, Jones and Kaufmann 2003; Kornai and 
Rose-Ackerman 2004). 


Summary and Conclusion 


Our results support the hypothesis of special moments in the history of business 
relations between the government and the public. Much as in pre-revolutionary 
France or in other developing societies, the emerging Chinese private business 
sector of the 1990s was more active in charitable giving than one would expect 
based on the experience of other countries suchas the United States. In the absence 
of a significant non-profit sector and with the government's own desperation for 
funds to continue its legitimating public welfare efforts, most donations continued 
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to befunneled through government bureaus. The circumstantial evidence suggests 
that in order to get those donations, the government rewarded contributors with 
appointments to political councils. These appointments, in turn, provided private 
entrepreneurs with a heightened sense of social status. This institutionalized 
reciprocity between government and private business was possibly more 
stable than interpersonal clientalism built on corruption. Linking business with 
government and civil society, charitable contributions helped oil the connections 
between societal groups. These patterns invite inquiry in other societies, with an 
eye to special moments that lead to unique collaborative arrangements among 
government, private business and society. 


Notes 


1. From 1986 through 1994, congressional term at the township and county/ 
urban district levels were three years, and at the city and higher levels, 
five years. 


2. The means in Table 1 are for the sample, which includes larger-than-average 
firms. To approximate the means for a population with the national average 
firm size of 15, we regressed several dependent variables on firm size and 
firm size squared. At firm size 15, this produced the following predicted 
averages: annual contribution = 5.8 (or $38 U.S.); representative = .07; 
perceived social status — 6.4. 
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Environmentalism, Globalization and 
National Economies, 1980-2000 


Evan Schofer, University of Minnesota 
Francisco J. Granados, Southern Methodist University 


It is commonly assumed that environmentalism harms national 
economies because environmental regulations constrain economic 
activity and create incentives for firms to move production 
and investment to other countries. We point out that global 
environmentalism involves large-scale institutional changes that: (1) 
encourage new kinds of economic activity and (2) reconstruct economic 
value such that environmental protection is rewarded in the market. 
We employ cross-national panel analyses to examine the effects of 
national environmentalism on economic growth, trade, industry and 
investment. We find that pro-environmental countries fare better in 
terms of economic growth, investment and size of the industrial and 
service sectors. We find no impact of environmentalism on foreign 
investment and trade. Firms and investment do not appear to be fleeing 
countries with strong environmental standards. 


Introduction 


Environmental policies are often resisted on the grounds that they will cost 
jobs and slow economic growth. Economic globalization heightens such fears 
because firms can easily move investment and production to other countries 
should local environmental regulations become too onerous (Cobb and Daly 
1989; Hansen-Kuhn 1993; Korten 1993; Thrupp 1994). Countries that choose 
to enact strong environmental protections may experience flight of firms and 
investment, damaging the economy. Yet, empirical studies on the economic 
impact of environmentalism have proven equivocal. 

This paper draws on a range of sociological literatures — including ideas 
from organizational and economic sociology, environmental sociology and 
neo-institutional theory — to better understand the economic effects of 
environmentalism ina globalized world. We argue that global institutionalization 
of environmentalism may drive the growth of new industries, reduce 
comparative advantages of polluters and, over the long term, reconstruct 
economic value, generating economic benefits for pro-environmental 
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countries. A systematic cross-national analysis addresses the question: Does 
environmentalism harm national economies? This research uses improved 
measures of national environmental activity and examines a wider range of 
economic outcomes than prior studies. 


Economic Costs and Capital Flight: Environmentalism as a Detriment to Growth 


Opponents of environmental regulations often argue that constraints on firms 
and economic activity reduce economic growth.' Basic economic theory holds 
that capital investment is a primary source of economic growth (Barro and Sala-1- 
Martin 1995). Environmental regulations can decrease the efficiency of invested 
capital and discourage investment altogether? For example, air pollution laws 
that require expensive pollution abatement equipment necessitate greater capital 
investment to achieve a given level of economic output, reducing the incentive 
to invest. Laws prohibiting or taxing the utilization of available resources have a 
similar effect. For example, environmental regulations prevent the development of 
productive land (wetlands, forests), the use of certain fuels (e.g., high-sulfur coal), 
the emission of particular chemicals, and so on. This directly halts economically 
productive activities or lowers profitability by requiring the use of more expensive 
fuels and production methods? 

The costs of environmental regulation can be quite large. By one estimate, - 
the implementation of the Kyoto Protocol limiting CO2 emissions may reduce 
economic output by $100 billion (in 1990 U.S. dollars) annually for Western 
Europe, and on the order of $50 billion for Japan alone (Li 2000:164). Depending 
on the circumstances, Kyoto may cost the typical nation between 0.5 percent 
and 1 percent of total GDP (Li 2000:168). 


Economic Globalization, Trade and Capital Flight 


Economic globalization, which includes increasing flows of trade, capital and 
investment across national borders, adds another layer of complexity (Castells 
2000; Cerny 1996; Keohane and Milner 1996; Stopford and Strange 1991). While 
proponents of globalization herald economic integration as a boon to economies, 
critics point out several adverse consequences (see Held et al. 1999). First, 
firms in pro-environmental countries bear additional costs and therefore cannot 
produce goods as cheaply as firms in countries with lax environmental standards 
- creating a disadvantage in the global marketplace that may reduce profits and 
growth (Levinson 1996). Second, the growth of trade and increasing mobility of 
capital allows corporations to relocate investment and production to nations with 
weaker environmental laws. Investment and industrial activity may shift from pro- 
environmental nations to less-environmental nations reducing economic growth 
of the former (Baumol and Oates 1988; Conrad 1993; Chichilnisky 1994).^ Loss 
of investment harms growth, and in addition the flight of production implies that 
pro-environmental countries will export less and be forced to import more. 
These arguments suggest the following: First, pro-environmental countries 
will experience lower economic growth and the decline of industrial production 
compared to nations that are environmentally lax. Likewise, pro-environmental 
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countries will accrue less capital investment, from both domestic and foreign 
sources. Instead, investment will flow into the pollution havens of the world. 
Finally, the loss of investment and flight of production will shift the balance of 
trade, such that pro-environmental countries will export less and import more. 

Environmental sociologists steeped in world-system theory have similarly 
argued that highly-polluting industries will relocate as the core exports its 
environmental problems to the periphery (Frey 1995). However, world-system 
scholars doubt that such foreign investments will promote growth in the periphery 
(but see Firebaugh 1996 for a critique).* 


Previous Research 


The empirical literature, which mainly focuses on industry relocation 
and employment, is conspicuous in its overall lack of clear evidence that 
environmentalism harms national or regional economies (Tobey 1990; Low 
and Yeats 1992; OECD 1993; see reviews in Dean 1992; Levinson 1996). For 
example, Leonard's (1984, 1988) cross-national studies observed that flight of U.S. 
corporations and capital abroad due to pro-environmental regulations occurred in 
only a small handful of highly polluting industries and had little economic impact. 
Robison (1988) finds that the increasing pollution abatement costs have had a 
negligible effect on the U.S. international trade balance. Studies of corporate 
relocation and trade between the United States and Mexico concluded that 
environmentalism in the United States played, at most, a minor role in shifts to 
Mexico (Grossman and Krueger 1991; Low 1992). Indeed, a number of studies 
find a positive relationship between environmentalism and the ability of states 
within the United States to attract firms and jobs, resulting in overall economic 
benefits (Templet 1995; Feiock 1998; Feiock and Stream 2001; and others; but 
see List and Kunce 2000). Only a few studies find negative economic effects. 
Clapp (1998), for example, points to a series of case studies of multinational 
corporations that might suggest a trend of increasing relocation of highly polluting 
industries to developing countries (also see Baumol and Oates 1988). 


Environmentalism as a Source of Growth 


Efficiency, Investment and the Rise of New Industries 


Organizational and economic sociologists have noted that environmentalism 
can prompt a variety of new investments and increased organizational efficiency 
(Hoffman 1998). Ecological modernization theorists point to similar dynamics as 
part of their general argument that environmental restructuring is fundamentally 
reshaping economic institutions of modern societies (Mol 2001; Mol and 
Sonnenfeld 2000; Weidner and Janicke 2002). 

In a static economic analysis, environmental policies confer harmful costs that 
require additional capital investment, reducing the efficiency of capital. Over the 
long term, however, the very requirement for additional capital investment can 
spur new economic growth and the creation of entire new industries. That is, 
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environmental policies and laws require firms to invest in pollution abatement 
equipment or cleaner technologies and thereby create demand for new products 
and technologies.” The additional investments required by environmental policies 
may not always benefit the firm, but they certainly drive demand for new sorts of 
environmental technologies and services. 

Entire new industries in pollution abatement, remediation, environmental 
consulting and recycling have grown to satisfy demand created by environmental 
policies. As one scholar writes: “The global clean-up industry, for which the U.S. 
is the largest market, grew some 20 percent per year in the 1980s, and has been 
valued at some US$250 billion — US$300 billion annually in the early 1990s.” (Clapp 
1998, citing Lucas 1995:29; Schwartz 1994:136) Weidner and Janicke observe 
that 3.6 percent of the entire German workforce is employed in the environmental 
sector, which is greater than employment in the entire German auto industry 
(Weidner and Janicke 2002:424). The growth of these new industries has the 
potential to create new jobs and enhance overall national economic growth. 

Furthermore, environmental requirements force organizational innovation 
and change in production processes and technology - overcoming inertia and 
sometimes prompting significant innovation and improvements in efficiency. 
Environmental laws force firms to close old and inefficient factories and eliminate 
the use of inefficient production methods, shifting investment into newer 
"clean" technologies that are often more efficient (Wheeler and Martin 1992; * 
Rock 1996; Angel and Rock 2000). More generally, environmental regulations 
create incentives to develop efficient production methods over more wasteful 
ones, reducing costs spent on raw materials and fuels and increasing profits 
(Van der Veen and Wilson 1997; Gunningham et al. 2003). Innovations directed 
at reducing greenhouse gas emissions have resulted in efficiency improvements 
in energy and transportation systems (Hoffman 1998). Thus, the technological 
modernization demanded by environmental policies can produce a variety of 
economically beneficial effects.? 


Economic Sector Variation: Industry, Service and Agriculture 


Improvements in production efficiency should particularly affect capital intensive 
enterprises - e.g., in the industrial sector - where adoption of new processes and 
technologies yield large increases in productivity. Energy industries, in particular, 
have seen dramatic improvements (Hawken, Lovins and Lovins 1999). New 
industries offering pollution control technologies and environmental abatement 
services should mainly boost the industrial and service sectors. The service 
sector may also benefit from the rise of new services such as recycling and 
environmental consulting. 

Agriculture, on the other hand, seems least likely to benefit from new 
environmental laws. In many countries, agriculture is not capital intensive 
and has a very low profit margin. Moreover, many international environmental 
treaties and organizations specifically curtail agriculture by placing limits on 
deforestation, fisheries and so on (Burhenne 1997). While it is common to 
assume that environmentalism targets industry, the agricultural sector may be 
affected more significantly. 
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Whereas critics of globalization foresee the loss of national sovereignty, 
qeclining environmental standards and capital flight, scholars from various 
“transformationalist” perspectives (Held et al. 1999) argue for a more multi- 
faceted approach to globalization. Globalization is not purely economic, but 
also involves other changes such as the expansion of global institutions and 
governance, which may have important implications for both environmentalism 
and national economies. 

Environmental sociologists and neo-institutional scholars have observed rapid 
expansion of trans-national mobilization around environmental issues (Wapner 
1995; Mol 2001; Frank et al. 2000). Neo-institutional researchers have used 
quantitative data to trace the growth of international environmental treaties 
and organization (Meyer et al. 1997), as well as the global diffusion of various 
environmental policies, laws and activities (Frank et al. 2000, Frank 1997). 
Moreover, this international pro-environmental mobilization has been associated 
with observable reductions in some kinds of environmental degradation and with 
the spread of new cultural understandings regarding the natural environment to 
firms and individuals around the world (Hoffman and Ventresca 2002; Schofer 
and Hironaka 2005). 

In sum, recent decades have seen increasing global institutionalization of 
environmentalism (Frank et al. 2000). This entails both legal/regulatory dimensions 
that encourage or require states, firms and individuals to be more environmental, 
as well as cognitive/cultural dimensions that define environmentalism as an 
important and taken-for-granted social good (Meyer et al. 1997). 

The global institutionalization of environmentalism has direct implications 
for economic activity. First, institutionalization has generated increasing global 
isomorphism — the diffusion of similar environmental policies and laws to nations 
around the world. Second, institutionalization has taken the form of new pro- 
environmental treaties and laws, normative frameworks, standards and regulatory 
schemes, which have the potential to transform markets and economic value. 

The global diffusion of similar environmental policies and laws reduces the 
threat of capital and industry flight, and presumably reduces deleterious effects 
on growth.? As nations enact similar environmental laws, the potential advantage 
of moving capital or production to another country decreases. Moreover, the 
global spread of environmentalism can confer advantages on nations with the 
most aggressive environmental policies and/or early adopters of such policies. 
Early adopters will be the first to develop new technologies, industries and 
services related to environmental problems. Diffusion of environmental laws 
around the world will expand global demand for exactly those technologies and 
services - benefiting the economies of early adopters (Stuligross 1999; Weidner 
and Janicke 2002:424). 

More importantly, the normative frameworks, laws, standards and regulatory 
schemes associated with institutionalization have a transformative effect on 
markets and economic value. Economic sociologists point out that economic 
activity and markets are embedded in socially constructed laws, norms, principles 
and institutions (Fligstein 2001; Guillén et al. 2002; Smelser and Swedberg 2005). 
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The success of environmentalism, as an institution, involves eollective shifts 
in laws and cultural assumptions that reconstruct economic value, production 
and consumption, rendering environmental practices more valuable. Ecological 
modernization theorists make this point explicitly (Mol 2001). The historically 
high value of ivory, for example, made it lucrative to engage in the slaughter 
of elephants. Collective treaties and agreements to ban trade in ivory makes 
the killing of elephants less economically rewarding by wiping out most of the 
market demand for ivory.'? Ironically, some African countries, desperate for 
resources, now have large elephant populations but cannot profit because there 
is no longer an international market for ivory. In sum, institutional and regulatory 
changes transform the economic value of ivory. 

Likewise, schemes to implement various forms of tradeable pollution emission 
credits establish the value of environmental degradation (formerly an externality), 
making it "valuable" to not pollute (Levin and Espeland 2002; Li 2000). The 
institutionalization of environmentalism reconstructs value on a global scale, 
reducing the economic rewards of degradation and increasing the economic 
gains from environmental protection. 

In addition, the global institutionalization of environmentalism has the potential 
to reshape the practices of private-sector firms. The International Standards 
Organization, for example, promotes the ISO 14000 series of standards, which 
encourages environmentally responsible management practices (Mendel 2002). ` 
ISO certification serves as an indicator of legitimacy to other firms, governments 
and multilateral institutions, and is sometimes required to access national and 
multinational markets (Rock 1996). While ISO 14000 is a modest set of initiatives, 
others propose more radical restructuring of business activity that may yield 
greater changes in the future (see Hawken 1993). To the extent that adherence 
to environmental standards and laws becomes normatively or legally required, 
environmentally sound procedures become a requisite for profit. There is some 
evidence of this. Not only do firms often conform to new policies and legitimated 
rules related to the protection of the natural environment, but they sometimes 
experience economic benefits as a result (Hoffman 1997; Prakash 2000; Hoffman 
and Ventresca 2002; Levy and Rothenberg 2002). 

Global environmentalism also affects individuals, reconstructing consumption 
and economic demand. The environmental movement is associated with new 
cultural views on the natural world, which become institutionalized and spread via 
mechanisms such as school curricula that stress the importance of environmental 
protection (Midling 1996). As a consequence, individuals increasingly "prefer" 
certain products and services (e.g., dolphin safe tuna, organic vegetables) and 
not others (e.g., genetically modified [GMO] crops, fur, cosmetics that are tested 
on animals, etc). Indeed, some firms have embraced environmentalism, both 
as a matter of public relations and sometimes in terms of products, suggesting 
significant changes in consumer preferences (Hoffman 1997). 

The global institutionalization of environmental protection and the resulting 
reconstruction of economic value are recent dynamics that are largely incomplete. 
Many forms of environmental degradation are not yet regulated or commodified: 
many firms do not conform to environmental norms and standards; many 


Environmentalism and National Economies ° 971 


individuals do not prefer environmentally sound products. Yet, there have been 
rapid changes in the direction of greater environmental protection. Each step in 
that direction shifts economic advantage toward pro-environmental countries and 
away from those that are lagging in terms of environmental policies and laws. 

These sociological arguments suggest that environmentalism may generate 
beneficial economic effects (or at least reduce the negative effects predicted 
by the prior literature): /ncreased investment and efficiency forced by new 
environmental requirements, in conjunction with the global institutionalization 
of environmentalism, may result in increased growth among pro-environmental 
nations. Moreover, the increased investment and efficiencies may actually lead 
to greater industrial activity. The global institutionalization of environmentalism 
results in isomorphism, reducing or possibly eliminating the flight of industry 
and capital from pro-environmental nations. Consequently, there is no reason to 
expect unfavorable shifts in the balance of trade: /mports will not increase nor 
will exports decrease among pro-environmental countries. 


Data and Methods 


Cross-national statistical analyses are used to determine the relationship 
between environmentalism and several aspects of national economic activity: 
national economic growth, capital investment, level of industrial activity (and 
other-sectors), foreign investment and trade. Economic outcomes are modeled 
for roughly 100 countries from 1980 through 2000, when high quality data on 
environmentalism is available. 


Measuring Environmentalism 


We constructed an index from seven measures of governmental, legal and 
organizational aspects of environmentalism. These measures represent the best 
cross-national data on environmentalism that are available for a large sample of 
nations during the time period of the study.'! The environmental measures are: 


(1) Environmental treaty ratifications (logged) reflecta nation's 
commitment to global environmental concerns that have 
been formally ratified as national laws. Major international 
treaties address issues such as CFC emissions, water 
pollution, protection of endangered species, deforestation 
and others. Burhenne (1997) contains information on the 
total number of treaties ratified by each nation. 


(2) National environmental impact assessment laws are 
another measure of national environmental commitment. 
These laws require governments and firms to assess 
potential adverse environmental effects of new construction 
(e.g., new highways, factories, etc.), and - due to the public 
nature of environmental assessment reports — provide 
an important opportunity for environmental groups to 
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scrutinize and oppose development projects (Hironaka and 
Schofer 2002; Hironaka 2002). The presence of these laws 
is measured using a dummy variable coded 1 for nations 
with this kind of legislation (Frank et al. 2000). 


(3) National environmental ministries are governmental 
bodies devoted to environmental protection and represent 
state commitment to environmental issues. The existence 
of a ministry is measured by a dummy variable coded 1 for 
nations with such a governmental body (Frank et al. 2000). 


(4) National participation ininter-governmentalenvironmental 
organizations (logged) reflects national commitment to 
international environmental concerns and often places 
nations under binding international rules regarding 
environmental issues. Inter-governmental organizations 
address air pollution, fisheries, endangered species and 
many other topics. Our measure reflects the total number 
of inter-governmental associations memberships held by a 
nation (UIA 1994). 


(5) Participationininternationalnon-governmentalenvironmental 
associations (logged) capturesenvironmentalismamongcitizens. 
International environmental NGOs, such as Greenpeace, play 
a role in monitoring countries, protesting, and encouraging 
domestic environmental mobilization (Schofer and Hironaka 
2005). National participation is measured by the number of 
international environmental associations that have one or 
more members residing in a given country (UIA 1994). 


(6) Domestic non-governmental environmental organizations 
(logged) also measures environmentalism among citizens. 
Domestic environmental organizations contribute to 
environmental protection by lobbying, initiating legal action 
against corporations, and conducting protest activities. 
Domestic organizations are coded from an international 
compendium of environmental organizations (Sierra Club 
1976/12 


(7) Number ofecologyscience organizations (logged) reflects 
the level of scientific mobilization around environmental 
problems, which has been shown to influence pro- 
environmental political and legal activity (Frank et al. 2000; 
Sachs 1990). š 
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These seven measures provide a comprehensive picture of government 
and citizen commitment to environmental issues, capturing both legal and 
policy dimensions (national laws, ministries, etc.) as well as non-governmental 
mobilization. Correlations among these measures vary from around .2 to nearly 
.9, with most correlating at .5 or more. They are combined in a single overall 
index of environmentalism by adding the z-scores of each measure.'? 


Dependent Variables: Economic Growth, Investment, Industry and Trade 


Economic Growth is measured as the annualized change in 
real Gross Domestic Product per capita (multiplied by 100 
to yield percentages), consistent with the prior literature 
(Barro 1991; Barro and Sala-i-Martin 1995; Source: Penn 
World Table 6; see Heston, Summers and Aten 2002). 


Investment is measured in two ways: as the total amount 
of capital investment as a percentage of GDP and also per 
capita (Penn World Table 6). 


Industrial Activity, Agricultural Activity and Service Activity. 
Industry is measured by the value added to GDP in constant 
U.S. dollars by the industry sector, per capita (logged) 
(World Bank 2001). We also look at similar measures of 
the agriculture and service sectors. Models in Appendix 
B examine the relative size of sectors, measured as a 
percentage of GDP (World Bank 2001). 


Foreign Direct Investment is measured as change in net 
inflow of foreign investment in constant U.S. dollars as a 
proportion of country’s GDP (World Bank 2001). 


Exports are measured by total exports of goods and 
services in constant U.S. dollars, per capita (logged) (World 
Bank 2001). 


/mports are measured by total imports of goods and services 
per capita in constant U.S. dollars, per capita (logged) (World 
Bank 2001). 


Models and Control Variables 


Economic growth and other dependent variables are modeled using OLS 
regression (Barro and Sala-i-Martin 1995; Levine and Renelt 1992). Economic 
change is measured over about a 20-year period, from 1980 through 2000. Twenty 
years is the largest span of time possible given available data, and reflects a time 
lag similar to other analyses of economic growth and change (Barro and Sala-i- 
Martin 1995: Schofer et al. 2000).** The actual span of each model depends on 
the availability of data, with most models ending in either 1998 or 1999. 
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Economic growth is measured as annualized change between times t, and 
t, with independent variables measured at t, (Barro and Sala-i-Martin 1995). 
Similar to panel models, this approach avoids the sensitivity to reverse causality 
that can occur in cross-sectional OLS models. This approach yields results that 
are consistent with panel models, except in the interpretation of the lagged 
dependent variable. 

All other dependent variables are modeled using panel analysis. Dependent 
variables are measured raw (not annualized change) at t,, with independent 
variables (including the lagged dependent variable) measured at t.. 

To simplify presentation, models have a standard set of control variables. 
Additional variables tended not to be statistically significant and did not 
substantially alter the results. This is consistent with prior research by economists 
such as Levine and Renelt (1992) who find that very few control variables were 
needed to develop reasonably well-specified models of economic growth. Many 
variables commonly assumed to benefit economic growth — such as measures of 
technology or tax rates — do not actually affect economic growth in cross-national 
models (see Levine and Renelt 1992; Barro and Sala-i-Martin 1995). Per the 
recommendations of Levine and Renelt (1992), the base model includes control 
variables for initial level of economic development, investment and education. 
Also included is a control for trade because it has a significant effect on several 
of the dependent variables analyzed.'* Similar control variables are also used in ` 
models of other economic outcomes: investment, economic sector variables, 
foreign investment and trade. 

The basic control variables follow: 


Initial Level of Dependent Variable. A control variable for 
initial level of the dependent variable was always included. 
If appropriate, the variable was logged. For example, in 
models of economic growth (measured 1980-1998), there is 
a control for logged GDP per capita in 1980. In panel models 
of foreign direct investment in 1999, there is a control for 
the initial level of foreign direct investment in 1980. 


GDP Per Capita, logged. A control for initial level of 
economic development (logged) was added in models of 
other economic outcomes. 


Investment. Economic growth models include a measure 
of total investment as a percentage of GDP a variable 
known to drive economic growth (Barro 1991). Investment 
is also included as the lagged dependent variable in models 
predicting investment, and as a control variable in models of 
foreign investment, trade and economic sector expansion. 


Education. Education is thought to increase work force 
productivity, which benefits economic growth, trade and 
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other economic variables (Psacharopolous 1984). Education 
is measured with the secondary enrollment ratio, which is 
the number of students attending secondary school divided 
by the population of the relevant age. 


Trade. Trade openness, measured as imports plus exports as 
a percentage of GDP is commonly brought up in discussions 
of economic growth, investment and industrialization. In 
practice, trade tends not to affect growth, but it is included 
here to be consistent with other studies (Barro and Sala- 
i-Martin 1995). However, trade is expected to affect other 
economic outcomes such as foreign investment. 


Methodological Checks 


Additional control variables were included in the analyses to ensure that results 
are robust to variations in model specification and not due to omitted variable 
bias. To assess whether it was appropriate to lump together heterogeneous 
types of countries (based on level of development and geographic region), 
subsets of the sample were examined. Results were generally consistent. Also, 
standard methodological checks were performed for conditional normality, 
heteroskedasticity, multicollinearity, outliers, etc. In general, there were few 
serious methodological concerns except for outliers. China was an outlier in 
many of the models. To be prudent, we opted to remove China from all analyses 
(though doing so did not change the results). 


Results 


Table 1 shows the results of four OLS regression analyses of growth in GDP per 
capita. Model 1 is the base model, which includes measures most commonly 
used in the economic growth literature. Results are similar to prior research. 
Investment and secondary education have a positive and significant effect on 
economic growth, while initial GDP per capita (1980) has a negative effect, 
indicating a relative convergence of national economies - large economies grow 
more slowly.'® Trade has a non-significant effect, consistent with the literature. 

Model 2 in table 1 adds the index of national environmentalism measured in 
1980. Environmentalism has a positive and significant effect on economic growth 
from 1980 through 1998. National environmental policies, laws and organizations 
are statistically associated with more, not less, economic growth in this time 
period. The coefficient is strongly significant at the .01 level and substantial in 
size, with a standardized coefficient of .379. Model 3 replicates the results using 
a different measure of GDP per capita, based on World Bank data from 1980 
to 1999 (World Bank 2001). Moreover, results were consistent in analyses that 
included additional control variables, such as: democracy, various demographic 
measures, foreign investment, tertiary education, civil war, international war, 
corruption, political unrest, political rights and others. 
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Table 1: OLS Regression Model: The Effects of Environmentalism on Economic Growth, 1980-1998 


Vom Le s e mL Me WM EAE ensem ME ERE ee 
Independent Variable Model 1 Model 2 Model 3 Model 3 
WDldata Industrialized 
West Excluded 
GDP per capita in 1980 (log) -.396 -.698* -.165 -.714 
(.328) (.344) (.230) (.389) 
Investment as share of GDP in .070* .049 .063* .048 
1980 (x 107) (.029) (.029) (.029) (.035) 
Trade as share of GDP in .003 .006 .003 .004 
1980 (x 107) (.004) (.004) (.004) (.004) 
Secondary school enrollment 10225 .018 .004 .020 
in 1980 (x 10°) (.011) (.011) (.011) (.013) 
Environmentalism in 1980 — 124% . 4055 . 156: 
(.045) (.042) (.062) 
Constant 1.87 4.28 .221 4.41 
(2.30) (2.46) (1.32) (2.79) 
Adjusted R? .193 .245 .253 .207 
Number of countries 100 95 96 T8 


*p<.05 *p<.01 **p<.001 (two-tailed tests) 
Note: Standard errors in parentheses 


Table 2: OLS Regression Model: The Effects of Environmentalism on Investment, 1980-1998 


Independent Variable Model 5 Model 6 
Investment Investment 
(% of GDP) (per capita) 
Investment as share of GDP in 1980 204 s — 
(.069) 
Investment per capita in 1980 — 1:99 5 
(.095) 
GDP per capita in 1980 (log) Stan -159.302 
(.821) (162.203) 
Trade as share of GDP in 1980 027% 11.208 *** 
(.009) (1.582) 
Secondary school enrollment in 1980 .020 -2.440 
(.027) (4.550) 
Environmentalism in 1980 220% ORE 
(.107) (19.379) 
Constant -20.063*** -499.986*** 
e (5.872) (1192.135) 
Adjusted R .156 .925 
Number of countries 95 95 


*p«.05 *p<.01 **p<.001  (two-tailed tests) 
Note: Standard errors in parentheses 
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Model 4 in table 1 excludes industrialized countries." Results again show a 
positive and significant effect of environmentalism on economic growth. Indeed, 
the unstandardized coefficient is larger for nations outside the industrialized 
world than it is for the full sample (.156 vs. .124). 

Table 2 examines two different measures of investment: investment as a 
percentage of GDPand investment per capita.'? In both cases, environmentalism 
has a positive and statistically significant effect on investment. Nations that 
were more environmental in 1980 had higher levels of investment in 1998, 
controlling for the initial level of investment and other factors. Environmentalism 
does not deter investment, but rather is associated with higher levels of 
investment within nations. 

Table 3 examines the impact of environmentalism on the industrial sector, 
which may suggest whether industry flees from pro-environmental nations. We 
also look at the service and agricultural sectors as points of contrast. In Model 
7 we see that environmentalism has a positive effect on the industrial sector.'? 
Countries that are more pro-environmental appear to gain, rather than lose, 
industrial activity. 

Models 8 and 9 look at the service and agricultural sectors respectively. 
Interestingly, environmentalism is associated with greater growth in the service 


Table 3: OLS Regression Model: The Effects of Environmentalism on Industrial, Service and 
Agricultural Activity, 1980-1999 


Independent Variable Model 7 Model 8 Model 9 
Industry Service Agriculture 
Industrial sector activity per capita (log) in 1980 .669*** el Lo 
(.110) 
Service sector activity per capita (log) in 1980 a Ole 
(.112) 
Agricultural sector activity per capita (log) 1980 e Lä .864*** 
(.069) 
GDP per capita (log) in 1980 .267 .209 -.028 
(.175) (.185) (.077) 
Investment as share of GDP in 1980 (02738 .015 .004 
(.008) (.008) (.005) 
Trade as share of GDP in 1980 .0004 .001 -.003*** 
(.001) (.001) (.001) 
Secondary school enrollment in 1980 .003 .002 .005** 
(.003) (.003) (.002) 
Environmentalism in 1980 .196* .243" .047 
(.077) (.081) (.051) 
Constant -.669 -.608 AEN 
(.907) (.932) (.430) 
Adjusted R° .955 .956 .915 
Number of countries T5 19 76 


*p<.05 **p<.01 ***p <.001 (two-tailed tests) 
Note: Standard errors in parentheses 
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Table 4: OLS Regression Model: The Effects of Environmentalism on Foreign Direct 
Investment, Exports and Imports, 1980-1999 


Independent Variable Model 10 Model 11 Model 12 
Foreign Exports Imports 
Direct 
Investment 
Foreign Direct Investment (% GDP) in -.261 pay KI 
1980 (.358) 
Exports per capita (log) in 1980 us Baro" am 
(.076) 
Imports per capita (log) in 1980 Ka ria B1 
(.087) 
GDP per capita in 1980 (log) 1.77 A57** 5095 
(1.01) (.144) (.152) 
Investment as share of GDP in 1980 -.199* .029** .003 
(.090) (.010) (.011) 
Trade as share of GDP in 1980 13197 TÍ ES 
(.013) 
Secondary school enrollment in 1980 2.05 .008* didis 
(.032) (.004) (.004) 
Environmentalism in 1980 .026 .019 .026 
(.122) (.013) (.014) 
Constant .026 -1.98 -1.95* 
(.122) (.870) (.892) 
Adjusted R° 2411 .940 .920 
Number of countries 90 84 84 


*p«.05 "**p«.0l ***p <.001 (two-tailed tests) 
Note: Standard errors in parentheses 


sector, but not in the agricultural sector. Later we briefly discuss models in which 
agriculture is measured as a percentage of GDP (see Appendix B). Measuring 
sector size as a percentage of GDP allows exploration of re/ative growth in the 
sectors (as opposed to absolute expansion). In such models, the impact of 
environmentalism on agriculture is actually negative and significant, indicating 
that in pro-environmental countries the sector shrinks compared to the industrial 
and service sectors. 

Table 4 examines measures related to economic globalization: foreign 
investment flows and trade. To the extent that environmentalism harms 
economies and/or causes firms and capital to flee, incoming foreign investment 
and exports should decrease, while imports should grow. However, we observe 
no statistically significant effects of environmentalism on any of these variables.?? 
The environmentalism index has a positive effect on foreign investment (in Model 
10), but the effect is very small and not statistically significant. In Models 11 and 
12, there are positive effects of environmentalism for both exports and imports 


(the latter coming close to significance with a p-value of .074). The results do not 
Support a firm conclusion. 
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Discussion 


We find no evidence that environmentalism is detrimental to national economies. 
To the contrary, countries with more environmental laws, organizations and 
activity subsequently perform better on several measures of economic activity: 
their economies grow faster, investment increases, and the industry and service 
sectors expand. 

What processes are responsible for the observed effects? Results in Table 
2 show a clear effect of environmentalism on domestic investment, consistent 
with the idea that environmentalism prompts retooling of production and the 
rise of new industries. However, this cannot be the whole story, as indicated by 
the positive effects of environmentalism on growth even after controlling for 
domestic investment. 

The organizational efficiency argument is extremely plausible and supported 
by a number of prior studies (Wheeler and Martin 1992; Rock 1996; Van der 
Veen and Wilson 1997; Hoffman 1998). Also, the positive association between 
environmental policies and growth in the industry and service sectors is 
consistent with this view. Environmentalism may benefit growth by forcing 
firms to upgrade, modernize and shift to more efficient production processes. 
However, systematic organization-level or industry-level studies would be 
needed to verify these processes directly. 

Neo-institutional theory posits that the global institutionalization of 
environmentalism will alter economic relations, rendering it more economically 
rewarding to be "environmental." Global treaties and policy isomorphism reduce 
the economic rewards of moving production to countries with lax environmental 
laws. Isomorphism can account for the absence of negative effects of 
environmentalism on economic outcomes, but cannot explain positive effects. 
The neo-institutional argument also suggests a longer-term social reconstruction 
of economic value associated with global institutionalization, which can help 
make sense of the observed positive effects. 

Data is not available to comprehensively test the neo-institutional claims, 
but one issue is easily addressed. The neo-institutional view suggests that the 
positive relationship between environmentalism and growth is conditional on 
the existence of international environmental institutions. Therefore, a simple 
comparison contrasts economic growth from 1960 to1980 with the post- 
1980 period.? The global environmental movement is greatly expanded and 
institutionalized in the later period (Frank et al. 2000), and thus pro-environmental 
nations should fare better according to the neo-institutional argument. The effect 
of environmentalism was large and statistically significant in the post-1980 period 
and much smaller in the earlier period.? National environmental policies and 
activities have a stronger effect on growth in the recent period, coinciding with 
increasing institutionalization of global environmentalism. This exploratory result 
provides further support for the neo-institutional view. However, a comprehensive 
study of mechanisms would be needed to draw firm conclusions. 

Environmental sociologists influenced by  world-system theory raise 
another possible interpretation: core countries export highly polluting industry 
to the periphery (Frey 1995) while maintaining economic dominance. Thus, 
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the relationship between environmentalism and growth could be spurious: 
environmentalism does not cause growth, but rather coincides with the economic 
success of core nations. However, broader results do not support this. Findings 
in Table 1 show that the benefits of environmentalism do not accrue only to core 
countries. A large positive effect of the environmental index can be found even 
within a sample of non-industrialized (non-core) nations (in Model 4).** Moreover, 
environmentalism seems to have no relationship to international economic flows 
— such as foreign direct investment or trade. So, lax environmental laws do not 
appear to coincide with foreign capital penetration and dependency. 


Sector Changes: Agriculture vs. Industry and Service 


We also offer some final remarks on the impact of environmentalism on different 
economic sectors. It is common to assume that the costs and constraints 
of environmentalism will harm industry. Our results find the opposite: 
environmentalism is associated with growth of industry. Rather, agriculture is the 
sector that fails to expand rapidly in pro-environmental nations. Recent battles 
over water use by farmers in California or controversies around slash-and-burn 
farming in Indonesia highlight the fact that environmentalism often comes into 
conflict with agricultural production. Environmental protection efforts result in 
the creation of national parks, the abolition of destructive farming practices, 
and the elimination of farm subsidies. Many international environmental treaties 
and organizations specifically address land, agriculture, fisheries and forestry 
(Burhenne 1997; UIA 1994). 

In other words, environmentalism may be more a foe of agriculture than of 
industry. Indeed, when the agriculture sector is measured as a percentage of 
GDP (effectively measuring the size of agriculture relative to other sectors, not 
absolute growth of the sector), the effect of environmental policies is clearly 
negative (see Appendix B).?4 


Conclusion 


Pro-environmental nations experience better economic outcomes on several 
measures, controlling for other factors, than nations with lax environmental 
policies. These findings run counter to a theme articulated by many scholars 
of globalization: that nations must cater to the demands of industry and global 
capital regarding environmental and social policy or be punished by the flight 
of capital and production. Most countries are augmenting their environmental 
protection laws, and doing so does not appear to prompt the flight of capital or 
other adverse economic effects. We observe increases in domestic investment 
and economic growth in pro-environmental countries, and no change in the level 
of foreign investment. Findings support arguments raised by organizational and 
economic sociologists, neo-institutional theory and ecological modernization 
theory. Further research is needed to clarify which mechanisms account for the 
observed effects. Findings also lend credence to so-called "transformationalist" 
perspectives on globalization, which point to complex and multi-dimensional 
consequences of globalization (Held et al. 1999). 
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Results are not sufficientto conclude thatall possible forms of environmentalism 
encourage growth or that this relationship will continue indefinitely. Recent 
environmental efforts put significant emphasis on remediation, which generates 
new industries and prompts investment. Counterfactually, one could imagine 
that a more extreme environmental movement would place stronger constraints 
on production and consumption, leading to reductions in economic activity. The 
relationship between environmentalism and growth is inherently conditional. 
Neo-institutional theory suggests one of the structural conditions necessary 
to observe positive economic effects: namely, when environmentalism is 
institutionalized on a global scale. In the absence of international institutions and 
regulation, environmental policies are more likely to generate negative economic 
consequences. 

These findings are extremely significant for policymakers. Much of the 
opposition to environmentalism stems from perceptions that pro-environmental 
reforms are costly in terms of jobs and national economic performance. While 
one can find anecdotes of environmental regulation that result in lost jobs and 
bankrupt corporations, the aggregate effect of environmentalism does not appear 
to be harmful in the contemporary period. Rather, environmentalism is associated 
with new opportunities in the industry and service sectors (though perhaps less 
so in agriculture). Our results question the common assumption among many 
policymakers that the goals of industry and the goals of environmentalists are in 
zero-sum opposition. 


Notes 


1. These consequences are predicted when regulations are effectively enforced 
(or at least perceived to be enforced by potential investors). Sometimes this 
is not the case (see Lee and Perl 2003; Hironaka and Schofer 2002). 


2. Otherfactors may also come into play. Clapp (1998), for example, identifies 
a number of additional costs associated with environmentalism: increased 
liability and insurance costs, fees in case of damages, costs of protecting 
employees, and even costs of cultivating a “green” image. All of these 
costs may reduce the profitability of firms and decrease the returns on 
capital investment. 


3. Some economists argue that dirty production methods do not, in fact, 
cost less because damage to the environment should not be treated as an 
externality (Coase 1960). Thus, if “proper” accounting methods are used, 
environmentally safe production methods are less costly in the long run. 
However, we focus on standard economic measures that do not reflect 
environmental externalities (in large part due to difficulties estimating the 
true cost of environmental externalities). Consequently, our analyses are 
likely to underestimate the economic benefits of environmentalism. 


4. On the other hand, it has been suggested that stringent environmental 
regulations in less developed countries could attract foreign investment 
because multinational corporations typically have a superior ability than 
host domestic firms to comply with new environmental regulations 


(Pearson 1987). 
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Also, the core may benefit economically from developing and exporting 
pollution control technologies, and by pushing for environmental reforms 
that put larger burdens on the periphery. 


Ecological modernization theorists, as well as some policymakers and political 
actors, have embraced the idea that environmentalism and growth can co- 
exist, in some form of “sustainable development” via a pro-environmental 
reform of the capitalist political economy that de-links economic growth from 
an increasing level of consumption of limited natural resources and avoids 
the overexploitation of the natural environment (Pearce 1988; Simonis 1989; 
Giddens 1998; Hajer 1995; World Bank 1992; World Commission 1987). 


This presumes that at least some firms bear the costs of such investments, 
rather than fleeing to other countries. If industry flight is extreme, this 
dynamic would not occur. 


Organizational scholars find that a growing number of firms are adopting 
environmentally friendly management practices that positively affect their 
actual economic performance (Hoffman 1997; Prakash 2000; Hoffman 
and Ventresca 2002; Levy and Rothenberg 2002). Firms are considering 
environmentalism as a strategic goal that can improve efficiency and corporate 
image (Hajer 1995; Christensen, Craig and Hart 2001), market advantages . 
over competitors (Porter and van der Linde 1995) and compensation from 
public institutions (Delmas and Terlaak 2002). 


Of course, substantial differences in policy — and especially enforcement 
- still remain and thus incentives for the flight of capital and industry 
presumably still exist. 


. Normative processes also play a role. Like fur, ivory increasingly has negative 


connotations. 


. We would also like to include measures of domestic environmental protest 


activity, citizen environmentalattitudes and behaviors, environmental practices 
and the enforcement of environmental laws. However, such variables are not 
available for a large sample of nations during the time period of interest. 


. Data for 1980 is not available. Data from 1976, the nearest available time 


point, is used instead. 


. Other methods of index construction yielded similar results, as did indices 


based on various sub-sets of these measures. 


. Analyses of other time-periods, such as 1985-1995, produce similar results. 


. The inclusion of trade in the models does not significantly alter the economic 


growth results. 


. Interestingly, these variables have lower t-values than in classic studies of 


the 1960s, and the effect of initial GDP per capita is not significant at the .05 
level (Barro and Sala-i-Martin 1995). This may result from the stumble of the 
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newly-industrializing Asian economies in the 1990s, as well as rapid growth 
among the core economies in this period. 


17. "Most developed" countries are defined as OECD members with GDP 
greater than $14,000 per capita in 1980: Australia, Austria, Belgium, Canada, 
Denmark, Finland, France, Germany, Iceland, Italy, Japan, Luxemburg, 
Netherlands, New Zealand, Norway, Sweden, Switzerland, United Kingdom 
and United States. 


18. Singapore, Romania and Lesotho are identified as moderate outliers in 
the investment as a percentage of GDP analyses. Removing these three 
countries did not affect the sign or significance of the results. Because it 
made no difference, they were included in the analyses presented here. 


19. Botswana, a moderate outlier, is excluded from the analysis. Results are 
not affected. 


20. We observed similar results when imports and exports were measured as a 
percentage of GDP 


21. Results were based on five-year pooled panel models with random 
effects, comparing 1960-80 and 1980-1995. The environment indicator is 
a simplified version of the one used in the main analyses. Specifically, the 
variables for "domestic environmental organizations" and "domestic ecology 
organizations" were not available prior to 1980. 


22. We also observed a positive and statistically significant interaction between 
our environmentalism index and the "late" period (defined as post-1980) in 
our pooled panel analyses, confirming that the difference in coefficient size 
over time is significant. A later time-period cut-off, such as 1985, yielded 
even stronger effects. 


23. Also, we conducted analyses in which we included dummy variables for 
semi-periphery and periphery. In such models (not presented here) we still 
observe the same effect of environmentalism. 


24. The negative aggregate effect may mask some positive changes driven by 
the growth of organic products or other lucrative eco-friendly agricultural 
activities. For example, the past decade has seen rapid growth of organic 
agricultural products, "fair trade" coffee and the like, which may lead to 
expansion of the agricultural sector. At present, however, the negative effects 
appear to dominate the statistical analysis. 
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Appendix A. Descriptive Statistics for Variables Included in Analyses 


» 


N Mean Standard 
A AA O oN 
Annualized Real GDP Growth 1980-98 (PWT 6) 95 .869 1.930 
Real GDP per capita 1980 (log) (PWT 6) 102 8.32 .999 
Secondary Enrollment Ratio 1980 102 46.01 30.47 
Investment (% GDP) 1980 102 16.69 8.73 
Trade Openness (% GDP) 1980 102 69.16 49.74 
Environmentalism Index 1980 102 2.98 5 79 
Annualized GDP Growth 1980-1999 (WDI 2001) 96 .977 1.80 
GDP per capita (log) 1980 (WDI 2002) 97 7.602 1:55 
Investment (% GDP) 1998 95 15.27 8.13 
Investment per capita 1998 95 1886.68 2562.27 
Investment per capita 1980 102 1368.73 1570.18 
Industry Value Added (% GDP) 1999 83 27.86 8.77 
Industry Value Added (% GDP) 1980 84 30.53 10.56 
Agriculture Value Added (%GDP) 1999 83 19.53 15:55 
Agriculture Value Added (%GDP) 1980 84 21.34 14.81 
Service Value Added (%GDP) 1999 83 52.60 13.07 
Service Value Added (%GDP) 1980 84 48.07 11.05 
Foreign Direct Investment (%GDP) 1999 9n 3.47 4.63 
Foreign Direct Investment (%GDP) 1980 94 1.03 1.12 
Exports per capita (log) 1999 93 6.40 1:92 
Exports per capita (log) 1980 90 6.09 1.67 
Imports per capita (log) 1999 93 6.53 1.75 


Imports per capita (log) 1980 90 6.45 1.44 
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Appendix B. OLS Regression Model: The Effects of Environmentalism on Industrial, 
Service and Agricultural Activity (as % of GDP) 1980-1999 


Independent Variable Model B1 Model B2 Model B3 
Industry Services Agriculture 
% GDP % GDP % GDP 


Industry as share of GDP in 1980 298°" E m 
(.081) 
Service as share of GDP in 1980 ES lg e 
(.089) 
Agriculture as share of GDP in 1980 Has o Dome 
(.103) 
GDP per capita (log) in 1980 -1.32 5.02: -2.63 
(1.51) (.154) (2.07) 
Investment as share of GDP in 1980 121 -.111 -.120 
(.137) GT) (.157) 
Trade as share of GDP in 1980 .009 .003 .005 
(.016) (.017) (.019) 
Secondary school enrollment in 1980 -.001 .003 -.003 
(.002) (.046) (.051) 
Environmentalism in 1980 .656 3025 -3.16* 
(1.331) (.139) (1.53) 
Constant 19.4 -11.6 -30.5 
(10.5) (10.4) (17.7) 
Adjusted R? .516 154 184 
Number of countries 73 T2 i 


*p<.05 *p<.01 **p<.001 (two-tailed tests) 
Note: Standard errors in parentheses 
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Affect Controlin International Interactions 


David R. Heise, Indiana University 
Steven J. Lerner, Blue Hill Group and Yankelovich Partners 


This research tests the proposition that national leaders generate 
international interactions in the process of maintaining sentiments 
about nations and international actions. The analysis deals with 1,934 
international incidents in which one of 25 Middle Eastern nations 
responded twice within four weeks to an instigation by another of 
the 25 nations. Quantitative predictions from affect control theory 
correlate significantly with quantitative measurements of observed 
responses. In particular, affect-based predictions account for 59 percent 
of the variance in the nations’ cooperation-conflict. Thus, international 
interactions are affectively-regulated to a significant degree. 


This study uses affect control theory (Heise 1977, 1979, 2002; MacKinnon 1994; 
Smith-Lovin and Heise 1988) to analyze how individuals produce international 
events. The specific hypothesis is that nation states and international actions 
have affective meanings in the worldwide political culture postulated in neo- 
institutionalism (Finnemore 1996; Jepperson 2002; Meyer et al. 1997), and 
relevant individuals try to maintain these meanings through political actions. 

This hypothesis is empirically tested by predicting international action-reaction 
sequences in the Middle East during the 1970s. While happenings at that time 
and place are not representative of all international interactions, the focus is 
justified by excellent event records for the region and period (Azar 1980, 1993b) 
and by the continuing political volatility of the Middle East. 

Before turning to the empirical study, a synopsis of affect control theory is 
given. Then the use of microsociological theory in macrosociological analysis is 
addressed in terms of two questions. How can wide ranging social events be 
interpreted in terms of individual action theory? To what extent are sentiments 
about national identities and behaviors shared in a world culture? 


Affect Control Theory 


According to affect control theory, individuals in a social situation allocate 
identities to each other and then perform identity-appropriate actions in order to 
confirm sentiments associated with the identities and actions. 

Sentiments are enduring affective associations; impressions are transient 
affective associations emerging from observed events (Heise 1979:1). 
Events confirm or disconfirm sentiments about entities in an event by 
creating impressions of the entities that match or differ from the sentiments 


about the entities. 


Thanks to Jason Beckfield, Adam King and Kent McClelland for helpful comments 
regarding this work. Direct correspondence to David R. Heise, Department of Sociology, 
Indiana University, Bloomington, IN 47405. E-mail: heise@indiana.edu. 
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Affect control theory proposes that individuals construct new events in 
such a way as to turn disconfirming impressions into impressions that confirm 
sentiments. For example, an aggressive action by one friend to another produces 
relatively negative impressions of both individuals, and the other friend has to 
engage in an especially good act, like forgiveness, to restore feelings to normal. 
Simulation and experimental studies (Heise 1979; MacKinnon 1994; Smith-Lovin 
and Douglass 1992; Smith-Lovin and Heise 1988) have demonstrated that this 
perspective accounts for institutional role behavior, as well as predicting creative 
responses to deviant acts. 

A cross-cultural project conducted in dozens of societies and heterogeneous 
languages (Osgood et al. 1975) established. three universal dimensions of 
affective meaning: Evaluation, Potency and Activity (EPA). Bipolar graphic rating 
scales with contrasting adjectives at each end are used to measure affective 
meanings - both sentiments and impressions - on these dimensions. In English, 
the Evaluation dimension may be measured with a scale that contrasts "good, 
nice" with "bad, awful." Potency may be measured by contrasting "powerful, big" 
with "powerless, little." And, Activity may be measured by contrasting "lively, 
active, fast" with "quiet, passive, slow." 

The transformation of impressions in an event is described by affect control 
theory's empirically-derived equations for predicting impressions (Heise 1979; 
MacKinnon 1985/1988/1998; Smith-Lovin 1987; Smith et al. 1994). These contain - 
linear and non-linear terms, as shown in the following simplified example from 
Smith-Lovin (1987:37). 


A! =-0.26+0.39x 4, +0.48x B, +0.25x B,x O, 


In this equation, A’ is the outcome evaluation of the actor in an event: A, is the 
evaluation of the actor before the event; B, is the evaluation of the actor's 
behavior; and O, is the evaluation of the object of action before the event. The 
equation shows that the outcome evaluation of the actor depends on the previous 
goodness-badness of the actor, on the goodness-badness of the behavior that 
the actor performed, and on the consistency between the behavior evaluation 
and the prior evaluation of the object. 

Impression-formation equations currently used in affect control theory have 
more terms than the above, including cross-dimensional terms — e.g., a term 
multiplying behavior evaluation times object potency. Also, there are nine 
different equations — a separate equation to estimate the evaluation, potency 
and activity outcomes for the actor, behavior and object. The equations can be 
examined in affect control theory's behavior simulation program available over 
the Internet (Heise 1997). 

Published articles and books discuss subtleties in impression formation 
associated with consistency and cross-dimensional effects (see especially Smith- 
Lovin and Heise 1988). However, roughly speaking, actors performing behaviors 
create impressions of themselves as follows. Engaging in good actions makes 
the actor seem benevolent, and engaging in bad actions makes the actor seem 
rancorous, though the object's goodness-badness moderates these effects. 
Controlling actions make the actor seem strong, while servile actions make the 
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actor seem weak. Rushed actions make the actor seem active, and unhurried 
actions make the actor seem passive. Examples of effects on the object of action 
are: an individual who is the object of a good, weak and active behavior seems 
benevolent, potent and active; while an individual receiving a bad, strong, passive 
action seems evaluatively neutral, slightly impotent and passive. 

Equations for predicting behavior are derived from the impression-formation 
equations in conjunction with the proposition that people try to construct events 
that confirm sentiments (Heise 1977, 1979, 1985, 1992). First, the impression- 
formation equations are used to predict differences between sentiments and 
the impressions that will be produced by the next event. Then, the differences 
represented by these equations are minimized via calculus, yielding new 
equations that specify an Evaluation-Potency-Activity profile for an optimal future 
behavior. The EPA profile for the future behavior is defined in terms of cultural 
sentiments and current impressions regarding the actor and object. 

Affect control theory presumes that the actor chooses intelligently 
among behaviors that instantiate the optimal EPA profile, and that he avoids 
behaviors with affective meanings so far from the profile that they imply 
immorality or lunacy. 


Macroactors, Macroactions 


Cetina and Bruegger (2002) chronicled computer-mediated transactions in 
the global financial market, and observed that the discourse in face-to-screen 
financial interactions occurring over great distances, such as New York to 
Tokyo, shows many characteristics of face-to-face relations. They argued that 
"microsociological structures and relationships are what instantiate some of 
the most globally extended domains - for example, global financial markets. 
In the last few decades, we have witnessed a rise to structural equivalence 
of what Goffman called the interaction order and macrosocial phenomena." 
(Cetina and Bruegger 2002:907) 

The system of nation-states is another venue where there is equivalence 
between an interaction order and a global order. Political representatives of 
nation-states are linked via world-wide interpersonal ties (Giddens 1991). 
As these individuals interact with each other at a distance, international 
interactions result. 

Various types of individuals perform state actions. Officials — presidents, 
prime ministers, ambassadors - are chartered to act in the name of their nations 
within prescribed limits. Offices such as that of president encompass a group of 
advisors and staff members, so an interaction between two nation-state leaders 
amounts to interaction of their retinues. Shadow governments of out-of-power 
political parties sometimes have enough international legitimacy to augment 
the international behaviors of a state. Also, collective identities have role-like 
implications (Thoits and Virshup 1997), and thereby individuals in parliaments, 
patriotic masses or paramilitary groups may respond to foreign stressing of 
national identity by performing nationalistic actions (Irwin 2003, 2004; see also 
Troyer and Robinson 2006). These various kinds of individuals are the animating 
agents of the nation-state as macroactor. 
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Heise and Durig (1997) introduced the concept of macroaction to designate 
interpersonal behaviors that are accomplished through the activities of third 
parties. The third party may be a social organization embodied in a material 
establishment, implementing a division of labor, employing professionals and 
relying on sophisticated information-processing. Macroactions enable political 
action at a distance without requiring dematerialization of behaviors, as in the 
case of the global financial market described by Cetina and Bruegger (2002). 
For example, lodging a diplomatic protest to register distress about an incident 
begins with a prime minister or president instructing her foreign minister to 
deliver an appropriate document to the leader of another nation. Staff members 
at the foreign ministry prepare and manage movement of the text and coordinate 
a meeting with the ambassador of the other nation, to whom the protest 
document is handed for delivery to the ambassador's superiors via diplomatic 
pouch. Thus, one ruler gives a concrete document to another at a distance by 
employing diplomats and their staffs as instruments of action. 

Macroactions compounded recursively within bureaucracies or armies may be 
far-reaching in time and space. For example, a presidential order to act militarily 
instigates further orders from the chiefs of each military service, which in turn 
instigate cascading orders among officers of each service — all marshaling legions 
of warriors to far-flung places over a period of weeks or months. “Yet the crucial 
insight continues to hold: these vast social complexes are instruments of action - 
by individuals — magnates and rulers." (Heise and Durig 1997:112) 

A wide range of international macroactions are available to individuals who 
perform state actions. Azar and Lerner (1981) distinguished 125 types in a variety 
of categories including confederating, forming strategic alliances, providing 
military and non-military support, displaying discord or hostility, and acts of war. 


Shared Sentiments 


Worldwide political culture provides "cognitive and ontological models of 
reality that specify the nature, purposes, technology, sovereignty, control, and 
resources of nation-states and other actors." (Meyer et al. 1997:149) Are affective 
meanings also shared in a worldwide culture? If so, we should find cross-national 
correlations in sentiments about identities and behaviors. 

Studies by different researchers working in six nations — the United States, 
Canada, Northern Ireland in the United Kingdom, Germany, Japan and the 
People's Republic of China — have provided quantitative EPA measurements of 
hundreds of social identities and behaviors (data available in Heise 1997). Heise 
(2001) correlated the average ratings of equivalent concepts in different cultures, 
and found substantial correspondence. The median values of the cross-national 
correlations reported by Heise (2001: Table 2) for average ratings of identities 
were: 0.82 on evaluation (goodness), 0.78 on potency (powerfulness) and 0.62 
on activity (liveliness). The median cross-national correlations for average ratings 
of behaviors were: 0.86 on evaluation, 0.45 on potency and 0.44 on activity. 

Thus there is cross-national convergence in evaluations of people and 
behaviors. "An international moral order circumscribes judgments of morality 
and the allocation of honor and stigma in interpersonal relations, and this moral 
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order bridges human societies across continents, across political economies, 
and across languages." (Heise 2001). 

The cross-cultural correlations for potency and activity are large enough to 
indicate that international orders also exist with regard to these dimensions, 
though less dramatically so than in the case of evaluations. The lower correlations 
for potency and activity probably result from limited variance — attenuation of 
correlation due to restriction of range is a well known statistical phenomenon 
(e.g., Lord and Novick 1968:129). Evaluations of identities and behaviors range 
broadly from extremely bad to extremely good, providing a substantial base 
for cross-cultural comparisons. However, behavior potencies vary mainly from 
slightly potent to extremely potent, and activities of identities and behaviors 
concentrate around the middle of the activity scale. 

While none of the concepts examined in the Heise (2001) study concerned 
international relations, Azar and Lerner's (1981) study of political macroactions 
provides some evidence of convergence in international-relations sentiments. 
Azar and Lerner (1981, Table 1) reported mean EPA ratings of nation-state actions 
on the one hand from professionals in international relations and on the other hand 
from laypeople — mainly U.S. undergraduates. Comparing the mean ratings from 
the international-relations professionals with the mean ratings from the laypeople, 
one finds that evaluations of international behaviors correlate 0.96 between 
the two groups; potency ratings of the two groups correlate 0.80; and activity 
ratings correlate 0.83. Thus the affective meanings of international behaviors 
held by international-relations professionals largely correspond with the affective 
meanings held by laypeople. From this, one might infer that international-relations 
professionals worldwide possess similarity in affective meanings of international 
behaviors at least as great as the cross-national similarity in laypeople's affective 
meanings demonstrated by Heise (2001). In fact, the professional consensus 
should be greater because professionals are the ones who propagate worldwide 
culture, according to neo-institutionalist arguments. 


International Affect Control 


In accord with the argument of Meyer et al. (1997) that interactionist perspectives 
account for much of the behavior of national leaders, this study applies affect 
control theory to international interactions as follows. National leaders and other 
concerned individuals take on nation-state identities and utilize international 
macroactions to maintain affective meanings of their nations and of international 
behaviors. As these individuals personify their countries and engage ininternational 
macroactions with each other to regulate their interpersonal expressive order, 
they produce a world interaction order among nation-states. 

To illustrate, here is an analysis of affective processing by hypothetical states, 
constructed with affect control theory's computer simulation program /nteract 
(Heise 1997; Schneider and Heise 1995), and measurements of macroaction 
sentiments from Azar and Lerner (1981). 

Sentiments about three nation-states are set as follows. Nation 1 and Nation 
2 are very benevolent, somewhat controlling and somewhat active (similar 
to Ethiopia and Morocco during the 1970s). Nation 3 is extremely rancorous, 
extremely controlling and extremely active (similar to Israel in the 19705). 


998 + Social Forces Volume 85, Number 2 * December-2006 


Lacking any external disturbances, the leader of Nation 1 confirms the affective 
meaning of his nation by acting extremely good, quite strong and a bit lively 
toward Nation 2, as in “conducting or elevating friendship agreements” (quoted 
phrases are international macroactions from Azar and Lerner 1981). Indeed, such 
behavior would be equilibrium activity between the two nations. 

Suppose that the leader of Nation 3 chooses to act on Nation 1. The optimal 
macroaction in this case is somewhat bad, a bit strong and very lively — such as 
"condemning strongly specific actions and policies." This macroaction maintains 
the affective meaning of Nation 3, but it causes Nation 1 to seem less benevolent, 
controlling and active than it is supposed to be. 

The leader of Nation 1 might try to recover from the macroaction of Nation 3 
by returning an unfriendly macroaction to Nation 3, such as "granting sanctuary 
to opposition leaders." Another, and actually more effective, way for Nation 1 
to recover its affective meaning is to return to the relationship with Nation 2 
and act stronger and more lively than usual while retaining the usual goodness 
—e.g., by “ceasing economic restrictions.” Still another possibility (and the least 
effective) is to engage in good, weak, lively domestic macroactions, such as 
“celebrating holidays.” (Azar and Lerner 1981 assess sentiments for 113 domestic 
macroactions as well as for international macroactions.) 


An Empirical Test 


The empirical test of affect control theory's usefulness in analyzing international 
relations is based on the theory's predictions about action-reaction sequences in 
the Middle East during the 1970s. Incidents are considered where nation A acted 
on nation B, and then within four weeks nation B acted at least twice on nation 
A. For each of these incidents, affect control theory's prediction about nation B's 
action toward nation A — given nation A's earlier action on nation B — is compared 
with nation B's actual behaviors toward nation A. 

The empirical analyses take account of the stable affective meanings of the 
two nations in their relation with each other and of the affective disturbance 
created by the initial action of nation A to nation B. They do not make allowance 
for (1) earlier interaction between the two nations; (2) nation B's ameliorative 
interactions with nations other than nation A; or (3) nation B's displacement 
of international disturbance into domestic activity. Overall, then, the testing is 
conservative in registering only part of the web of continual international and 
domestic activity. 


Data 


The data about international events come from Azar's (1980, 1993b) Conflict and 
Peace Data Bank. Similar datasets are discussed in a special issue of the journal 
International Interactions (Duffy 1994). However, COPDAB is notable as a large 
database (431,263 events) for the years 1948-1978 built on reports from 71 
worldwide sources and giving excellent coverage to international interactions 
in the Middle East. 
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This study focuses on Middle Eastern events during the period 1971-78, 
divided into a five-year span, 1971-75 and a three-year span, 1976-78. The split is 
uneven because there are fewer events in the early years than in the later years, 
superior data-gathering having been instituted for the later years (Azar 1993a). 
Events in the five-year span are used to derive quantitative measurements of 
the affective meaning of each international actor. Events in the three-year span 
are used to test theoretical predictions based on quantitative measurements 
from the five-year span. 

This study deals with interactions among 25 Middle Eastern political entities: 
Afghanistan, Algeria, Bahrain, Egypt, Ethiopia, Iran, Iraq, Israel, Jordan, Kuwait, 
Lebanon, Libya, Morocco, North Yemen, Oman, Pakistan, Saudi Arabia, Somalia, 
South Yemen, Sudan, Syria, the Palestinian Liberation Organization, Tunisia, 
Turkey and the United Arab Emirates. All of these are referred to as nations, 
including the PLO because it was recognized by the United Nations in 1974 as 
the representative of the Palestinian people. 


Assaying Behavior Sentiments 


Azar and Lerner (1981) obtained Evaluation, Potency and Activity ratings of 125 
international behaviors from 29 professionals in international relations. The raters 
were State Department employees, consulting firm specialists and academics 
in and out of the United States, drawn from the collegial network of Edward 
Azar (1938-91), a Lebanese-born political scientist specializing in the Middle East. 
Adjectives defined the polar opposites of the three rating scales: Evaluation — 
good-bad; Potency — strong-weak; Activity — fast-slow. Each scale had nine rating 
positions with adverbial anchors of infinitely, extremely, quite and slightly on 
either side of a middle position labeled "neutral or neither." Ratings in the various 
positions were interpreted numerically with a derived metric from Heise (1978), 
and mean ratings of the 125 international behaviors varied from approximately -3 
to +3 on each scale. 

Though Azar and Lerner (1981) did not report reliabilities of their measures, 
these can be estimated as follows. Variances in rating a behavior on similar scales 
typically are 1.29 on Evaluation, 2.26 on Potency and 1.68 on Activity (Heise 
1978: Table 4.9). Hence, with 29 raters, the error variances of the mean ratings 
would be about 0.04 on Evaluation, 0.08 on Potency and 0.06 on Activity. Total 
variances of the mean ratings across the 125 behaviors presented by Azar and 
Lerner (1981) are: 2.36 on Evaluation, 0.38 on Potency, 0.40 on Activity. Entering 
the error variances and total variances in standard equations from measurement 
theory (Lord and Novick 1968) yields the following reliabilities for Azar and Lerner's 
(1981) measures: 0.98 for Evaluation, 0.79 for Potency and 0.85 for Activity. 

The COPDAB database describes international events with behavior 
descriptions that are more detailed than the 125 specified by Azar and Lerner 
(1981). For example, one of the actions by Syria to Saudi Arabia is described in 
terms of an Activity "accept," an /ssue-Area "Saudi idea for sharing Euphrates 
water," and a Scale Value of 6. Trained analysts on the COPDAB project assigned 
a Scale Value to each event by coding event descriptions from the source 
periodicals into one of 15 scale categories, with value 1 being for the most 
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cooperative events between nations (e.g., voluntarily uniting into one nation- 
state) and value 15 representing the most conflictive events between nations 
(e.g., total war). The codebook for scaling events (Azar 1993a) offered examples 
of events at each level, these examples being the 125 events that later were 
studied by Azar and Lerner (1981). 

One conceivable method of converting COPDAB descriptions into Evaluation- 
Potency-Activity measurements would be to compute the mean EPA protile of all 
Azar-Lerner (1981) events that were used as examples of a particular COPDAB- 
Scale value and then assign that mean EPA profile to COPDAB events with that 
Scale Value. The method would expand the COPDAB-Scale values to the three 
EPA dimensions but give only 15 different configurations of EPA. Lerner (1983, 
pp. 88-9) devised an alternative procedure to use more of the information in a 
COPDAB description and obtain a greater variety of EPA configurations. 

Lerner classified COPDAB records into more than 6,000 combinations of 
Scale Values and Activities. (A specific Activity, such as agree, combines with 
multiple Scale Values since nations' agreements vary in their degree of conflict- 
cooperation.) The classification was reduced to 431 most common types of events 
by keeping only those that applied in five or more records and that accounted for 
at least 0.5 percent of the records within a particular Scale Value. Lerner (1983) 
assigned a subset of the 125 behaviors in the Azar-Lerner (1981) study to each of 
the 431 event types by selecting behaviors appropriate to the Activity from the- 
list of behaviors exemplifying the given Scale Value in the COPDAB codebook. 
For example, the event type Scale Value 7 and Activity "attend" can be used to 
describe "Meeting of high officials," "Conferring on problems of mutual interest," 
or "Visits by low level officials for talks, but is not representative of "Proposing 
talks," "Exchanging prisoners of war," or any of the seven other Azar-Lerner (1981) 
behaviors that exemplify Scale Value 7. The EPA ratings for all Azar-Lerner (1981) 
behaviors related to a particular event type then were averaged, producing a 
single EPA profile for that type. 

Use of 1,293 computer-implemented rules (431 event types times three 
dimensions) yielded EPA profiles for COPDAB descriptions of events in the 
Middle East, 1971-78. The assignments of EPA profiles to behavior categories 
dealt with 89 percent of the events between Middle Eastern nations during the 
years 1971-78. Events whose Activity verbs were not among the 197 used in 
Lerner's (1983) behavior classification system were assigned the mean EPA 
values of all successfully-coded events with the same Scale Value. 


Assaying National Sentiments 


Sentiments about national identities in the Middle East were inferred from dyadic 
events in the period 1971-75. Of the 300 dyads among the 25 nations, 151 were 
suitable for analyses. Excluded were 121 dyads that had eight or fewer events in 
1971-75 from which to estimate sentiments. 

An additional 28 dyads were excluded because their mean levels of 
cooperation-conflict, as measured by COPDAB Scale Values, changed by 1.2 or 
more between 1971-75 and 1976-78, indicating that behavior-based estimates 
of identity sentiments from the earlier period no longer would be relevant to 
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behavior in the later period. For example, the Egypt-Israel dyad was eliminated 
because national sentiments estimated during the 1973 Ramadan (Yom Kippur) 
War could not be explanatory for the more positive interaction between these 
nations in 1977-78; the change of mean Scale Values from 9.4 to 8.1 reflects the 
decline in conflict. Dropping the 28 dyads from analyses implements the goal 
of seeing whether affect control theory predicts behavior generated by shifting 
impressions, rather than by changes in sentiments. 

EPA profiles of nation-to-nation behaviors were averaged to obtain a single 
summary profile. For example, Iraq acted on Saudi Arabia 17 times during the 
years 1971-75, and the mean Evaluation, Potency, and Activity of those acts was 
1.53, 0.69, 0.55, respectively. 

Affect control theory's reidentification model (Schneider and Heise 1995), 
as implemented in the computer program /nteract (Heise 1997), was used to 
answer the question: What kind of actor nation would perform this average act 
on that kind of target nation? Initially, the target nation was given an EPA profile 
consisting of all zeros. On later rounds of calculations, the target nation was 
assigned the profile computed for it as an actor on the previous round. Eight 
iterations yielded EPA profiles for all nations that were stable from round to round 
within 0.02 units. 

To illustrate: Asking what kind of nation would perform the mean Afghanistan- 
Iran action (1.54, 0.89, 0.49) on a nation with a neutral EPA profile (0,0,0) gives 
a first estimate of Afghanistan's identity in its relations with Iran: 1.15, 0.83, 
0.73. By a parallel computation, Iran's identity in its relations with Afghanistan 
is estimated as 1.35, 0.85, 0.77. On the next round, we ask what kind of nation 
would perform the mean Afghanistan-lran action on a nation with a profile of 
1.35, 0.85, 0.77, and we get the revised identity for Afghanistan of 1.86, 0.77, 
0.81. At the end of eight such iterations, the estimate of Afghanistan's identity in 
its relationship with Iran had stabilized at 2.06, 0.83, 0.79. 

The end result of these computations was a set of EPA profiles for each 
nation specifying its actor identity in a dyadic relationship with another nation. 
In essence, a nation's different identities define how it adjusted its behavior 
in different relationships. For example, Israel was “warm” in its relations with 
Ethiopia, "apathetic" in its relations with Turkey, "domineering" in its relations 
with Iraq. (These descriptive adjectives were obtained by assuming that Israel's 
unmodified identity was the mean of its relational identities in all dyads: -0.62, 
0.14, 0.67, and then /nteract was used to find a modifier explaining Israel's 
average behavior in each relationship.) 


Delineating Incidents 


The test of affect control theory requires sequences in which a nation's action 
on another nation was followed by reaction from the other nation. The goal is to 
predict the reaction using affect control theory's behavior model. 

It was assumed that affect-based reactions typically would occur no more than 
four weeks after the initiating event. However, not knowing whether the response 
in every such incident constituted a reaction to the initial event, incidents were 
expanded to encompass two reactions. An incident was recognized whenever 
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an actor's behavior toward a target was followed by two behaviors of the 
target toward the actor within four weeks, without consideration of thematic 
connections among the three behaviors. In the years 1976-78 there were 1,934 
such incidents among the 151 nation-dyads that were suitable for analyses. 


Behavior Predictions 


Affect control theory's predictions of reactions 
in each of the 1,934 incidents were computed 
with computer program /nteract (Heise 1997). 
First, Evaluation-Potency-Activity profiles for the 
dyadic identities of the nations and for inter-nation 
behaviors were imported into the program. Next, 
for each incident, /nteract calculated impressions 
created by the initial event, and then computed the 
EPA profile for the reaction that would best change 
the impressions created by the first event into new 
impressions optimally matching sentiments. 

For example, on June 17, 1976, Libyan leader 
Kaddafi opposed Syrian intervention in Lebanon — a 
behavior with an EPA profile of -0.90, 0.40, 0.60. 
Entering this initial behavior into program /nteract 
along with the sentiments for Libya (1.65, 0.53, 
0.87) and Syria (1.87, 0.62, 0.86) in their relation 
with each other, we find that affect control theory 
predicts that the resulting impression of Libya was 
-0.30, 0.46, 0.94; the resulting impression of Syria 
was 0.79, -0.34, 0.48; and the Syrian response to 
reduce the discrepancies between impressions and 
sentiments should have been a behavior having 
an EPA profile of 2.02, 1.38, 0.34. The first of 
Syria's actual responses occurred three days after 
Kaddafi's remarks when Syria's President Assad met 
with Libya's Premier Jalloud for talks on Lebanon 
- a behavior with EPA profile 1.90, 0.90, 0.50; the 
second actual response occurred after eight days 
when Syria negotiated with Libya on the withdrawal 
of Syrian forces in Lebanon — a behavior with profile 
of 1.50, 0.50, 0.40. 


10 


14 


21 


Results 


Second Response Occurs Within Days: 


Linear canonical correlation analysis1 dealt with 
the multi-dimensional nature of the predictions 
and observations about international behaviors. 
The  Evaluation-Potency-Activity profiles for 
predicted actions constitute one set of variables in 


Canonical 
Notes: Correlations are computed over Middle Eastern incidents, 1976-78, for dyads having stable sentiments during the period, 1971-78. 


Significances were obtained with Bartlett test of residual correlations: * p «.05 ** p < .001. 





Table 1: Canonical Correlations Between EPA Profiles for Predicted and Two Observed Responses 
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each canonical analysis, and the EPA profiles for actual actions constitute the 
other set of variables. Canonical analyses identify ways of summing variables in 
each set into multiple variates with maximal correlations across sets and zero 
correlations within sets. 

The first question is whether affect control theory predictions about nations' 
reactions to other nations' actions correlate with the first two reactions that 
actually were observed after an instigating event. Since the optimal lag for 
affective responses in international interactions was not known beforehand, 
canonical correlations were computed for subsets of incidents involving different 
maximum lags, from one day to four weeks. 

The most striking feature of the results in Table 1 is the high correlation 
achieved in the primary pair of canonical variates, regardless of amount 
of lag. Such power of explanation substantiates affect control theory's 
usefulness in this domain. 

The table shows significance levels for the correlations between each pair of 
canonical variates.2 The first pair of variates is related at high levels of significance 
in every column of Table 1; the second pair is related significantly in all samples 
of more than 300 incidents; and the third pair is not significantly correlated. The 
appearance of two significant canonical correlations indicates that affect control 
theory's predictions relate in a multi-dimensional manner to measurements of 
actual international behaviors. 

Correlations for the two significant canonical variates generally are highest at 
short lag periods, and the second canonical correlation declines notably when 
the maximum lag is longer than 14 days. This suggests that affective reactions 
in international interactions mostly occur within two weeks of a provoking event, 
and that is the cut-off used for subsequent analyses. 

Table 2 shows canonical loadings on the significant canonical variates in the 
sample of incidents with a maximum lag of 14 days. Canonical correlations in this 
case are 0.770 for the first pair 
of variates and 0.221 for the Table 2: Canonical Loadings for EPAs of Predicted and 
second pair of variates. Observed Responses 

The first numerical column 


e Canonical Variates 
in Table 2 indicates that 


the first pair of canonical M 1 2 
variates emphasizes behavior ACT Predictions 

Evaluation, within both the Evaluation 926 355 
predictions and the observed Potency -.296 .549 
responses. Additionally, Activity -.938 .168 
behavior Activity correlates Observed Responses 

negatively with these variates Event 1 Evaluation .886 .128 
for both predictions and Event 1 Potency .048 .384 
observations. Thus the variates Event 1 Activity -.636 OL 
meld Evaluation and Activity Event 2 Evaluation 898 116 
into a composite dimension Event 2 Potency 185 806 
contrasting bad and rushed Event 2 Activity - 642 072 
action with good and unhurried ^ Canonical Correlation 770 221 


ee SS 
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action. This meld corresponds to the COPDAB Conflict-Cooperation Scale (Azar 
and Lerner 1981) which correlates -0.93 with behavior Evaluation and 0.72 with 
behavior Activity in the 20,086 Middle East events, 1971-78. 

The second numerical column in Table 2 shows the composition of the second 
pair of variates, obtained after removing all variance explainable by the first pair. 
This pair of variates is defined in terms of predicted behavior Potency and in 
terms of the Potency of the later observed response; Potency of the earlier 
response also has a moderate correlation. Thus these canonical variates relate to 
a dimension of behavioral dominance, aligning potencies for predicted behaviors 
with Potencies of observed responses. The canonical correlation for behavior 
dominance is lower than the correlation for cooperation-conflict probably because 
of restriction of range: the variance of behavior potencies is only about one-sixth 
the variance of behavior evaluations. 

In summary, affect control theory predictions correlate highly with real events 
on a dimension of cooperation vs. conflict, where cooperation is a meld of positive 
evaluation and unhurriedness. Indeed, affect control theory explains 59 percent 
of the variance in inter-nation cooperation and conflict. The theory's predictions 
also correlate with real events on a dimension of behavioral domination, defined 
in terms of behavioral potency. Affect control theory predicts 5 percent of the 
variance in behavioral domination. Affect control theory's predictions work best 
for reactions occurring within two weeks of an instigating event. 


Discussion 


This study converts a three-dimensional affective space having cross-cultural 
validity into a bounded world for studying the interactions of macroactors. The 
dimensions of the space are analogous to familiar macrosociological variables 
- Evaluation and Potency are akin to status and power (Kemper 1992) and Activity 
relates to an actor's rate of agentic initiation (Heise 1999) — but there are some 
benefits to viewing the space in terms of affective dimensions as opposed to 
the macrosociological analogues. Past research has endowed the affective 
dimensions with sophisticated measurement technologies (Heise 2001; Osgood 
et al. 1975), and mathematical equations in affect control theory offer rigorous 
depictions of dynamic processes occurring in the affective space (Smith-Lovin 
1987). Thus, the affective space brings quantitative accuracy and mathematical 
depth to studies of macroactor interactions. 

This study shows that affect control theory predicts significant portions 
of variance in international action-reaction sequences, thereby documenting 
the importance of an expressive order in macroactor interactions. Affect 
control theory thus serves as a multi-level theory specifying general principles 
governing the social world. 

Affectivity motivates general goals of actors during exchanges (MacKinnon 
1994:54-9) - e.g., whether actions will be cooperative or conflictual. Thereupon 
rationality is exercised to choose among options and to implement selected 
actions. For example, a ruler maintaining an inter-nation expressive order may be 
inclined to cooperate with another nation. Implementing this inclination involves 
rational choice among macroactions such as "Conducting or elevating friendship 
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agreements," "Establishing technological and scientific communications,” 
"Conducting cultural-academic agreements or exchanges," "Repaying debts." 
The ruler's subordinates employ logic and reasoning to implement the selected 
macroaction. The importance of affect in setting goals explains why subjects 
in game-theoretical experiments often fail to act properly in their own interests 
(Davis 1983; McDermott 2002), thereby not meeting the rationality assumptions 
of game theory; their general inclinations toward cooperation or defection are 
expressive rather than rational. 

Affect control theory's power to predict cooperation vs. conflict in international 
behaviors emerged even though dealing solely with isolated dyadic incidents, 
disregarding nations' ameliorations of dyadic stresses by interactions with outside 
nations or by domestic activity. More sophisticated future analyses focusing 
on series of behaviors within a network of interactants should explain more of 
the variance in cooperation-conflict. Affect control theory's predictive capacity 
also emerged while giving no consideration to relevance between actions and 
reactions other than temporal proximity, and predictions might be improved 
by incorporating judgments about which events actually were responses to 
instigating actions. 

This study's analyses were limited to dyads that maintained a stable 
relationship throughout the 1970s so that estimates of sentiments and norms 
from 1971-75 would remain appropriate in the later period. However, affect 
control theory remains predictive when sentiments are allowed to be time- 
varying and are estimated from recently observed behavior, as demonstrated 
by analyses of international interactions reported elsewhere (Heise 2006). Time- 
varying sentiments might be caused by shifts in network influences (Axelrod 
1997: Chapter 7; Friedkin 1998; Ridgeway and Erickson 2000), especially ties to 
international bureaucracies according to neo-institutionalism (Meyer et al. 1997). 
Internal political successions by democratic or other means are crucial as well, 
installing leaders with different sentiments, who consequently behave differently 
toward other nations, and thereby change the meaning of their nation in the 
international community. 

Affect control analyses usually assign each interactant in a situation a single 
institutional identity, and the sentiment attached to that identity generates 
different roles as the interactant engages individuals with other identities. 
However, this study assigned multiple identities to a nation — a different 
sentiment for each of its international relationships — allowing that Its demeanor 
might be, for example, warm in one relationship and domineering in another 
relationship. Multiple sentiments for an interactant, corresponding to different 
relationships, are appropriate in situations lacking institutional structuration, such 
as interactions within an informal clique. Current international relations have this 
character notwithstanding structuration with regard to center and periphery in 
the modern world system (Wallerstein 1976). 

Related to the issue of multiple sentiments, 46 percent of the incidents that 
began with a negative instigation included at least one positive response — a fact 
that limited affect control theory s ability to account for reactions. Affect control 
theory will have to incorporate concurrent identities (Smith-Lovin 2002, 2003) in 
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order to predict simultaneous acts of vengeance and conciliation. For example, in 
one case Israel responded to a PLO attack by setting a curfew, thereby confirming 
its conflictual dyadic identity with the PLO, and also responded by agreeing to 
attend a UN conference with the PLO, thereby confirming its cooperative identity 
within the UN. 

To summarize, merging two sources of data — Azar (1980) and Azar and Lerner 
(1981) - produced a large dataset of international events, with each action and 
each nation measured on three affective dimensions. This dataset was used to 
conduct affect control analyses of international interactions in an entire region 
over a period of years. The analyses explained international activities with a high 
level of accuracy. 

This work justifies collection of Evaluation-Potency-Activity measurements for 
a variety of political concepts. Shrodt (2001:9) reported, "We now have a fairly 
stable set of about 4,000 verbs and verb phrases that are sufficient to capture 
most of the political behavior coded in the WEIS [World Event Interaction Survey] 
..., and a standard list of about 500 major political actors." Obtaining EPA profiles 
for 4,000 verbs and 500 nouns would be a substantial project, but feasible. With 
such dictionaries in hand, it will be possible to perform enlightening analyses of 
the accumulating data on international interactions, being assembled through 
automatic coding of news stories on the Internet (Duffy 1994; Schrodt 2001). 


Notes 


1. The canonical analyses were computed in SPSS using the CANCORR macro 
in file Canonical correlation.sps of the Advanced Statistics module. The 
macro is called with the following two lines of syntax. 


INCLUDE ‘c:\Program Files\SPSS\Canonical correlation.sps’. 
CANCORR SET1=varlist/SET2=varlist/. 


2. Some incidents shared events because the incidents come from 
continuous sequences of interaction, but significance testing assumes 
that the incidents are wholly independent. Culling overlapping incidents 
leaves samples of non-overlapping incidents: 532 with a 28-day lag, 420 
for 21 days, 275 for 14 days, 233 for 10 days, 115 for 7 days, 96 for 5 days, 
76 for 3 days, 48 for 2 days, and 18 for one day. The canonical analyses 
were recomputed for culled samples with more than 200 independent 
incidents, and the first two canonical correlations always were significant 
at the 0.01 level. Values of the first canonical correlation ranged from 0.442 
to 0.534 — lower than the values in Table 1, because culling eliminated most 
negative incidents, halving the Evaluation variance of observed behaviors, 
and thereby attenuating correlations. Values of the second canonical 
correlation ranged from 0.164 to 0.334. The third canonical correlation was 
significant at the 0.01 level in the 21-day sample with a value of 0.214 
but insignificant otherwise. Thus, results with the independent incidents 
sustained the conclusion of statistical significance. 
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When More Power Makes Actors Worse Off: 
Turning a Profit in the American Economy 


Mikotaj Jan Piskorski, Harvard University 
Tiziana Casciaro, Harvard University 


We propose a theory which predicts that an increase in an actor's relative 
power reduces the actor's rewards in high mutual dependence dyads. 
Our argument is based on the premise that higher relative power gives 
the more powerful actor a greater share of surplus, but it also reduces 
dyadic exchange frequency, which lowers the expected magnitude of 
that surplus. As mutual dependence increases, fairness issues associated 
with power imbalances reduce exchange frequency and expected 
surplus at an increasingly higher rate. Thus, at a certain level of mutual 
dependence, the more powerful actor obtains a greater share of a much 
smaller exchange surplus leading him or her to be worse off than he 
would be in an equal-power dyad. We support this prediction with data 
on profit rates in American industries from 1977 through 1992. 


Introduction 


Research in social exchange theory has long been concerned with the theoretical 
link between power capability and inequality in rewards (Cook, Emerson and 
Gillmore 1983; Marsden 1983; Yamagishi, Gillmore and Cook 1988; Coleman 
1990; Friedkin 1995). Perhaps the most basic finding in this stream of literature 
is that, following an increase in power capability, the power-advantaged actor 
will be better off by virtue of receiving a greater proportion of benefits from the 
exchange. However, the causal link between greater power capability and greater 
rewards holds only if the acquisition of power does not decrease the magnitude 
of rewards available in the exchange. If, under certain conditions, the acquisition 
of power capability were to substantially reduce the magnitude of rewards 
available, the power-advantaged actor would obtain a greater share of a smaller 
pool of rewards instead of a smaller share of a larger one. If the reduction in the 
pool of available rewards is substantial enough, the increase in power capability 
can make the power-advantaged worse off. This article identifies one important 
condition under which this is likely to occur. 

We develop our argument within the power-dependence framework (Emerson 
1962) which characterizes power capabilities in terms of re/ative power and total 
power. Relative power captures the difference in power capability between 
actors in a dyad (Lawler 1992; Lawler and Yoon 1996). Total power captures the 
actors' mutual dependence on each other - the extent to which the dyad as a 
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whole will be better off if the actors exchange with each other,rather than with 
their alternatives (Bacharach and Lawler 1981; Molm 1987, 1990). 

Our argument stems from the notion that an increase in relative power has 
two opposing effects. On the one hand, it allows the power-advantaged actor 
to obtain a greater share of the rewards from the exchange than he would in 
an equal-power exchange. On the other hand, the increase in relative power 
introduces issues of fairness and legitimacy that reduce the frequency of 
exchange between the actors. This reduction in exchange frequency in turn 
decreases the expected magnitude of exchange rewards available to the power- 
imbalanced dyad as compared to a power-balanced one. Under conditions of 
high total power, power inequalities complicate the bargaining agenda even 
further, such that an equivalent increase in relative power leads to a greater 
decline in the frequency of exchange (Lawler and Bacharach 1987; Lawler and 
Yoon 1993, 1996). Thus, as relative power increases, the expected magnitude 
of rewards available to the dyad becomes smaller at a faster rate in a high- 
total-power exchange than in a low-total-power one. At a certain level of total 
power, the reduction in the magnitude of exchange rewards will be so large that 
even though the power-advantaged actor obtains a greater share of exchange 
benefits, he will be worse off than if he received a smaller share of a greater 
pool of resources in equal-power exchanges. 

We support our predictions with data from a study of industry profit rates in - 
the American economy in the period 1977-1992. With this choice of setting, the 
paper departs from most empirical research in social exchange theory, which has 
traditionally hinged on experimental studies of individuals or small groups. An 
important difference between these studies and ours is that we draw on advances 
in network theory, particularly on the notion of constraint (Burt 1982) to construct 
the measures of relative power and total power. Cook and Whitmeyer (1992) 
argued that social exchange theory and network theory are generally compatible 
approaches to mapping social structures and understanding their effects. 
Nonetheless, these theoretical traditions are still developing largely in isolation 
from one another. By providing a model of profit differences derived from social 
exchange theory and implementing it using advances in network theory, we hope 
to show how the two approaches complement each other and to contribute to 
their integration. Testing our propositions in the context of economic exchanges 
outside the laboratory also offers the benefit of exploring the micro-to-macro 
generalizability of exchange theoretic explanations of inequality. 


Theory 
Exchange Structure 


We implement the power-dependence model in the context of an exchange in 
which actors / and / bargain over a fixed set of rewards, r. Both actors have 
outside options given by a,and a, respectively. The sum of these outside options 
is lower than the total amount of benefits available from the focal exchange. 
Relative power is determined by the ratio of the outside options which actors / 
and j receive if they fail to conclude the exchange with each other (a/a) Thus, if 
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actors have equal outside options, as illustrated in Configuration 1 in Figure 1, 
relative power in this dyad is balanced. However, if the more powerful actor's 
outside option is 1.5 times the value of the outside option of the less powerful 
actor, as illustrated in Configuration 2 in Figure 1, relative power is imbalanced 
by a factor of 1.5. 


Figure 1. Configurations of Relative Power and Total Power 
Low total power (.5) High total power (1) 


Configuration 1 Configuration 3 


80 e i«——, j e» 80 600 ¡5 jo 60 


Relative power 
balanced (1) 


Configuration 2 Configuration 4 
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Relative power 
imbalanced (1.5) 





In all configurations actors bargain over 240 units of profit 


The values of the outside options not only determine /'s and /'s profits if they 
fail to exchange with each other, but also have a direct effect on the distribution 
of rewards if / and / do exchange with each other. This is because, when actors / 
and / bargain over the distribution of rewards, they both have to receive at least 
the value of their outside option in order to have the incentive to participate in 
the exchange. Subsequently, / and j bargain over the distribution of the remaining 
benefits from the exchange - the exchange surplus, s, defined as r — a; — a,. If 
actors agree on the distribution of the exchange surplus and exchange with each 
other, they will be jointly better off by the value of the exchange surplus. The 
concept of total power, defined as s/(a,+a,), captures the extent to which 
actors will be better off by exchanging with each other rather than with their 
alternatives. 

To illustrate this concept, consider Configuration 1 again, assuming that the 
actors bargain over 240 units of profit. In order for actors / and / to have the 
incentive to participate in the exchange, both actors must receive 80 units of 
profit each. This leaves 240 - 80 - 80 = 80 units of profit to distribute between 
them through bargaining. Because the value of surplus is 80 units over and above 
the 160 units received from the outside options, the total power in this dyad is 
5. In contrast, if the outside options for both actors are only 60 units of profit, as 
illustrated in Configuration 3, the actors will distribute 240 - 60 - 60 — 120 units 
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of exchange surplus through bargaining. Because the surplus in this case is 120 
units of profit over and above the 120 units they jointly receive from their outside 
options, the total power in this dyad is 1. 

Relative and total power can vary independently of each other. Two dyads 
can have equal total power, but differ in their relative power, as illustrated 
in Configurations 3 and 4 in Figure 1. Similarly, two dyads can have equal 
relative power, but differ in their total power, as illustrated in Configurations 
2 and 4 in Figure 1. 


Profit Rates 


The concepts of relative power and total power allow us to draw a conceptual 
distinction between the rewards that / and / receive with certainty and those 
that they receive only when they exchange with each other rather than with 
their alternatives. Because actors obtain the value of their outside options if 
they agree to exchange with each other, but also when they do not, relative 
power characterizes rewards that actors / and/ are certain to obtain. Total power 
characterizes the exchange surplus that the actors can receive over and above 
their outside options. This surplus can be obtained only if the actors agree to 
exchange with each other. 

Thus, if the two actors agree to exchange with each other with frequency f, 
the expected surplus from exchanges between / and / is given by f * s. If actor 
| receives a proportion of the expected surplus given by p, then the expected 
surplus that accrues to actor / is given by p * £ * s. The expected surplus that 
accrues to/ is (7-p) *f*s. As the total profit accruing to each actor is composed of 
(a) the outside option that the actor receives regardless of whether the exchange 
between / and / occurs and (b) the share of expected surplus, then profits for 
actors / and / are given by: 


T; cat p*f*s 


Ema EE 


Exchange Frequency 


The profit equations contain two unknown variables, the exchange frequency 
and the proportions of expected surplus that accrue to the actors. One of the 
fundamental insights of social exchange theory is that relative power and total 
power determine the dynamics of bargaining between actors / and j. These 
dynamics allow us to predict both the frequency with which actors / and j agree 
to exchange with each other and the division of the surplus between the actors, 
should they agree to exchange. As a consequence, we can derive predictions 
regarding the relative magnitudes of the actors’ rewards under different 
configurations of relative and total power. 

Dyads characterized by high total power are likely to exchange more frequently 
than those characterized by low total power (Granovetter 1985: Uzzi 1996). This 
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is because, in high-total-power dyads, actors can gain much more profit by 
exchanging with each other than with their next best alternatives (Lawler and 
Yoon 1993, 1996). On the other hand, actors in power-imbalanced dyads will 
exchange with each other less frequently than those in dyads in which relative 
power is balanced. Two mechanisms are responsible for this effect. First, in 
order to obtain a greater share of rewards from the exchange, the more powerful 
actor will withhold the exchange from the less powerful actor (Bacharach and 
Lawler 1981; Hegtvedt and Cook 1987; Skvoretz and Willer 1993). Second, when 
relative power is imbalanced, issues of legitimacy and fairness concerning the 
distribution of rewards from the exchange emerge (Lawler and Yoon 1996). 
Specifically, the less powerful actor may threaten not to enter into an exchange in 
order to force the more powerful actor to agree on a more equitable distribution 
of surplus, while the more powerful actor may seek to resist such threats. As a 
consequence, both the more powerful and the less powerful actors engage in 
a higher frequency of confrontational behaviors, such as the tendency to make 
more demands of, make fewer concessions to, and use coercive tactics against, 
the potential exchange partner. Together, such confrontational behaviors reduce 
exchange frequency, which consequently reduces the expected surplus. 

The higher incidence of exchange failures in power-imbalanced dyads has 
been documented in diverse settings and across levels of analysis. Social 
exchange studies of dyadic exchanges provide support for such detrimental 
outcomes in experimental settings (Lawler and Bacharach 1987; Lawler and 
Yoon 1993, 1996). Similar results have been obtained when individuals operated 
in experimental groups, where power imbalance has been shown to induce 
behaviors that decrease the size of exchange surplus and produce significantly 
lower individual and group outcomes (Mannix and White 1992; Mannix 1993). 
The mechanisms that lead to lower exchange frequency in experimental settings 
have also been documented outside the laboratory. For example, Ebenbach and 
Keltner (1998) have shown that the tendency of the power-advantaged party to 
display lower judgmental accuracy about the exchange counterpart also occurs 
in natural settings. In organizational settings, power imbalance has also been 
shown to lead to less frequent exchange. For example, McAlister, Bazerman and 
Fader (1986) document this phenomenon in negotiations between suppliers and 
distributors in marketing channels. Finally, power imbalance has been shown to 
trigger behaviors that decrease firm performance, including slow growth and low 
profitability (Eisenhardt and Bourgeois 1999). 

The extent to which issues of legitimacy and fairness emerge in the exchange 
critically depends on the total power in the dyad. Under conditions of low total 
power, actors bargain over a small exchange surplus. Thus, the distribution of 
the surplus has a minor impact on the realized profit of the actors, who are 
therefore unlikely to question the legitimacy of surplus division. Also, under 
these conditions, any attempts by the power-disadvantaged actor to threaten 
the power-advantaged with withdrawal from the exchange are unlikely to have 
an effect because the more powerful actor has a very attractive outside option 
and can use it at very low cost. Thus, under conditions of low total power, the 
effect of power imbalance on exchange frequency should be relatively weak. In 
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contrast, when total power is high, actors bargain over a significant exchange 
surplus, and therefore the agreement over the distribution of the surplus has 
great impact on the actors' realized profit. This is particularly important for the 
less powerful actor, whose threats to withhold the exchange are more likely to 
be effective under conditions of high total power. Under these circumstances, 
a failure to exchange would be costly to the power-advantaged actor. Although 
withholding the exchange is even more costly to the power-disadvantaged actor, 
doing so signals the less powerful actor's determination to impose costs on the 
more powerful actor in order to get a more equal distribution of the rewards 
(Lawler and Bacharach 1987). For these reasons, an increase in relative power 
has a larger negative impact on exchange frequency under conditions of high 
total power than under conditions of low total power. 

The foregoing discussion suggests three main relationships linking relative 
power and total power to the frequency of exchange between actors / and j. 
We summarize these relationships in Figure 2, using hypothetical exchange 
frequencies consistent with the theoretical model advanced here. First, actors 
in high-total-power dyads will exchange more frequently than actors in low- 
total-power dyads. This implies that actors in Configuration 3 will exchange 
more frequently than actors in Configuration 1, and actors in Configuration 4 
will exchange more frequently than actors in Configuration 2. Second, actors 
in power-imbalanced dyads exchange less frequently than actors in power- ` 
balanced dyads. Thus, actors in Configuration 1 will exchange more frequently 
than actors in Configuration 2, and actors in Configuration 3 will exchange more 
frequently than actors in Configuration 4. Third, the negative effect of an increase 
in relative power on the frequency of exchange will be much higher for dyads 
with high total power than for those with low total power. Thus, the difference 


Figure 2. Relative Power and Total Power: Exchange Frequency and Expected Surplus 
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in the frequency of exchange between Configuration 4 and Configuration 3 is 
greater than the difference in the frequency of exchange between Configuration 
2 and Configuration 1. 


Expected Surplus 


The differences in the frequency of exchange across the four different conditions 
have important implications for the expected value of surplus that the actors 
realize from the exchange. Consider, for example, a power-balanced and a power- 
imbalanced dyad with the same total power. If actors in the unequal-power dyad 
are less likely to exchange with each other, the expected surplus in the dyad will 
be lower than in the power-balanced dyad. Thus, even though the potential to 
obtain surplus from the exchange is the same in both dyads, the expected surplus 
will be different due to the different frequencies with which the actors exchange. 
For example, in Configuration 1 in Figure 2, where the surplus is 80 units of profit, 
and the actors exchange with a hypothetical frequency of .5, the expected surplus 
is 40. In Configuration 2, where actors also obtain 80 units of profit over and 
above their outside options for every completed exchange, but exchange with 
a hypothetical frequency of 0.3, the expected surplus is only 24. The change in 
relative power has, therefore, decreased the expected surplus by 16. 

Furthermore, the magnitude of the difference in expected surplus between a 
power-balanced and power-imbalanced dyad increases with total power. This is 
a direct consequence of our assertion that an increase in relative power under 
conditions of high total power should lead to a greater decrease in the frequency of 
exchange than under conditions of low total power. For example, in Configuration 
3, actors obtain 120 units of surplus for every completed exchange and exchange 
with frequency 0.95, so the expected surplus is 114. In Configuration 4, where 
actors exchange with frequency 0.6, the expected surplus is 72. In this case, 
the change in relative power has decreased the expected surplus by 42. This 
decline in expected surplus is much larger than the decline that would occur 
under conditions of low total power. Thus, 


Hypothesis 1: /n an exchange dyad, an increase in relative 
power leads to a greater reduction in the magnitude of 
expected surplus under conditions of high total power than 
under conditions of low total power. 


Surplus Distribution 


The expected surplus from the exchange is the amount that actors are likely 
to obtain jointly over and above the sum of the outside options. To determine 
how different levels of relative and total power affect actors' total rewards, we 
need to examine how this surplus is divided between the exchange partners. 
Sociological research has provided a number of theories that predict the allocation 
of exchange surplus. The most prominent among these are equidependence 
theory (Cook and Yamagishi 1992) and exchange resistance theory (Skvoretz and 
Willer 1993; Lovaglia, Skvoretz, Willer and Markovsky 1995). Both trace their roots 
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to equiresistance theory (Willer 1981; Heckathorn 1983), which assumes that 
negotiating actors have profit expectations, termed best hope outcomes. Actors 
will never resist exchanges in which they obtain their best hope outcome. They 
will be increasingly resistant to lower profits from the exchange, with the value 
of the next best alternative exchange moderating that resistance. The theory 
predicts that actors will divide the surplus at the point where they are equally 
resistant to the exchange. 

Beyond these similarities, equidependence theory and exchange resistance 
theory make different assumptions on how best hope outcomes are determined, 
which in turn leads them to make different predictions regarding surplus 
allocation. Equidependence theory (Cook and Yamagishi 1992) assumes that 
both actors' best hope outcomes are always equal. Given this assumption, the 
theory predicts that, in exchanging with each other, the actors always get the 
value of their outside option and then split the surplus in half, independently of 
their relative power. In contrast, exchange resistance theory suggests that actors’ 
best hopes are related to the dyad's power configuration (Skvoretz and Willer 
1993; Lovaglia, Skvoretz, Willer and Markovsky 1995). Underlying this claim is an 
argument that the experience of exclusion is likely to reduce the less powerful 
actors best hope, because "the only course of action sure to get excluded 
actors ‘back in the game’ is raising their offers in hopes of providing partners 
with their best offers." (Thye, Lovaglia and Markovsky 1997:1036) Consequently, 
the less powerful actor is more willing to offer the more powerful actor a greater 
share of the surplus. The existing evidence supports the prediction of exchange 
resistance theory (Skvoretz and Willer 1993), while equidependence theory has 
received only limited empirical support. If we follow the predictions of exchange 
resistance theory, then actors should split the exchange surplus equally, when 
relative power in the dyad is balanced. However, in power-imbalanced dyads, 
the less powerful actor should develop lower aspirations with regard to surplus 
distribution, which would allow the more powerful actor to receive a greater 
share of the exchange surplus.' 

Total power, unlike relative power, does not have a direct effect on the 
distribution of rewards. However, as implied by our earlier argument, total power 
reduces the extent to which relative power leads to unequal rewards in the 
dyad. This is because total power allows the less powerful actor to withhold the 
exchange to counteract the claims of the more powerful actor. Under conditions 
of low total power, such a threat is unlikely to have an effect. However, when total 
power is high, the less powerful actor's threat to withhold the exchange is more 
likely to be effective, because a failure to exchange would be costly to the power- 
advantaged actor. Because the less powerful actor can now impose significant 
costs on the more powerful actor, the less powerful actor does not scale back 
its best-hope outcome. Thus, if the actors manage to come to an agreement, the 
more powerful actor will not receive as great a proportion of the rewards from 
the exchange under conditions of high total power as he would under conditions 
of low total power. Hence, 


Hypothesis 2: /n an exchange dyad, an increase in relative 
power yields a smaller increase in the share of expected 
surplus for the more powerful actor under conditions of high 
total power than under conditions of low total power. 
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When More Power Makes Actors Worse Off 


The foregoing discussion suggests that relative power has three effects on 
the rewards that accrue to the more powerful actor. First, the more powerful 
actor receives a higher outside option than he would in a power-balanced dyad. 
Second, the magnitude of expected surplus available to the actors in the dyad 
declines as compared to a power-balanced dyad. Third, the more powerful actor 
receives a greater share of the expected surplus than he would in a power- 
balanced dyad. To the extent that the higher outside option and the greater share 
of surplus acquired are sufficiently large to offset the negative effects of reduced 
surplus, the more powerful actor will be better off than he would be in a power- 
balanced dyad.? This is illustrated in Figure 3 by a shift from Configuration 1 to 
Configuration 2, where we assume that the surplus is allocated in proportion to 
relative power. 

However, under conditions of high total power, increases in relative power are 
less likely to result in this outcome. Two factors are responsible for this. First, an 
increase in relative power under conditions of high total power leads to a much 
larger decline in expected surplus. Second, increases in relative power lead to 
a smaller increase in the share of exchange surplus when total power is high. 
Thus, at a certain level of total power, the greater share of surplus acquired will 
be too small to offset the negative effects of reduced surplus, and the more 
powerful actor will be worse off than he would be in a power-balanced dyad. This 
is illustrated by a shift from Configuration 3 to Configuration 4 in Figure 3, which 
assumes that the expected surplus is distributed more equitably than suggested 
by relative power? 


Figure 3. Relative Power and Total Power: Expected Rewards 
Low total power (.5) High total power (1) 
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Method 
Setting 


We have developed theoretical predictions linking relative power and total power 
to expected rewards for a dyad. Such predictions are usually tested in laboratory 
experiments where the experimenter can manipulate relative and total power 
by setting the pool of rewards and the actors' outside options (Lawler and Yoon 
1996). Experimental subjects, usually college students, are then allocated to 
different dyads and asked to bargain with others (Molm 1987). The rewards 
accruing to every subject are recorded and then averaged over all subjects in a 
particular dyad type. These averages are then compared across different types 
of dyads to test hypotheses linking structural positions to earnings (Skvoretz and 
Willer 1993). 

We depart from the traditional approach and test our hypotheses using non- 
experimental data on profits from economic exchanges of goods and services 
between businesses in the American economy. Despite the difference in research 
setting, the dyadic power-dependence framework we adopt here can be fruitfully 
applied in this context. Consider, for example, economic exchanges of goods 
and services between two business units in different industries. If we know how 
much the two businesses bargain over and what their outside options are, it is 
possible to characterize their dyadic relationship in terms of relative power and 
total power. By comparing the average profits obtained in dyads characterized by 
a particular combination of relative power and total power to the average profits 
realized in dyads characterized by a different combination of these variables, we 
could test the hypotheses put forward in this paper. However, as evidenced by 
the dearth of non-experimental studies in power-dependence tradition, specifying 
the reward pool and actors' outside options necessary to construct measures of 
relative power and total power among businesses in an economic system is very 
arduous. To solve this problem, we draw on existing network theory and derive 
proxies for the ratio of the reward pool to the dependent business unit's outside 
option. This ratio captures the dependence of one business unit on another and 
can be used to construct measures of relative power and total power for a dyad 
of business units. 


Dependence 


Existing network theory suggests that in a dyadic relationship between a business 
unit in industry / and a business unit in industry /, the outside option of the 
business unit in industry / will be comparatively less attractive than the reward 
pool when two conditions are simultaneously met: (a) industry j is dominated 
by a few large business units and (b) the business unit in industry / has to sell 
predominantly to industry / (Burt 1982). These conditions correspond to the value 
of the outcomes or rewards controlled by actor / and the availability of outcomes 
or rewards from /'s alternative sources, which, in Emerson's power-dependence 
framework, define the power capability of actor / in relation to actor /. 
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To formalize this notion, we adopt Burt's (1980) seminal approach to capturing 
business unit dependence using the input-output representation of an economy 
(Leontief 1966). This representation depicts the total dollar value of goods sold 
by industry / to industry / in one year (z). Business units in industry / will be 
constrained in their exchange with business units in industry / if a large proportion 
of their sales or purchases needs to occur with that industry. Formally, this 
dependence on purchases, D and dependence on sales, Sin iS: 





and Sj = 


Z ii Z; 
Py = — 
d Z; 25 Zig 
q q 


To incorporate the second factor - the scarcity of alternative exchange partners in 
industry / — we also follow Burt (1980) and use a measure of oligopoly in industry /, 
given by O, Because a business unit's outside option is significantly lower when 
a business unit has to exchange mostly with one industry and that industry has 
only a few exchange partners, we multiply the measures of purchase and sale 
dependence by the measure of oligopoly. Thus, our measure of the dependence 
of a business unit in industry / on a business unit in industry /, is:* 


DM SS =(p; +s; ) O, 


Relative Power and Total Power 


The measure of dependence provides us with a proxy for the magnitude of 
outside options of business unit / relative to the reward pool in any dyadic 
relation between business units in industry / and business units in industry /. 
An equivalent measure can be developed as a proxy for the relative magnitude 
of outside options for business units in industry /. Joint consideration of these 
measures allows us to characterize any dyad of business units in industries / and / 
in terms of relative power and total power. For example, an exchange relationship 
between a business unit in the forestry products industry and a business unit in 
the wood products industry, shown in Figure 4, illustrates a power-imbalanced 
relationship. In this relationship, the business unit in the wood products industry 
is power advantaged, as the constraint that this business unit puts on the forestry 
unit is much larger than the constraint that the forestry unit puts on the wood 
products business unit. 

The two measures of dependence can also be used to capture total power 
between any two business units (Mizruchi 1992). In our context, high total power 
can be exemplified by an exchange relationship between a business unit in 
the paper industry and a business unit in a paperboard industry, illustrated in 
Figure 5. In this relationship, relative power is balanced, but both business units 
significantly constrain each other. Certain exchanges between business units in 
the American economy will be characterized by both relative power and high 
levels of total power. This is the case, for example, in the relationship between 
automobile producers and car chassis manufacturers, with automobile producers 
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Figure 4. Relative Power between Business Units 
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being substantially more powerful actors in the exchange. The same applies to 
the exchange relationship between business units in the apparel industry and 
business units in the fabric manufacturing industry, with business units in the 
apparel industry wielding more power. 


Dependent Variable 


In choosing our dependent variable, we follow Burt's (1982) classic analysis 
of the structural determinants of the average profitability of business units in 
American industries. Burt used the price-cost margin — defined as the ratio of - 
the net income of business units in the industry to the value of sales of the 
business units in the industry — to measure the differences in average profitability 
across business units. The net income is defined as the difference between the 
value of sales of all business units in an industry and the sum of all production 
costs including labor, materials, supplies, fuel, electric energy, cost of resales, 
and contract work done by others (Collins and Preston 1968). Thus, the price 
cost margin is: 
Net Income, 
POM a oe ae 
Sales, 

The use of this dependent variable to test our hypotheses presents three 
potential problems. First, the measure of price-cost margin obscures important 
differences in profitability within a particular industry. Although these differences 


Figure 5. Total Power between Business Units 
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are interesting, we cannot shed light on them as our measures of dependence 
capture the minimal level of dependence common to all business units in the 
industry. Consequently, the focus on the average industry price-cost margin is 
more appropriate. This approach is also consistent with previous studies that 
have tested power dependence theories in the lab. These studies usually assign 
subjects to a number of dyads characterized by the same levels of relative power 
and total power. Subsequently, they examine rewards obtained by the actors 
averaged across all the dyads of the same characteristics and ignoring between- 
dyad differences. In our case, all dyads of business units in two different industries 
are characterized by the same levels of relative power and total power. Much 
like the lab studies, we then examine rewards obtained by the business units 
averaged across all the dyads of the same characteristics and ignoring between- 
dyad differences. 

Second, the price-cost margin does not capture the average profits of 
business units in industry / realized in exchanges with business units in industry 
J. Instead, the measure captures the weighted average of profits that business 
units in industry / realized in exchanges with all industries with which industry 
/ exchanges. The weights are given by the percentage of inputs and outputs 
exchanged between business units in industry / and every other industry. 
Consequently, there is a mismatch between the dependent variable (defined at 
the level of business units in a particular industry) and the measures of relative 
power and total power (defined at the level of business units in two industries). 
Prior research has addressed this problem by adding the dyadic dependence 
scores across all relations in which an actor is involved to derive a summary 
measure for that actor which can then be compared to the profitability measure 
(Burt 1982). Following the same approach, we add the dyadic relative power and 
total power scores across all industries with which business units in industry / 
exchange to construct a measure of relative power and total power for business 
units in industry /. 

However, this approach ignores hierarchy (Burt 1992:70-71): the distribution of 
relative power and total power across the dyadic relationships. Hierarchy implies 
that a business unit facing, say, a total power of 0.2 in two separate transactions 
is assumed to experience the same effect of total power as a business unit facing 
total power of 0.4 in a single transaction. Although we cannot directly test this 
assumption, we test it indirectly with analyses of a sub-sample of industries in 
which business units face most of their relative power or total power in one or 
two exchanges. In these cases, the measures of relative power and total power 
approximate dyadic measures. Thus, the observed profit rates can be more clearly 
attributed to specific dyadic exchanges. If the results from the analysis using the 
restricted sample mirror those from the entire sample, we can be more confident 
that the use of the weighted average of profits of business units in industry / 
realized in exchanges with all industries with which industry / exchanges does 
not pose significant problems. 

The final issue arises because our dependent variable cannot distinguish 
directly between changes in the expected surplus and changes in the proportion 
of that surplus that accrues to each actor. Consequently, we cannot examine 
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directly whether dyadic exchange frequency and surplus allocation change 
with relative power and total power as suggested by our hypotheses. However, 
because exchange frequencies and surplus allocations manifest themselves 
directly in profit rates, it is possible to recover these relationships using an 
econometric model which uses only profit data as the dependent variable. To 
do so, we rely on our earlier discussion suggesting that an increase in relative 
power has two effects: (a) it reduces the frequency of exchange between actors 
i and j and (b) it allows the more powerful actor to obtain a greater share of the 
benefits. Reducing exchange frequency affects both actors in the same way, by 
reducing the amount of surplus available from the exchange. Thus, starting from 
the baseline condition in which both actors are equally powerful and receive the 
same amount of surplus, a reduction in expected surplus will reduce both actors' 
surplus equally. On the other hand, the more powerful actor obtains a greater 
share of surplus in the exchanges that actually occur. Here, the gains of the more 
powerful actor are exactly the losses of the less powerful actor. 

To capture the changes in exchange surplus, we use a power imbalance 
variable, defined as the absolute value of relative power. As the value of this 
variable is the same for both actors in a dyad, regardless of who holds the power 
in the relationship, the measure of power imbalance is ideally suited to capture 
the extent of the identical losses of exchange surplus of both actors. On the other 
hand, an increase in relative power imposes symmetrical, but opposite effects on . 
the actors' abilities to appropriate the exchange surplus. Thus, the re/ative power 
variable captures the differential ability to capture surplus. 

Totest ourtwo hypotheses, we interact each of the two variables, power imbalance 
and relative power, with total power. Thus, the final model we estimate is: 


z, = P Relative Power, + B,Total Power, + B,Power Imbalance, + 
+f,Power Imbalance, * Total Power, - B; Relative Power, * Total Power, 


In Hypothesis 1, we predict that the presence of power inequalities under 
conditions of high total power leads to increasingly larger reductions in expected 
surplus. Thus, we expect that an increase in power imbalance under conditions 
of high total power will have a greater negative effect on price-cost margins (8, 
« 0). In Hypothesis 2, we predict that, under conditions of high total power, the 
effect of relative power on the power-holder's ability to appropriate the exchange 
surplus will be weaker. Consequently, we expect that an increase in relative power 
under conditions of high total power will have a smaller effect on profit (8, « 0). 


Data and Variables 


The data for the analysis come from the Survey of Current Business published 
by the U.S. Department of Commerce. We construct the dependent variable, 
price-cost margin, for 77 industries in 1977, 1982, 1987 and 1992. To ensure the 
normality of the dependent variable, we use the log of the price-cost margin in 
our regressions (also see Burt 1982). 

To construct the independent variables, we convert the total dollar values of 
flows between industries into proportions of sales and purchases of all dyads of 
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industries. We use these proportions, together with the four-firm concentration 
ratios in industry /, as the measure of O to construct Burt's measure of the 
dependence of business units in industry / on business units in industry /. The 
concentration data are not published for input-output table markets and they are not 
available from a single source. To ensure consistency with Burt's (1982) analyses 
on constraint relations, we use the concentration data compiled by Burt. 

We then convert the directional measures of dependence into dyadic 
measures of relative and total power. To construct the measure of relative power 
of business units in industry / over business units in industry /, we take the log of 
the ratio of the dependence of business units in industry / on business units in 
industry / and the dependence of business units in industry / on business units in 
industry /.? The measure of power imbalance is the absolute value of the relative 
power measure. To construct the measure of the total power of business units in 
industries / and /, we take the log of the product of the dependence of business 
units in industry / on business units in industry / and the dependence of business 
units in industry / on business units in industry /:? 


Relative Power, ,, = log D 
Ce 


ji 





Power Imbalance, ; = | Relative Power 


Total Power, ,, — log (Gss noo ) 


Finally, since business units are involved in multiple dyadic exchanges, we 
follow the assumption used by Burt (1982) and sum the measures of relative 
power, power imbalance, and total power across all the dyadic exchanges in 
which business units in a particular industry are involved. We also construct two 
interactions: one between total power and relative power and one between total 
power and power imbalance. Thus, the final measures used in the analysis are 
defined as follows: 


Relative Power, = 2» (Relative Power, ,,) 
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Power Imbalance, = D (Power Imbalance, ) 
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Total Power, = Zä (Total Power, ; ) 
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Power Imbalance, *Total Power, = DA (Power Imbalance, , ,*Total Power, , ; ) 
R 


1026 ° Social Forces Volume 85, Number 2 ° December 2006 


Consistent with Burt (1980, 1982, 1992), we also introduçe three control 
variables: a dummy variable for non-manufacturing industries, the concentration 
ratio in industry /, and the percentage of imports in the industry's total sales. 
Table 1 provides summary statistics and correlations for all variables used in the 
models. 


Results 


Table 2 summarizes the findings of regressions with robust standard errors with 
industry i's price-cost margin as the dependent variable. Model | is the baseline 
model with three control variables and relative power. Consistent with Burt's 
results (Burt 1982), the control variables are statistically significant and in the 
expected direction. Industries with a low concentration ratio suffer lower profits, 
non-manufacturing industries enjoy higher profits, and industries in which 
imports make up a significant part of the total industry output have lower profits. 
The estimated coefficient of relative power, while in the expected direction, is 
not statistically significant. Although this may seem surprising in the light of 
our predictions, our theory predicts that relative power will only contribute to 
rewards if accompanied by low total power. In Model Il, we introduce the power 
imbalance measure, which is also insignificant. Model Ill includes the total power 
variable; the coefficient estimate is negative and significant. Once total power is. 
introduced, the coefficient of relative power increases and becomes significant 
at the 10 percent level. 

In Model IV, we introduce the interaction between relative power and total 
power. The coefficient estimate is negative and is significant at the 1 percent 
level, consistent with Hypothesis 2. It is also noteworthy that the introduction 
of this interaction substantially increases the size of the estimate coefficient for 
relative power and makes it statistically significant at the 1 percent level. Such 
an increase in coefficient size and statistical significance yields further credence 
to the suggestion that the effect of relative power will vary depending on the 
level of total power. As indicated by the main effect of relative power in Model IV, 
when total power is zero, the relationship between relative power and an actor's 
ability to appropriate exchange surplus is positive. Yet, as total power increases, 
this relationship is significantly attenuated, as indicated by the negative sign 
on the interaction term between relative power and total power in Model IV. 
In Model V we introduce the interaction between power imbalance and total 
power. The coefficient is negative and statistically significant. This is consistent 
with Hypothesis 1, according to which the presence of power imbalance under 
conditions of high total power reduces the benefits accruing to both actors. 

To aid the interpretation of these results, Figure 6 maps on a plane the 
relationship between profit margins and relative power and total power, using 
statistically significant estimates from Model V. The graph shows that an equivalent 
increase in relative power has a vastly different effect on price-cost margin 
depending on total power. At the back of the plane, where total power is low, an 
increase in the relative power of business units in industry / over business units 
in other industries increases the price-cost margin linearly. However, at the front 
of the plane, where total power is high, an increase in relative power of business 
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units in industry / has a more complex relationship to profit margin. Here, an 
increase in relative power increases the price-cost margin only if the businesses 
in industry / were disadvantaged on power. This is because an increase in power 
of the power-disadvantaged actor actually decreases the relative power in the 
dyad. To the extent that power balance in high-total-power exchanges promotes 
exchange frequency and allows actors to obtain greater exchange surplus, this 
should increase the profit realized by the actor. In contrast, an increase in the 
relative power of business units in industry / when they are already advantaged on 
power leads to a decrease of the price-cost margin. This is because an increase 
in power of the power-advantaged actor increases the power imbalance in the 
dyad. Under conditions of high total power, this increase leads to a substantial 
decline in exchange frequency so that actors obtain a smaller surplus. Thus, even 
though the increase in power allows the power-advantaged actor to obtain a 
greater share of the expected surplus, the substantial decrease in the size of the 
surplus leads the more powerful actor to obtain a smaller profit. 

Finally, we also undertake a number of sensitivity analyses on Model V to 
ensure that our findings are stable across different model specifications and data 
samples. We report the results of two such analyses in Models VI, VII and VIII. 
In Model VI, we introduce the constraint variable to check whether the results 


Figure 6. Effect of Relative Power and Total Power on Industry Price-Cost Margin 
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presented in Model V are stable with respect to earlier network models that 
sought to explain differences in industrial profitability (Burt 1983). The constraint 
variable is insignificant and no parameter estimates from Model V are materially 
changed. In Model VII, we restrict our sample to industries in which the largest 
power imbalance in any dyadic relationship in which business units in industry / 
are engaged makes up at least 50 percent of the summarized power imbalance 
score for business units in industry /. The objective of this exercise is to make sure 
that the results of Model V are not an artifact of summing across a large number 
of dyadic transactions. The restriction severely reduces the sample size down to 
92. However, it does not affect the main results. The fact that these results mirror 
those from the unrestricted sample makes us more confident that the findings 
presented in Model V are not an artifact of summing our independent variables 
across numerous dyadic transactions. In Model VIII, we repeat the exercise but 
restrict our sample to industries in which the two largest dyadic power imbalance 
scores make up 90 percent of the summarized power imbalance score. As before, 
the results are not affected by the sample restriction.'? 


Discussion 


In this paper we proposed a set of conditions under which an increase in relative 
power decreases the benefits that accrue to the power-advantaged actor in an 
exchange dyad. Specifically, we hypothesized that as exchange partners grow ` 
mutually dependent, increases in relative power progressively reduce the ability 
to garner exchange benefits, to the point that the more powerful actor's rewards 
may actually decrease after a power increase. According to our argument, 
changes in exchange frequency, expected surplus and surplus distribution are 
_ jointly responsible for making the power-advantaged actor worse off after an 
increase in relative power under conditions of high total power. As for exchange 
frequency, we built on existing exchange-theoretic research showing that actors in 
high-total-power dyads exchange more frequently than in low-total-power dyads, 
while equal-power dyads exchange more frequently than unequal-power dyads 
(Lawler and Yoon 1993, 1996). Furthermore, we argued that an increase in relative 
power decreases the frequency of exchange more under conditions of high total 
power, because fairness concerns complicate the bargaining agenda. The higher 
failure rate induced by an increase in relative power leads to greater losses of 
expected surplus under conditions of high total power. As for surplus distribution 
in successful exchanges, we posited that the power-advantaged actor does not 
receive as great a proportion of exchange rewards under conditions of high total 
power when the power-disadvantaged actor can pose a credible enough threat 
to induce the power-advantaged actor to accept a more equitable allocation 
of exchange benefits. We proposed two hypotheses from these joint effects. 
First, we hypothesized that an increase in relative power would lead to a greater 
reduction in the magnitude of expected surplus under conditions of high total 
power than under conditions of low total power. Second, we predicted that an 
increase in relative power gives the power-advantaged actor a smaller increase in 
the share of surplus under conditions of high total power than under conditions of 
low total power. A study of price-cost margins of American industries supported 
these hypotheses. 
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Our findings indicate that an equivalent increase in relative power has vastly 
different effects on industry price-cost margins depending on total power, 
with price-cost margins decreasing when relative power increases occur in the 
context of high levels of mutual dependence among business units. In addition 
to the theoretical developments and substantive results presented here, this 
paper illuminates the benefits of tight integration of exchange theories, which 
are tested largely in lab settings with network theories that are tested using 
macro level phenomena (Burt 1983, 1992). Prior analyses of the link between 
Industry performance and dependence did not leverage the exchange-theoretic 
distinction between relative power and total power. By bringing the analysis of joint 
configurations of relative and total power to this area of inquiry, this investigation 
significantly increases the explanatory power of traditional models of inequality 
among industries in an economic system. This increase in explained variance 
brings us closer to realizing the potential of the exchange-theoretic framework to 
illuminate macro-level behavior and outcomes. At the same time, documenting 
the macro-level manifestation of exchange processes that have traditionally been 
investigated in the laboratory at the interpersonal level of analysis substantially 
enhances the generalizability of exchange-theoretic accounts of power and 
inequality. As an inherently cross-level theory, the power-dependence framework 
has indeed spurred both micro and macro investigations. However, few studies 
have offered a direct account of the relevance of micro exchange processes to 
macro behavior. The results of the present work help position exchange theory 
and structural sociology to bridge micro and macro levels of analysis and to apply 
the power-dependence framework across a wide variety of exchange settings. 

Despite its contributions, this paper is not without shortcomings that need to 
be addressed in future research. Testing predictions in the context of transaction 
patterns in the American economy prevents full control over variables of interest 
that experimental studies are best suited to provide. Instead of using direct 
measures of relative power and total power, we utilize proxies. Similarly, we only 
use an aggregate measure of profit instead of data on individual exchanges, 
exchange frequencies and surplus allocations. Future research needs to test the 
extent to which the assumptions made in this study are actually valid. 

Future research can also enhance the theoretical model by elaborating on the 
impact of different forms of power on the allocation of exchange rewards. Lawler 
and Bacharach (1987) highlighted the distinction between dependence power and 
coercive power, where coercive power comes from the ability to impose damage 
on the exchange partner above and beyond the opportunity cost of withdrawing 
from the exchange. Similarly, a conceptual distinction can be drawn between 
reward power and punishment power, which are related to an actor's capacity to 
produce positive and negative outcomes, respectively (Molm 1997). Because we 
do not discriminate between these forms of power, we cannot fully account for 
contexts in which the bases for the exchange relationship are primarily punitive 
in nature. While one can subsume all forms of power under the same overarching 
construct, the existing evidence suggests that this approach may oversimplify a 
rather complex theoretical scenario in which different forms of power are linked 
to exchange outcomes through distinct mechanisms. 
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Another potentially fruitful line of future research could extend the findings 
of the present study beyond dyads to networks of exchange relations. Indeed, 
Friedkin (1995) found that, in Kite networks, unbalanced exchanges lower an 
actor's net receipts as the disadvantaged trading partners turn elsewhere for 
exchange. While Friedkin's analysis focuses exclusively on configurations of 
relative power — to the exclusion of total power — and on a very specific network 
form, its findings are largely consistent with ours and suggest that the ideas 
we advance in the context of dyads should also be pursued in the context of 
exchange networks. 

Overall, the results of this study challenge the conventional wisdom that greater 
power yields greater rewards for the power-advantaged actor. The association 
between power and rewards is strictly contingent on the nature of the relationship 
linking exchange partners. In highly interdependent relationships, the heavy 
costs of exchange failures induced by power imbalance will pressure exchange 
partners to equalize power. This tendency toward balance in mutually dependent 
relationships may manifest itself as clearly in interpersonal relationships as in 
inter-organizational ties. In this sense, the results of this study of industry profit 
point to a general social process: the closer the social bond between two actors, 
the more power asymmetries will be incongruous to that relationship. Explicit 
recognition of the interactive effects of mutual dependence and power imbalance 
enhances exchange theoretic explanations of structural sources of inequality. 


Notes 


1. While focused primarily on predicting exchange frequency, Friedkin's (1986, 
1995) expected value theory also produces predictions concerning surplus 
allocation. In contrast with equidependence and exchange resistance 
theories, which assume that actors are motivated by their best hope 
outcomes (moderated by the threat of not concluding the exchange), 
expected value theory assumes that actors develop aspirations depending 
on ego's dependency on alter. If ego's dependency on alter is low, ego's 
aspirations are high, and vice versa. This process leads the more powerful 
actor to appropriate a greater share of the surplus. Thus, even though Friedkin 
begins with a different assumption regarding the distribution of surplus, the 
final predictions regarding surplus distribution are directionally consistent 
with exchange resistance theory. 


2. Assuming that the surplus is allocated in proportion to relative power, rather 
than split equally, makes it more arduous to establish that power imbalance 
makes actors worse off under conditions of high total power. If the surplus 
were split equally, it would be sufficient to show that increases in relative 
power lead to increasingly greater declines in expected surplus under 
conditions of high total power (H.) to establish our result. 


3. It is important to note that a decrease in actors’ rewards following an 
increase in power imbalance under conditions of high total power (shift 
from Configuration 3 to 4) does not necessarily imply a decrease the actors’ 
rewards following an increase in total power under conditions of power 
imbalance (shift from Configuration 2 to 4). 


10. 
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In deriving the measures of dependence between business units, the use of 
industry-level data is theoretically advantageous over the use of actual data 
on transactions between these business units, as explained by Casciaro and 
Piskorski (2005:184) 


We square D and s, to ensure consistency with Burt's (1982) constraint 
measure. However, our results are equally robust to specifying the measure 
as Cp > (p, m s) O, 
For example, consider changes in the actors' rewards following increases 
in relative power illustrated by a shift from Configuration 1 to Configuration 
2 in Figure 3. First, the increase in relative power increases actor /'s outside 
option by 16 and decreases the outside option of actor/ by the same amount. 
Second, the increase in relative power reduces the frequency with which 
actors appropriate the surplus of 80 by 0.2. Thus, both actors forego 16 units 
of surplus. Since power was originally equal in this relationship, then each 
of them loses 8 units of surplus. Third, the increase in relative power allows 
the more powerful actor to obtain 0.16 more of the 24 units of surplus. This 
amounts to 4 units. Taken together, the more powerful actor gains 16 units, 
loses 8 units and gains 4 units, which amounts to 12. The less powerful actor 
loses 16, loses 8 and loses 4, which amounts to 28. Thus, both actors face 
a symmetric change in profitability due to the loss of expected exchange 
surplus. However, both actors face opposing changes in profitability due to 


changes in outside options and share of surplus. 


There are eight negative observations in the dataset; the log transformation 
is not possible for these. One strategy to deal with these observations 
is to add a constant to ensure that there are no negative observations. 
Unfortunately, such a transformation also results in a skewed dependent 
variable. Consequently, we examine our models both with and without the 
eight observations. Although, the results without the eight observations are 
statistically weaker, we report them here as they provide a more conservative 
test of the hypotheses. 


Because the measure of C., is bound between 0 and 1, we add 1 to the 
measure of constraint to ensure that the value of the denominator is always 
greater than zero (Mizruchi 1992). We experiment with a number of different 
constants, such as 0.01, to ensure that the results are not an artifact of the 
selection of the constant, and find no significant differences. For simplicity 


of exposition, we do not include the constant in the equations in the text. 


We also experiment with different functional specifications of this variable, 
such as the log of the geometric mean and the log of the sum of the 
dependence measures. We obtain similar results, but our results are most 
robust with the measure described in the text. 


Inan unreported model, we also restrictthe sample by excluding all observations 
for which the value of relative power or total power is above the 75^ percentile. 
This exclusion reduces the sample size by approximately 36 percent and 
significantly lowers both the range of the two variables and the range of 
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interactions between them. We do this to ensure that outliers in the sample 
do not drive our estimates of interaction term coefficients? The parameter 
estimates remain significant and do not change direction, suggesting that the 
results in Model V are not driven by the outliers in the sample. 
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Book Reviews 





Virginity Lost: An Intimate Portrait of First Sexual Experiences 
By Laura Carpenter 


NYU Press, 2005. 304 pages. $20 (hardcover) 


Everyone Is Not Doing It: Abstinence and Personal Identity 
By Jamie Mullaney 
University of Chicago Press, 2006. 223 pages. $50 (hardcover); $20 (paper) 


Laura Carpenter in Virginity Lost and Jamie Mullaney in Everyone Is NOT Doing 
/t both draw from in-depth interviews, historical sources, popular culture and 
sociological theory to understand how individuals use sexuality as a means 
to construct and enact their social identities in contemporary society. While 
Carpenter focuses on loss of virginity, Mullaney considers abstinence from a 
variety of behaviors, including sex, driving, technology and substances. 


Everyone Is NOT Doing It 


While scholars have long argued that individuals use behavior or "doings" to 
reveal who they are, they have generally overlooked the role of abstinence or 
“not doings” in the presentation of self. Furthermore, previous studies that 
have considered abstinence (defined as the choice to avoid acts expected by 
others) have focused on a single behavior. By examining abstinence from several 
different types of behavior, Mullaney is able to consider how abstinence operates 
as a general process. 

Mullaney's ability to generalize across different types of behavior and create 
descriptive phrases for various categories and strategies of abstainers reminds 
me of Erving Goffman's classic work. For example, Mullaney uses the term "fire 
walking" to refer to a process whereby individuals engage in acts closely related 
to the avoided act in order to prove the integrity of their abstinence. To illustrate 
this process, she uses as an example Temptation Island, a television show that 
aired in 2001. This show separated four couples on a Caribbean Island and placed 
the partners among “desirable” single individuals for a two-week period to test 
commitment to their ongoing relationships. 

Mullaney's methodological appendix led me to question the rigor of her 
qualitative methodology. She states that she used a grounded theory approach 
which she succinctly describes as an inductive process of moving from data 
to theory. Yet, she conducted the bulk of her 38 interviews in a single summer, 
using an interview guide that did not appear to change over the course of the 
interview period. Though | am not a qualitative researcher, it is my understanding 
that grounded theory approaches require more feedback between the phases of 
data collection and analysis. To offer one example of how her emerging findings 
could have informed her subsequent data collection efforts, Mullaney could have 


© The University of North Carolina Press Social Forces, Volume 85, Number 2, December 2006 


1038 * Social Forces Volume 85, Number 2 * December 2006 


altered her interview guide after learning with her final question that a sizeable 
number of respondents abstained from more than one type of behavior. Further 
undermining my confidence in her methodology, she also reveals that she began 
the study by interviewing abstaining individuals with whom she already knew 
"somewhat on a personal level" and then used their contacts to obtain a snowball 
sample of abstainers. 

Overall, | was impressed by the way Mullaney was able to cobble together a 
wealth of sources to illustrate in rich detail various forms of abstinence. Everyone 
is NOT Doing It is undoubtedly a valuable contribution to our understanding of 
abstinence and the formation of identity. 


Virginity Lost 


Very few studies have explicitly examined the meaning in loss of virginity. Generally 
speaking, these studies have focused on heterosexuals and have not extended 
beyond the 1980s. Filling an important gap in the literature on the transition to 
adulthood, Carpenter draws from interviews she conducted in the late 1990s of 
straight, gay and bisexual men and women to offer nuanced accounts of how 
young adults define and interpret the experience. 

Carpenter finds that all but a handful of the 61 individuals interviewed highlight 
one of three metaphors, viewing virginity loss as a gift, a stigma or a step in. 
the transition to adulthood. Her sophisticated sampling design and methodology 
allow her to consider how broader social factors structure interpretations of 
virginity loss. Her inclusion of young adults who came of age in two distinct 
periods reveals that younger men are more likely than older men to view virginity 
as a gift, and that younger individuals are more likely than older individuals to 
define virginity loss in same-sex inclusive ways. Her consideration of young 
adults from a variety of social backgrounds further reveals that gender, race/ 
ethnicity and religion influence interpretations of virginity loss, but individuals 
must often negotiate between different understandings due to their intersection 
of different social identities. 

Scholars and college students alike are likely to find Virginity Lost provocative, 
entertaining and edifying. | was especially impressed with how Carpenter weaved 
insights from popular culture and social exchange theory into her presentations 
of the narratives. Not only is this book likely to become a classic in the field of sex 
and gender, but it has important implications for policy. Policy makers concerned 
with implementing sex education programs most conducive to the physical and 
mental health of adolescents should, at the very least, read Chapter 7, which 
details the policy implications of the findings. 


Reviewer: Kara Joyner, Cornell University 
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The Empty Cradle of Democracy: 

Sex, Abortion, and Nationalism in Modern Greece 

By Alexandra Halkias 

Duke University Press, 2005. 413 pages. $89.95 (cloth); $24.95 (paper) 


An ambitious undertaking, Alexandra Halkias’ The Empty Cradle of Democracy 
reveals much about the entanglements of gender, sexuality and nationalism as 
she explores the practice of and discourses that surround abortion in modern 
Greece. By official estimates, Greece has one of the highest rates of abortion 
worldwide. At the same time, Greece faces a rapidly declining birth rate. This 
paradox has fueled something closely akin to a panic among state and church 
officials. Halkias sets out to examine this paradox. 

Truly, this is a book about many things - the contradictions, tensions and 
paradoxes that electrify the modern landscape of Greece and Greekness, bodies 
and reproduction, personhood and the liberal democratic state, contemporary 
femininity and the good Greek woman. Themes of sex and love, intimacy 
and loss, responsibility and blame, shame and moral transgression, race and 
patriotism are woven together as Halkias lays a series of conceptual maps that 
parse out the complex and contradictory meanings of abortion across different 
registers. Halkias has gathered an extensive range of materials for analysis: 120 
in-depth interviews with women who report having had more than two abortions: 
interviews with doctors, nursing personnel and midwives; observation of a state- 
sponsored abortion clinic as well as private medical facilities; and hundreds of 
public documents from editorials to newspaper articles on the subject. With rich 
detail and analytical acuity, Halkias threads together the different stories that 
are told about abortion, demonstrating how abortion is a field of power where 
gender subjects are produced, where ethnic anxieties are inflamed, where the 
tensions between modernity and tradition play out, and a discourse of nation 
building is fortified. In doing so, Halkias builds a persuasive case for the centrality 
of women's bodies and reproductive trajectories to nation-building efforts and 
the construction of a collective national identity. For feminist readers, this will 
be a familiar tale. For decades feminist scholars have detailed the very public, 
contentious and sometimes violent struggle for control of women's reproduction. 
Yet, Halkias provides fresh insight as she gives equal consideration to the 
specificities of one "geo-political" site and the broader currents of state-building 
projects in the contexts of "globalizing high modernity" and modern gender and 
sexual regimes. 

Halkias explores abortion at two levels: its status as Greece struggles 
to gain greater presence in the European Union and at the far more personal 
level as women navigate their intimate relationships with men. Halkias makes 
visible the interpretive frames that are brought to bear in the decision to have 
multiple abortions. The women's narratives are intense as the subtext of loss 
and longing are palpable. Yet Halkias also demonstrates how sex operates as a 
site of resistance against modernity as these women assert their own agency in 
a rapidly changing social, emotional and political context. Her analysis reveals to 


us once again that "the personal is political." 
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The chapters move easily between different materials for analysis. Early 
chapters set the historical and political stage, later chapters spotlight the interview 
material. One chapter explores the subject of birth control, uncovering a series 
of clever connections between the meaning of birth control use and nationalism. 
Halkias shows how condom use is seen as a disruption to spontaneity, freedom 
and passion, qualities understood to be central to the very sense of being Greek. 
Halkias also demonstrates how women use a narrative template deployed in 
nationalist rhetoric about foreign invasion to the body politic as they talk about 
their refusal to ingest the pill. Halkias shines in these moments when she reveals 
unlikely connections and remains grounded in the rich narrative materials she 
carefully and reflexively collected. However, there are distinct moments where 
Halkias seems to be swept away by an overly-theoretical impulse. Portions of her 
analysis are so steeped in the language of post-structuralism that it compromises 
her commitment to a reflexive and feminist mode of inquiry by creating a 
disjuncture between analytical framings and the interpretive claims and the 
actual interview material. In one such example, Halkias claims that women define 
abortion as a “natural” practice. | found this point difficult to digest since there 
was little support for such a claim in the narrative material. Yet on the whole, 
this book covers tremendous ground, building from an extensive collection of 
empirical materials. It is an impressive undertaking and skillfully done. 


Reviewer: Amy L. Best, George Mason University 


Consuming Citizenship: Children of Asian Immigrant Entrepreneurs 
By Lisa Sun-Hee Park 
Stanford University Press. 2005. 169 pages. $50 (cloth); $19.95 (paper) 


This work investigates the link between consumption and citizenship among 
Chinese-American and Korean-American adolescents and young adults 
whose parents work as small business owners. Park critiques previous work 
that simply describes the mobility patterns of immigrants and their children 
without questioning how they interpret their mobility. She argues that the more 
interesting questions to ask of Asian Americans are “why Asian Americans 
are made to feel compelled to proved their Americanness' and how Asian 
Americans display their social citizenship (or belonging)." Her definition of 
consumption is not simply about material goods but also the career choices 
that her respondents make to promote their consumptive power. She argues 
that consumption is also about the social process of consuming. These are 
provocative questions, and they provide a unique treatment of the experiences 
of children of immigrants whose family businesses defined their childhood and 
their relationship with their parents and siblings. 

Park uses a variety of interviews, focus groups and informal conversations with 
more than 88 primary respondents (adolescents and young adults of immigrant 
parents) and additional conversations with family and community members. 
In total, she interviewed more than 100 individuals. While her primary focus 
was on Asian Americans of all ethnicities, her primary data collection garnered 
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almost exclusively Chinese-American and Korean-American respondents; hence 
she focused on these two groups. She weaves together a vivid portrayal of the 
everyday lives of these young people and their common sense of obligation to 
their parents. Park uses a wide variety of previous research and theory, ranging 
from Veblen's discussion of conspicuous consumption to contemporary work 
about ethnic entrepreneurs. She also shows a clear mastery of authors who work 
in Ethnic Studies and Asian-American Studies. This is an extremely well-written 
book, but is somewhat unsatisfying analytically. 

While the proposed goal of the book is about consumption and citizenship, the 
primary empirical chapters (Chapters 4 and 5) do not address consumption at all. 
Chapter 4, titled "The Business in Children's Lives," portrays the loss of childhood 
and the conflation of having adult-like roles but being treated as perpetual children 
who must obey their parents' wishes. While this is an extremely interesting 
chapter, it does not elucidate the thesis of the book. Chapter 5, "The American 
Narrative of Asian Immigration," examines the narratives of relative upward, flat 
or downward mobility of the immigrant parents. The goal here is to provide an 
interpretive account from the children of their parents' socioeconomic mobility. 

Park does not return to the question of consumption until Chapter 6, 
"Consumption Fantasies of Upward Mobility" Here, the children describe 
their goals of fulfilling obligations to their parents through material goods and 
career goals. The ability to provide these possessions or manifest their status to 
their parents is meaningful not only because it allows children to express their 
appreciation of their parents, but because it allows parents to tell their peers 
that their sacrifices have been worthwhile. It is important to note that these 
aspirations are not just about the actual items but the meaning of these goods. 
For children who hope to buy their parents a Mercedes, this gift not only conveys 
their appreciation for their parents' hard work, but also serves as an external 
symbol of their own success as legitimate and complete Americans. The latter is 
a new and important contribution that Park makes to this literature 

The final chapter is a brief reflection on the legitimacy of immigrants as 
American citizens primarily defined by their economic contributions. Park argues 
that this results from a "conflation of the economic system of capitalism with the 
political system of democracy, with patriotism measured by economic ‘benefit. 
In other words, consumption helps to define citizenship for all, although it is not 
sufficient in providing the means to full citizenship for disadvantaged minority 
groups. Her thesis is interesting, but it is not directly linked to her empirical 
work. Do Asian Americans simply provide an example of this argument? Would 
we find similar aspirations among Mexican or Greek small business owners? 
It is unclear how Asian Americans distinctly serve the general theoretical point 
she wishes to make. The last point of this chapter is more convincing - that 
Asian Americans serve as model minorities at least partly due to their high rates 
of entrepreneurship. The perceived success of Asian Americans through their 
entrepreneurial activities serves the current political landscape that prescribes a 
"do-it-yourself" solution to problems of limited mobility. 

Overall, | enjoyed reading this book. Park is a gifted writer, and this would be a 
useful text for an undergraduate course or introductory graduate course. 


Reviewer: Grace Kao, University of Pennsylvania 


1042 ° Social Forces Volume 85, Number 2 ° December.2006 


The Politics of Attention: How Government Prioritizes Problems 
By Bryan D. Jones and Frank R. Baumgartner 
University of Chicago Press, 2005. 304 pages. $70 (cloth); $25 (paper) 


|s our democratic system of government responsive to the concerns of the 
public? Why or why not? Some say that it is responsive, evidencing the best of 
American democracy; others contend that the political system only responds 
to the wishes of the powerful. This book contributes to the debate with a 
focus on how information is used and allocated by American policymakers and 
institutions. 

The authors contend that cognition and information processing is at the core 
of stability and change in public policy. Focusing on government as the receiver 
of information, they ask how government responds with policy to incoming 
information: Why do policymakers focus on some aspects of information while 
ignoring or discounting other bits of information? The ambition of the present 
volume, which is but the latest output in a long and productive collaboration, is 
the development of a full theory about how human cognitive processing interacts 
with political institutions to produce the long periods of policy stability that are 
punctuated infrequently by change. 

The authors develop a model of individual and institutional decision making 
that starts with recognizing that there is a problem, assesses the dimensions - 
of the problem, sorts through potential solutions, and then makes the final 
policy choice. Policy actors and institutions generally maintain a status quo in 
policy because institutional processes (such as the need for supermajorities in 
Congress to move issues forward) combine with the individual cognitive factors 
outlined above to act as filters and controls. Some information is ignored while 
other information receives a great deal of attention. This process adds friction 
and makes the policymaking process inefficient in the sense that institutions do 
not respond quickly and proportionately to changes in the environment. But the 
insertion of new signals or new framings of old issues has the ability to disrupt 
this stability. A disproportionate response then occurs because after we decide 
to focus on an issue, we not only reevaluate our prior policy choice but also 
our understanding of the problem, our weighting of the relevant dimensions of 
the issue, the consideration of possible solutions, and our goals in addressing 
the issue in the first place. The authors illustrate these patterns of stability and 
change by analyses of various data sources covering more than 50 years of U.S. 
policymaking. 

While some decry this state of affairs as gridlock or thwarting the will of 
the people, the authors note that the framers of our political institutions strove 
for a balance between representation and liberty, navigating between acting 
when there is no need and failing to act when the need is clear. Moreover, 
such a process strengthens signals from outside policy subsystems as policy 
proponents know that they must transmit information in a way that will overcome 
the inertia within political subsystems and institutions. Finally, a political process 
that responds efficiently in the short run to all inputs through a pure division of 
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labor may encourage information monopolies that in the long run will constrain 
diversity and adaptability. 

This work is a commendable and important work at several levels, not least of 
which is the variety and scope of data used in support of the authors' arguments 
(which the authors have made available at www.policyagendas.org). The authors 
theorize about the broad scope of policy over five decades, and their thesis pulls 
together different theories of policy and politics. 

For these reasons, sociologists interested in political life will find this book 
engaging, but some questions could be raised. For example, is the system 
really awash with too much information as the authors contend? As one of the 
authors noted in previous research, only a few issues attract a great number of 
interest groups while many other issues are the object of attention of a small 
number of lobbyists (Baumgartner and Leech 2001). Burstein (2006) suggests 
that public policy does not completely reflect public opinion because the public 
may not have formed opinions on every issue before the Congress. Moreover, 
if information supply is endogenous and subject to manipulation, rather than 
simply exogenous as the authors' model assumes, it may be that their model of 
information processing and policy agenda setting is not as benign as the authors 
imply. In any event, this is a well-crafted book that will influence these and other 
debates over political life for years to come. 


References 
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Reviewer: John C. Scott, Cornell University 





Rethinking Homicide 
By Terance D. Miethe and Wendy C. Regoeczi 
Cambridge University Press. 2004. 320 pages. $70 (cloth); $25 (paper) 


In Rethinking Homicide, Terance Miethe and Wendy Regoeczi report on 
an ambitious and innovative study of homicide that employs data from the 
Supplementary Homicide Reports (SHR) for the United States as well as police 
narrative accounts from homicides in Miami, Los Angeles, Las Vegas and St. 
Louis. Noting that prior studies neglect the complex interaction of offender, 
victim, motive and circumstance, the authors comprehensively investigate the 
structure and process of thousands of homicides. Moreover, rather than examine 
typical questions such as “What city-level characteristics are associated with 
higher homicide rates?” or “Are the predictors of family- and stranger-perpetrated 
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homicides similar?" Miethe and Regoeczi's situation-based analytic approach 
addresses other interesting questions such as: "Which combination of offender, 
victim, motive and weapon structures are most common?” ‘Are the most common 
homicide structures in the 1990s different from the most common structures of 
earlier decades?” and “Do instrumental- and confrontation-motivated homicide 
events share similar or distinct structures?” Overall, the research described 
is creative and the depth of the analysis certainly constitutes a worthwhile 
contribution to the homicide research literature. However, the substantive focus 
and theoretical framework of the book are somewhat narrow, which may dampen 
its appeal to social scientists working outside of criminology. 

One of the major strengths of Rethinking Homicide is the innovative analytical 
strategy used by Miethe and Regoeczi to analyze literally hundreds of thousands 
of homicide events. Using Qualitative Comparative Analysis (QCA) they effectively 
investigate some of the most common myths or assumptions about homicide 
and its distribution across demographic groups and time. For example, much 
speculation indicates that during the 1990s, homicide rates were being driven 
by the onset of juvenile “superpredators,” an emergent group of youthful, male 
offenders who callously kill at random. However, by using OCA to analyze changes 
in the offender/victim/motive/circumstance profile of homicide events across 
the 1970s, 1980s and 1990s, Miethe and Regoeczi illustrate that there is little 
evidence to support the notion of the rise of juvenile superpredators during the: 
1990s. Other common assumptions examined by Miethe and Regoeczi center on 
whether the structure of homicide incidents vary by race or gender, and whether 
conflict-motivated homicides have distinct signatures from instrumentally- 
motivated homicides. 

Another strong feature of the book involves the use of police narratives of 
homicide incidents. By supplementing the QCA analysis of homicide structures 
with these descriptive accounts, important insights on the interpersonal 
dynamics and subtle nuances of homicide events are revealed. For example, an 
apparent assumption in prior homicide research is that homicide events can be 
neatly classified as having either an instrumental or an expressive motivation. 
Consistent with that assumption, the OCA analysis by Miethe and Regoeczi 
indicates that the vast majority of frequently occurring homicide structures can 
be classified as uniquely "expressive" or "instrumental." Upon inspection of the 
homicide narratives, however, it becomes evident that certain homicide cases, 
particularly those related to youth gangs and drug-market activities, exhibit a 
mixture of instrumental and expressive motivations. Thus, the narrative data 
indicate that the classification of homicide events by motivation type may not be 
as straightforward as prior studies suggest. 

Despite the analytic strengths of the book, a couple of features detract 
from the overall package. First, the vast amount of data analysis in the central 
chapters is accompanied by little integrated theoretical discussion. Although an 
early chapter reviews potentially relevant theoretical models and the concluding 
chapter offers some discussion of findings in relation to criminological theory, 
the conceptual relevance of the research findings is scarcely mentioned in the 
rest of the book. As a consequence, it is easy to be overrun by the details of 
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the data analysis. Second, despite all the virtues of the holistic event-focused 
approach in Rethinking Homicide, sociologists who read the book are sure to 
wonder what role the broader social context might have on the structure of the 
events that are analyzed. The situational context is clearly an important aspect 
of homicide events, but these situations are embedded in a larger social and 
economic context that might strongly condition the results that emerge in this 
study. Additional discussion of the potential linkages between these structural 
and situational contexts might serve to broaden the audience for the book and 
provide a future research agenda for scholars who chose to build upon the 
foundation laid by Miethe and Regoeczi in Rethinking Homicide. 


Reviewer: Graham C. Ousey, College of William e> Mary 


Hard Work: Remaking the American Labor Movement 
By Rick Fantasia and Kim Voss 
University of California Press, 2004. 244 pages. $50 (cloth); $19.95 (paper) 


Fantasia and Voss have written a book in which they take stock of the contemporary 
labor movement, situate it in a historical context and provide a vision for its future. 
They have two goals in Hard Work. First, they seek to sensitize the reader to the 
social logic of the U.S. labor movement. Second, they explore the possibility of 
reinvigorating organized labor so it looks and acts more like a social movement. 
Theirs is a farranging exposition providing a broad historical sketch of the 
movement, from the Knights of Labor to the Justice for Janitors campaigns. They 
also offer a nuanced sociological analysis of the contemporary labor movement. 
Building on the tradition of Pierre Bourdieu, they incorporate symbolic and cultural 
themes into their analysis and highlight various institutional dynamics that have 
made the movement, at times, militant, corrupt and static. They conclude that 
despite its current malaise, a revitalized labor movement remains the best hope 
for workers to avoid dystopian future. 

Their excavation of the social logic of labor centers on twin motifs of debunking 
and contextualizing. They begin by unraveling the mythology surrounding the 
American economic model and its purported superiority to the "old" welfare 
state economies of Europe. The claims that the "new" U.S. economy is a boon 
for workers is, they argue, largely a chimera propagated by proponents of 
neoliberalism. They counter that in terms of pay, vacation time, health insurance 
and parental leave, European workers enjoy a much better standard of living 
than their American counterparts. The authors suggest this is due to the fact 
that social provisions in Europe are bestowed to broad publics and are defended 
by alliances between the state and labor movement. In the United States, the 
reverse is true. The state tends to be allied with the interests of capital and, to the 
extent that social provisions exist, they are granted privately to workers based 
on their employment in a specific workplace. This comparative analysis lends 
support to the authors' larger argument that a strong labor movement has the 


potential to improve the wellbeing of all workers. 
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Theauthors also offera newtake on the old notion of "American exceptionalism" 
-the idea that the unique character of American workers explains why an explicitly 
working class politics has not emerged in the United States. Fantasia and Voss 
contend that American workers are not unique, but rather the political and social 
environment is exceptionally hostile to the working class in general. Corporations 
are antagonistic toward unions, the state is indifferent to the interests of workers, 
and the U.S. labor movement has responded to this situation by becoming 
increasingly cautious and, as a result, ineffectual. 

One of the most refreshing aspects of the book is its cultural analysis of 
the labor movement. The authors describe the “neoliberal social project” as 
having effectively rendered the Worker invisible on a symbolic level. Standing 
in its place, the Consumer has come to represent the only legitimate economic 
actor in the new economy. The ideas and language of labor, work and class have 
been thoroughly expunged from the popular vocabulary. They assert that it is 
increasingly difficult for these consumers - who also happen to be workers - to 
see themselves as a class, let alone act as one. 

So how are unions supposed to overcome all of these obstacles? For Fantasia 
and Voss, the hard work of remaking the labor movement will require a combination 
of things. First, they point out that the movement needs a new "metaphysic" or 
vision that evokes a set of unifying ideals larger than itself that can excite the 
popular imagination. Another essential ingredient will be to reproduce, on a large 
scale, social movement unionism - those characterized by worker participation, 
non-traditional tactics, and a broader orientation to social justice. Finally, they 
contend that the space between unions is an important terrain on which to 
rejuvenate organized labor. Citing the reemergence of central labor councils, the 
activism on college campuses around the anti-sweatshop movement, and the 

. proliferation of living wage campaigns, the authors are among a small group of 
labor scholars to consider the possibility that the future of labor movement may 
not be entirely linked to trade unions. 

This book will be an excellent introduction for students and general readers 
who are unfamiliar with the U.S. labor movement - the authors' intended 
audience. Although there may not be much new here for those familiar with the 
literature on labor revitalization, labor scholars and practitioners will appreciate 
the challenging questions this book raises. For example: How might labor activists 
go about generating a new labor metaphysic? Or, how can labor overcome the 
legacy of the neoliberal social project? Or finally, how might scholars and labor 
leaders aid the development of that space between unions? Overall, this book 
makes an important contribution. It is an accessible yet critical introduction to the 
contemporary labor movement that skillfully situates it in its social, historical and 
political contexts. By sensitizing the reader to the social logic of labor, Fantasia 
and Voss have indeed demonstrated the value of Hard Work. 


Reviewer: Richard Sullivan, Illinois State University 
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Immigrants, Unions, and the New U.S. Labor Market 
By Immanuel Ness 
Temple University Press, 2005. 230 pages. $59.50 (cloth); $21.95 (paper) 


Employers, scholars and activists frequently argue that immigrant workers are 
"unorganizable" due to fear, immigration status and the fact that many immigrants 
work in the secondary and informal labor markets. In /mmigrants, Unions, and the 
New U.S. Labor Market Immanuel Ness shows us that immigrants are not only 
"organizable," but often organize themselves with or without the assistance of 
labor unions.' In fact, Ness argues that immigrants are more likely to organize 
than their native-born counterparts. 

Ness provides us with three well researched and compelling case studies of 
immigrant worker organizing in New York City — greengroceries (small corner 
grocery stores, supermarket delivery and black car service (private for-hire 
vehicles). In the first case study, thousands of mostly Mexican immigrants fought 
against exploitative working conditions (12-hour work days, no benefits and below- 
minimum wages) in small greengroceries. They founded an organization called 
the Mexican American Workers Association (AMAT). In 1998 AMAT began working 
with the Union of Needletrades Industrial and Textile Employees Local 169 (UNITE) 
and the Lower East Side Community Labor Coalition (CLC). Although workers won 
improved conditions and the implementation of an industry wide code of conduct, 
the union campaign fell apart. Ness argues that the battles between Local 169 
and UNITE's national leadership, as well as between UNITE and the UFCW on 
jurisdictional issues, ultimately undermined immigrant worker organizing. 

In the second case, supermarket delivery workers fought against their 
independent contractor status. The industry had once been unionized, but the 
Retail, Wholesale and Department Store Union (RWDSU) actively collaborated 
in de-unionizing the bagging and delivery worker classifications in the 1980s. As 
a result, West African workers who now dominated the industry labored under 
deplorable sweatshop conditions. Workers organized a two day strike and then 
approached UNITE Local 169 for assistance. UNITE helped the workers form the 
African Workers Association (AWA), but its involvement stopped when the RWDSU 
claimed jurisdictional authority over the workers. In the end workers became part 
of the RWDSU and won collective bargaining agreements. However, we again 
see that jurisdictional battles between unions and the exclusion of militants from 
contract negotiations impeded rank and file democracy along the way. 

The final organizing drive involves South Asian immigrants in the black car 
industry. Similar to workers in the other two cases, South Asian immigrant black 
car drivers worked long hours, earned low wages, and had to pay for their own 
insurance. As a relatively new industry, a result of the restructuring of the taxi 
industry, there were no pre-existing unions and no jurisdictional fights. Drivers 
started organizing for better working conditions and contacted the International 
Association of Machinists (IAM). The IAM fully supported the workers’ campaign 
and strategy. Unfortunately, the Patriot Act, passed after 9/11, severely affected 
the organizing campaign. South Asian drivers were racially profiled, treated as 
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terrorist suspects, and deported by the INS. Nonetheless, the union and the 
workers are still working to organize in the industry. 

Through these case studies, Ness does an excellent job of showing us that 
immigrants are central to the revival of the U.S. labor movement. However, 
the book would have been strengthened by additional analysis in two areas: 
immigrant worker solidarity and anti-union policies and practices. 

Ness argues that immigrant organizing is in large part a result of class solidarity 
based on ethnic identity, shared workplace experiences, and isolation. | agree, 
but it is also true that employers also use these tactics to "divide and conquer” 
workers. Workers are often divided along nationality and community lines. Did 
employers in any of these case studies try to divide workers against each other? 
If so, what explains the ability for workers to come together in solidarity with 
each other? Is ethnicity, and shared workplace experience enough to create 
solidarity? Furthermore, | wanted to hear about whether there was solidarity 
between workers across the three campaigns. 

| agree with Ness's overall assessment that unions need to support the 
organizing efforts of immigrant workers whether or not they are coordinated 
by unions or in independent immigrant organizations. | also agree with his 
conclusion that union democracy and rank-and-file militancy will strengthen the 
labor movement. However, | wanted to see more analysis of the impact of anti- 
union employers and anti-union state policies in each of the case studies. For - 
example, in the case of black car drivers, it was post-9/11 State policies, not 
union jurisdictional battles, that ultimately hindered the campaign. This suggests 
that even when unions are democratic and there is no fighting among them, 
workers must fight against capital and state policies. 

These concerns aside, /mmigrants, Unions, and the New U.S. Labor Market is a 
must read for scholars and activists interested in social movements, immigrants 
and labor. Not only is the book well written and carefully researched, but it also 
provides hope about the future of the labor movement. 


Note 
1. Fora superb examination of immigrant worker organizing campaigns in Cali- 


fornia, see Milkman, Ruth. Editor. 2000.0rganizing Immigrants: The Chal- 
lenge for Unions in Contemporary California. Cornell University Press. 


Reviewer: Carolina Bank Muñoz, Brooklyn College-CUNY 





Frames of Protest: Social Movements and the Framing Perspective 
Edited by Hank Johnston and John A. Noakes 


Rowman & Littlefield, 2005. 269 pages. $80 (cloth); $32.95 (paper) 


In the 1980s, as the social sciences in the United States began to make their 
way around the cultural turn, the study of social movements ventured into the 
theories of frame analysis. Borrowing from the legacies of Bateson and Goffman, 
scholars including William Gamson, David Snow and Robert Benford propelled 
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the study of social movements into a new paradigm. This perspective brought 
social-psychological questions back to the forefront of movement analysis by 
focusing on the ways that movement actors actively negotiate cultural codes 
to construct interpretations of social problems and the goals of resistance. In 
so doing, the framing perspective sought to add a cultural and psychological 
component to the more strictly organizational and structural analysis of the 
"resource mobilization" paradigm, which focused upon movement resources, 
strategy, and political opportunities. More than 20 years have passed since this 
intervention, and analysis has, like any paradigm, become a body of theory that 
is, at once, indispensable for its insights, diverse in its manifestations, and the 
subject of much critique and debate. 

Frames of Protest, edited by Hank Johnston and John A. Noakes, is an engaging 
and companionable text that brings together a variety of social scientists to both 
reaffirm and expand the theories of the framing perspective for a new generation 
of movement researchers. Throughout his career, Johnston has demonstrated a 
consistent interest editing coherent volumes that seek to create dialogue across 
theoretical boundaries. Frames of Protest is no different. It has a strong coherence 
and each chapter contributes to the argument that frame analysis, while invaluable, 
is in need of greater reconciliation with other theories of social movements. 

Frames of Protest is organized into an introductory chapter and four sections. 
Johnston and Noakes provide a comprehensive introduction to the framing 
literature by identifying its genealogy, its core concepts, and some of the debates 
that have preoccupied its practitioners. The first section offers three insightful 
but very different case studies, which, taken together, suggest that the framing 
perspective offers robust insights into the interactive, dramaturgical, and culturally 
embedded complexities of movement development. In doing so, each affirm the 
central role framing processes play in movement mobilization, independent of 
available resources or structural opportunities. 

Parts two and three address how frames are weapons in what Tarrow describes 
as a "struggle for cultural supremacy" between power holders and movements. 
In part two, contributors discuss non-movement framing, particularly that of 
states and media, and their impact on movement frames. Framing processes 
are revealed as ongoing, dynamic interactions between movements, their 
opponents, and a broader culture — each of which shape multiple structures of 
political opportunity. 

Part four includes five chapters that debate and refine the framing perspective, 
with two principal goals: the marriage of the framing perspective with traditions 
of ideology critique, and the development of comparative frame analysis. The 
centerpiece of this section isa friendly but not dispassionate debate. Pamela Oliver 
and Johnston contend that “frame” often becomes a substitute for “ideology,” 
which has the effect of over-emphasizing the micro-level social psychological 
components of meaning construction and omitting the more systemic cultural 
structures of ideology. Consequently, frame theory can become depoliticized and 
thus have little interpretive power over the macro-social structures of norms and 
worldviews that shape movements. They therefore call for reconciliation between 
theories of framing and ideology. Snow and Benford retort that framing is a more 
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dialogical process than Oliver or Johnston recognize, one that engages with 
multiple forms of meaning and social context, including the ideological. Snow 
and Benford therefore claim that frame theory provides more precise analytical 
perspectives with which to understand political context and the precise function 
of ideology. No easy resolution is found, which is symptomatic of the need for 
greater dialogue between currently disparate theoretical traditions. 

Frames of Protest is an expansive and well-constructed volume that would 
be useful for students, scholars, and activists interested in social movement 
analysis. However, it is not without limitation. First, despite provocative case 
studies, they frequently are unclear bases for generalizations regarding framing 
processes in other contexts. Second, while much space is dedicated to frame 
theory's dialogue with resource mobilization and ideology critique, comparatively 
little is given to other dialogues with critical theory — notably neo-Gramscian and 
Foucaultian traditions — that may further expand each theory's powers to analyze 
movements and cultural structures. This said, Frames of Protest is a critically 
important compilation that both reinvigorates and challenges the framing 
tradition, and its open-ended quality invites all of us to participate in an ongoing 
dialogue about the processes of social change. 


Reviewer: Joe Bandy, Bowdoin College 


Inequality and the State 
By John Hills 
Oxford University Press, 2004. 284 pages. $125 (cloth); $37.50 (paper) 


Inequality and the State provides a detailed overview of the vast body of research 
on income inequality in developed countries that has emerged in the past decade. 
Hills focuses on a single case: that of Britain from the end of the 1970s to the 
present. The author brings to bear a wide variety of data sources to examine 
the widening income inequality in Britain during this period, the role that state 
taxation and spending policies has played in these developments, and changing 
public knowledge of and attitudes toward these issues. Each chapter takes up 
a different aspect of inequality and, to the extent possible, compares the British 
experience with those of other developed nations. Together the chapters provide 
an in-depth and compelling look at the relationship between inequality, public 
opinion, and state action. 

Hills begins by examining trends in income inequality and poverty in Britain 
over the past 30 years. The chapters on this topic offer useful discussions of how 
inequality and poverty are typically measured, how the distribution of economic 
resources has changed during the time period, the explanations generally offered 
for these changes in resource distribution, and the relationship between societal- 
level inequality and individual social mobility. The chapters in the second part of 
the book consider the contributing role that the state has played in these trends 
through its taxation and spending policies. In the final chapters Hills considers 
possible future trends in income inequality in Britain and the pressures and 
constraints that the state is likely to face in coming decades. 
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One of the more useful discussions in the book relates to the issue of 
privatization of state services. In contrast to many analyses which measure 
privatization along a single axis, Hills proposes that privatization needs to be 
considered along three dimensions: provision, finance and decision. He then 
argues that by examining privatization in each of these spheres, welfare activity 
in Britain (over 20 years of mostly conservative governments) does not conform 
to the simple portraits of "privatization" typically offered by other analysts. 
Between 1979 and 1999 while overall welfare activity grew by 86 per cent in 
real terms, shifts in the "welfare mix" were relatively modest. While the growth 
of the private sector was important, especially in terms of private provision, the 
author finds that welfare activity is the "pure public" sector (with a combination 
of public finance, provision and control) fell only 3 percentage points during the 
period from 52 to 49 percent of the total of welfare provisions. Such findings 
challenge more conventional analyses of privatization and offer new insights into 
the phenomenon. 

In addition to economic and political statistics, the author also presents data 
in each chapter on public knowledge of and attitudes towards patterns of income 
inequality and poverty and the desired role of the state. Using data from the 
British Social Attitudes survey, he finds that the public largely underestimates 
the extent of income inequality and poverty within British society, that there is 
significant misperception about where spending on cash benefits goes, and that 
individuals generally support higher levels of welfare benefits for those at the low 
end of the income distribution. 

While the public opinions he presents are intriguing, | take issue with Hills' 
interpretation of these data. In particular, the author relies on three questionable 
assumptions. First, Hills places too much weight on the role that public attitudes 
play in determining political action. While it is true that public opinion polls often 
shape the actions of politicians in significant ways, these are not the only factors 
that state actors take into account. Furthermore, the author assumes that public 
perceptions and attitudes are synonymous with political pressure, when indeed it 
is not clear whether (and when) responses to large-scale survey questions in fact 
correspond to conscious and deliberate political action on the part of citizens. 
Finally, the author assumes that the relationship between public attitudes and 
state action is unidirectional. He does not address the fact that, just as public 
opinion can shape the actions of state officials, so too can state activities shape 
the knowledge and opinions of the general public. In his pluralistic model of 
the state, the role that the state itself can play in shaping the perceptions of its 
citizens is largely overlooked. 

Despite these questions of interpretation, /nequality and the State is a 
worthwhile review of contemporary research on income inequality, poverty, social 
taxation and welfare state policies. With its careful explanations of concepts, 
generous summary points at the end of each chapter, and guides to further 
reading, the text is well-suited for an introductory graduate student course on 
income inequality. And despite its focus on Britain, the chapters offer general 
discussions that could resonate with students outside the British context. 


Reviewer: Kevin Riley, University of California, Los Angeles 
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Shock Cities: The Environmental Transformation and Reform 
of Manchester and Chicago 

By Harold L. Platt 

University of Chicago Press, 2005. 628 pages. $49 (cloth) 


| am certain Harold Platt wishes his book were not so awfully timely. His 
encyclopedic Shock Cities, a comparative environmental history of Chicago 
and Manchester from the early 1800s to the 1920s, is positively brimming 
with contemporary relevance, and makes for eerie reading in light of Hurricane 
Katrina. He writes, for example, that in 1872 in Manchester a flood “submerged 
homes and shops from the working-class district of Ancoats to the Duke of 
Bridgewater's wharves at Castlefield, where barges were lifted and deposited 
on land far from their moorings.” Platt thus joins a growing list of commentators 
— among them Eric Klinenberg, Mike Davis and John Barry - whose work on 
the social and political dynamics of environmental disaster in times and places 
variously removed from contemporary New Orleans has something to tell us 
about modern-day catastrophes. 

What is that something? Platt's version of a central trope in this literature is 
the paradox of progress: while economic growth and technological advances 
held out the promise of improving the quality of people's lives, that promise 
was continually subverted by corrupt or greedy public officials and naively 
overconfident engineers. Too often, it was in too many people's interest to 
underplay the serious environmental despoliation caused by urban growth and 
by resource-hungry and profligate industries like brewing and meatpacking. 
Politicians and planners, for example, consistently underestimated the waste 
the cities generated, with dire consequences in Chicago in particular as 
wastewater repeatedly contaminated the drinking water supply, leading to 
recurring outbreaks of typhoid and other diseases. Thus, the extraordinariness 
of disaster is misleading, because more permanent undercurrents of inequality, 
negligence and hubris make disasters possible. 

Subsumed within this overarching paradox-of-progress narrative are three 
intertwined storylines. The first details the massive environmental consequences 
of the two cities” “industrial ecologies." These include pollution of the air and 
water, illness, the re-engineering of river flows, and the spread of urban influences 
over vast stretches of countryside. These cities did not merely knit together 
` and reshape their immediate regions. They also spread their tentacles across 
hundreds or thousands of miles — to the Lake District for drinking water, India and 
the American South for cotton, the mouth of the Mississippi for shipping. 

A second storyline is political, focusing on corruption, the unresponsiveness 
of politicians and the failure to build a polity that protected workers' health and 
interests. Platt richly details the shortcomings of city planning and infrastructural 
development. One vignette tells the story of a booze-filled 1871 expedition by city 
fathers to celebrate the "deep cut" canal that reversed the flow of the Chicago 
River. This story, however, simultaneously displays both the strengths and the 
weaknesses of the book. Despite his wish to the contrary, Platt is better able to 
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show the impact of industrialization and public policy on workers than to make 
them into full participants or to invest them with much life. 

In part, this failure derives from Platt's third subplot: the rise of middle class 
professionals who stand between city politicians and capitalists on the one 
hand and workers on the other. Budding professions and their claims to expert 
knowledge are in every chapter, and engineers, planners, scientists, reformers 
and doctors are Platt's main actors. The period was a golden age for the middle 
class, from the founding of the Manchester Statistical Society in 1833, to the 
rise of the Department for the Inspection of Steam Boilers and Steam Plants in 
Chicago city government. All the while, the members of these new occupational 
groups jockeyed with each other for status. In particular, Platt argues that the 
prestige of "engineers" (who overshadowed scientists) was bad for cities, as 
they generated infrastructure proposals that were doomed to environmental 
inadequacy and failure. 

The book is a massive undertaking, for the reader as well as the writer 
— 500 pages of text, plus a hundred more of notes. It is a treasure trove of 
information about 19th century industrial development and the origins of the 
urban infrastructural planning — sewers and sewage treatment, streets, canals, 
drinking water systems - that we too often take for granted. But there are some 
shortcomings. First, the comparative analysis is not systematic enough. The 
book sometimes reads like two histories squashed together. It is not always 
clear which differences are locally driven and which are national. Second, and 
related, the book would have benefited from a firmer editorial hand. The chapter 
on brewing, for example, is more about London than Manchester. Finally, from a 
sociologist's point of view, it is odd that a book about Chicago's urban ecology in 
the industrial era should include no mention of Robert Park and Ernest Burgess. 

Despite these flaws, this book will be valuable to urbanists, environmental 
sociologists and political scientists. As research into the underfunding of levee 
construction and the destruction of the Louisiana wetlands is now showing us, it 
is the slow accretion of a thousand environmental cuts that puts us in a position 
to suffer "the big one." Harold Platt's Shock Cities is a comprehensive guide to the 
gradual creep of environmental destruction and how it leaves us all worse off. 


Reviewer: Jerome Hodos, Franklin and Marshall College 





Analyzing Inequality: 

Life Chances and Social Mobility in Comparative Perspective 
Edited by Stefan Svallfors 

Stanford University Press, 2005. 175 pages. $45 (cloth) 


For more than three decades, Robert Erikson has made important contributions 
to the study of social inequality, particularly the analysis of intergenerational 
mobility through a comparative lens. Analyzing Inequality: Life Chances and 
Social Mobility in Comparative Perspective is a collection of papers that were 
presented at a 2003 symposium held in his honor. The contributors include some 
prominent figures in stratification research and the scope of Erikson's work is 


1054 ° Social Forces Volume 85, Number 2 * December 2006 


intended to be a unifying theme for the volume. However, the end product is five 
good articles rather than one good book. 

In the introduction, Stefan Svallfors briefly summarizes the progress that has 
been made in the study of social inequality in terms of data, methods, and theory. 
He comes to the uncontroversial conclusion that while social scientists have 
made substantial advances in analyzing stratification over the last few decades, 
significant challenges remain. 

Karl Ulrich Mayer's chapter provides a useful overview of the development of 
life course inequality research on the way to discussing the options and challenges 
in studying how institutional arrangements affect life course trajectories. He 
recommends a comparative approach but warns against the oversimplification 
of the nation as an analytical unit. Classifying countries using an economic 
development scale or a welfare state typology fails to link specific institutional 
mechanisms with life course outcomes. Mayer concludes that the cross-national 
approach is most informative when combined with detailed analyses of each 
nation's specific institutional arrangements. 

John Goldthorpe opens his chapter by challenging the skeptics who claim that 
the notion of “progress” in research is a false one. However, the postmodernists 
and other detractors he argues against are simply straw men set up for the 
purpose of providing an overview of the progress made in social mobility research. 
As with Svallfors's introduction, the advances in data, methods and theory over 
the past half-century are beyond question. More interesting is Goldthorpe's 
question as to why social mobility research has made more progress than most 
other areas of sociology. He credits the institutional support of the Research 
Committee on Social Stratification and Mobility (RC28) of the International 
Sociological Association with serving as a unique focusing lens for those working 
in the field. 

.. Taking up a different dimension of Erikson's work, Tony Atkinson reflects on 
the interaction between social science and public policy in the development of 
international social indicators. (The symposium from which these papers are 
taken was in honor of Erikson's service as the Secretary General of the Swedish 
Council for Working Life and Social Research.) Reviewing the development 
of the European Union's Primary Indicators for Social Inclusion and the nearly 
contemporaneous Millennium Development Goals adopted by the United 
Nations, Atkinson raises a number of important questions and concerns about 
the interpretation and use of these measures. 

The last two papers in the volume depart from the first three in that they are 
empirically grounded and do not employ a comparative approach. Annemette 
Sorenson asks whether changes in family structure observed in Western nations 
in the past 50 years have had an effect on the stratification systems of those 
nations. Drawing exclusively on research findings from the United States, she 
concludes that the decline of the traditional nuclear family has resulted in both 
increased inequality and increased mobility. The latter result is due to the fact 
that single (and remarried) parents are less able to confer advantages on their 
children and therefore, the decreasing proportion of two-parent families weakens 
the overall origin-destination association. However, she warns, the American 
pattern may not be found to the same extent in other nations, especially those 
with more supportive welfare systems. 
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While informative, Sara Arber's contribution is out of step with the remainder 
of the volume, as it reports new empirical results from a single nation. Using data 
from the British General Household Survey, she examines whether gender and 
marital status in older age affect the likelihood of smoking and excessive alcohol 
consumption, and whether class, material well-being and social connectedness 
mediate this relationship. Her results demonstrate the complex interplay of gender 
and marital status in later life and how they lead to differential life outcomes. 

Each of the chapters in this volume addresses some dimension of social 
inequality in a useful manner, but the book neither comes together as a whole 
nor is there much new information presented. The strongest parts of the book 
are the efforts to step back and reflect on some of the middle range issues and 
patterns in the study of stratification. A number of the chapters provide concise 
introductions to some of the issues in the study of social inequality, but with a 
relatively narrow focus. Each individual article will be useful to readers seeking to 
learn more about specific topics but the book as a whole lacks coherence. 


Reviewer: Seth Ovadia, Bowdoin College 


The Poorhouse: America's Forgotten Institution 
By David Wagner 
Rowan & Littlefield, 2005. 179 pages. $66 (cloth); $24.95 (paper) 


No American institution carries the stigma of the poorhouse. Like everyone else, 
most historians have shunned and tried to forget it. Now, David Wagner has 
redressed the neglect with the first recent book-length account of poorhouse 
history. Wagner makes a compelling case for the importance of poorhouse history, 
which, for him, is not antiquarian. Rather, it provokes instructive reflections on 
how America treats its poor, elderly, and homeless citizens today. 

Wagner concentrates on six New England poorhouses, or poor farms as 
they were often called, which he situates in the broader context of poorhouse 
history. He focuses less on early 19th century poorhouse origins than on the 
decades from the 1890s through the 1940s - a largely forgotten but significant 
era preceding the closing of most poorhouses after mid-century. In poorhouse 
records unearthed in various archives, newspapers, town and city documents, 
and interviews, Wagner has turned up rich examples that illuminate daily life, 
administrative practice, staff roles, and political conflicts. With these sources he 
has written a clear and lively narrative full of arresting anecdotes and quotations, 
with comments on historiography and related matters relegated to footnotes. 
At 150 pages of text, the book will be accessible to a broad audience and useful 
for teaching. His explicit reflections on the implications of poorhouse history for 
interpreting the present will prove especially thought-provoking and should spark 
lively debate. 

Wagner emphasizes two themes. One is the agency of inmates who often 
subverted official rules and purposes and, even, on occasion tried to orchestrate 
public opinion to favor their interests. The mixed purposes poorhouses served 
constitute the other theme. Poorhouses were at once punitive and rehabilitative, 
deterrent and humanitarian, short-term residences and long-term homes. And 
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they served diverse populations. Despite important differences from one another, 
most of them housed men temporarily out of work; the sick, mentally ill and 
long term indigent; unwed mothers; and the elderly. Like David Rothman, who 
writes about other Progressive-era institutions in Conscience and Convenience, 
a book he does not cite, Wagner roots the difficulties and failures of poorhouses 
in the contradictions at their core. In this argument, which applies more broadly 
to America's welfare state, Wagner is surely correct. 

Nonetheless, Wagner underemphasizes the developing differentiation among 
19th century institutions. With the creation of mental hospitals, public hospitals 
and orphanages, poorhouses became increasingly residual, homes for the 
elderly and others not serviced elsewhere in burgeoning, increasingly specialized 
state institutional networks. Wagner cannot analyze the impact of institutional 
specialization on poorhouses because he makes only limited use of the sources 
necessary for charting their demography: poorhouse registers. He discovered 
registers for at least two of his six poorhouses but used them in a cursory, almost 
anecdotal rather than analytic manner. He did not, that is, construct an inmate 
database which would have showed trends over time in age, length of residence, 
gender and other variables — data essential to supporting his argument. Indeed, 
he ignored gender ratios which, tellingly, in all poorhouses | have studied tilted 
strongly toward men. 

Wagner's rather fuzzy periodization also neglects ideological and policy shifts 
in relief (the 19th century term for “welfare”) — for one, the gradual displacement 
of the harsh doctrines and practices of Scientific Charity by Progressive-era 
reform in the late 19th and early 20th century. Contrary to Wagner's assertion, 
between 1878 and the early 1890s, 10 of the nation's 40 largest cities abolished 
public outdoor relief and others reduced it drastically. By the early 20th century, 
it had started to creep back into public policy. Except for passing reference to 
mothers' pensions, Wagner does not explore the important implications of this 
ideological and policy shift for poorhouses. Nor does he adequately explore the 
impact of the New Deal. Indeed, he barely mentions Old Age Assistance such 
as Social Security, a product of the 1935 Economic Security Act, and for many 
years far more important to the elderly. He also omits mention of the New Deal's 
crucial work-relief programs, whose impact on poorhouses, as far as | know, 
remains unstudied. 

Morethan any other part of his book, Wagner's ruminations onthe links between 
past and present, the subject of his last chapter, will probably occasion the most 
discussion and controversy. Wagner thinks that historians have exaggerated the 
deficiencies of poorhouses and prematurely written their obituaries. The 1950s 
and 1960s witnessed not their demise, but a hiatus in their centuries' old history. 
In homeless shelters and other late twentieth century institutions, Americans 
have reinvented poorhouses, which, Wagner believes, are worse than the ones 
he studied in the past. In its treatment of the poor, he claims, Americans have 
moved backward not forward. This is an enormous and consequential assertion. 
While | am not convinced, Wagner offers a reasoned argument supported with 
some telling evidence. His book is a fine springboard for an important debate. 


Reviewer: Michael B. Katz, University of Pennsylvania 
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Putting Children First: How Low-Wage Working Mothers Manage Child Care 
By Ajay Chaudry 
Russell Sage Foundation, 2004. 341 pages. $39.95 (cloth); $19.95 (paper) 


Since the passage of the welfare reform bill in 1996, the issue of moving single 
mothers from welfare to work has been a policy priority. Yet there has always been 
a formidable obstacle in attempting to achieve this goal — the lack of affordable, 
dependable, and decent quality child care for such families. Examining this 
fundamental problem is the central theme behind Putting Children First by Ajay 
Chaudry. As Chaudry states, “With the changes following welfare reform leading 
to more single mothers working and with a continuing shortage of child care 
funds, what are low-income mothers doing about child care?” 

Chapter 1 begins by providing an overview and background of the study. The 
sample consisted of 42 low-income single mother families in New York City. 
They were evenly divided between African-American and Latina women. At the 
time of first contact, mothers had a child between the ages of 2 and 3, and all 
were employed at low wage jobs. They were then followed over the course of 
two to three years. During this time the women in the study moved in and out 
of different jobs as well as different types of child care arrangements. Chaudry 
conducted at least three semistructured in-depth interviews with each mother 
during the study period, along with more informal conversations and participant 
observation to document how their situations changed over time. The interview 
material is then used throughout the chapters in order to provide insight and 
understanding into the problems and issues facing these women as they juggle 
their child care needs with their employment needs. 

Chapter 2 looks at the types of child care arrangements that mothers made for 
their young children. Often times they had to settle for whatever they could find 
in a relatively short period of time, rather than what they would have preferred. 
Their options were severely constrained by the care that was available in their 
communities, what they could afford, and what they could quickly arrange as they 
transitioned to work. The types of child care most commonly used were kin and 
informal care, family day care, center-based care, and father care arrangements. 

Chapter 3 examines the spell dynamics of child care use. Chaudry finds that 
the length of each child care spell is relatively short — 7.5 months on average. In 
addition, a majority of children experienced five or more primary nonmaternal care 
spells between the ages of 0 and 4. One of the major reasons for the frequency 
and short term nature of these spells was the inherent instability and changing 
circumstances that mothers faced at work as well as in other aspects of their 
lives. Throughout the book, Chaudry constructs several timelines that graphically 
illustrate the fluctuating circumstances of work, child care, and housing for 
specific families. 

Chapter 4 details the concerns that mothers expressed regarding their 
children’s child care arrangements. These included having to start a child too 
early in care, the difficulty in finding a first child care arrangement, and children 
spending too much time in child care as well as being in too many different 
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child care arrangements. Chapter 5 addresses how mothers coped, adapted and 
learned over time with respect to dealing with these concerns. 

The final chapter turns to the author's policy suggestions for improving 
and strengthening the system of child care for low income mothers and their 
children. These include increasing available care options and supports for 
families with infants, making investments in early education (such as Head 
Start) to encourage childhood development, improving access to child care 
systems and promoting continuity in care, and providing child care assistance 
to all eligible low-income families. 

The book concludes with three very helpful appendixes. Appendix A provides 
a review of the literature regarding child care and low-income families. Appendix 
B describes in further detail the research design and methodology used in the 
study, while Appendix C contains the interview guides. 

Overall, this an extremely insightful study that provides great depth and 
understanding into the difficulties facing low-income single parents as they 
struggle to balance their work and child care needs and concerns. The interview 
material is used effectively throughout, such that the reader is left with a much 
greater appreciation into the frustrations and complexity facing these mothers as 
they attempt to bridge the world of work and child care. As Chaudry notes, "In 
the course of becoming workers, these low-income mothers have found that the 
most complicated challenge they face is not finding or keeping a job, but finding 
and keeping reliable child care they can trust and afford." Putting Children First is 
a skillful and memorable book that reveals this complicated challenge. 


Reviewer: Mark R. Rank, Washington University 
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Call for Papers: Age Discrimination 


` 


Social Forces seeks papers for a special section on the sociology of age 
discrimination. Original papers are invited for consideration that address age 
discrimination and related phenomena across the life course and in a range of 
social contexts. Papers might focus on the social processes, institutions and 
structures that cause or constitute age discrimination, or on social psychological 
and other consequences of age discrimination. Age discrimination is behavior 
and hence can be distinguished from ageist attitudes, but papers that analyze the 
relationship between ageist attitudes and age discrimination will be welcome. We 
are interested in papers dealing with age discrimination in societies other than 
the United States and papers that analyze the intersection of age discrimination 
and other forms of discrimination based on gender, race and class. Papers that 
make a theoretical contribution in this area will be particularly welcomed, as will 
empirical papers based on any sound methodological approach. 


The special section will be edited by Victor W. Marshall, Department of Sociology, 
UNC at Chapel Hill and Director of the UNC Institute on Aging. Papers will be reviewed 
by the section editor, the Soc/a/ Forces editor and at least one blind reviewer. 


The deadline for submitting papers is Feb. 1, 2007. Manuscripts should be 
limited to 5,000-9,000 words including references and endnotes and should be 
e-mailed as a Microsoft Word attachment to Social Forces(Qunc.edu. Anytables . 
or figures must be editable in Microsoft Word or Excel. Do not use any automatic 
formatting feature. Submission fees for this section are waived. /t is essential that 
you note that the submission is for the special section on age discrimination. 


Inquiries about the review process or about a particular manuscript should be 
directed to Victor Marshall at victor marshall@unc.edu, or (919) 843-8067. 


Call for Papers: New Directions in Race Research 


Social Forces seeks papers for a special section on race research planned for 
the December 2007 issue. We are soliciting original papers that address issues 
of race, racial inequality, racism and racial identity construction from U.S. and 
international perspectives. Papers might address new developments in racial 
inequality in housing, education or health; the creation, maintenance and 
changing definition of racial categories; the racial dynamics of colorblindness: and 
racism within the context of immigration, neo-colonialism and the environment. 
We welcome papers that expand our theoretical understanding of race and are 
particularly interested in empirical research that challenges, expands or redirects 
existing race scholarship. 


The special section will be edited by Charles A. Gallagher, Department of 
Sociology, Georgia State University. Papers will be peer reviewed. 


The deadline for submitting papers is April 2, 2007. Manuscripts should be 
limited to 5,000-9,000 words including references and endnotes and should be 
e-mailed as a Microsoft Word attachment to Social_Forces@unc.edu. Any tables 
or figures must be editable in Microsoft Word or Excel. Do not use any automatic 
formatting feature. Submission fees for this section are waived. 
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